搜索 — ResearchTracker

Attribution maps for semantic segmentation are almost always judged by visual plausibility. Yet looking convincing does not guarantee that the highlighted pixels actually drive the model's prediction, nor that attribution credit stays within the target region. These questions require a dedicated evaluation protocol. We introduce a reproducible benchmark that tests intervention-based faithfulness, off-target leakage, perturbation robustness, and runtime on Pascal VOC and SBD across three pretrained backbones. To further demonstrate the benchmark, we propose Dual-Evidence Attribution (DEA), a lightweight correction that fuses gradient evidence with region-level intervention signals through agreement-weighted fusion. DEA increases emphasis where both sources agree and retains causal support when gradient responses are unstable. Across all completed runs, DEA consistently improves deletion-based faithfulness over gradient-only baselines and preserves strong robustness, at the cost of additional compute from intervention passes. The benchmark exposes a faithfulness-stability tradeoff among attribution families that is entirely hidden under visual evaluation, providing a foundation for p

UniCSG: Unified High-Fidelity Content-Constrained Style-Driven Generation via Staged Semantic and Frequency Disentanglement

arXiv2026-04-20作者：Jingwei Yang, Ruoxi Wu, Wei Shen

Style transfer must match a target style while preserving content semantics. DiT-based diffusion models often suffer from content-style entanglement, leading to reference-content leakage and unstable generation. We present UniCSG, a unified framework for content-constrained, style-driven generation in both text-guided and reference-guided settings. UniCSG employs staged training: (i) a latent-space semantic disentanglement stage that combines low-frequency preprocessing with conditioning corruption to encourage content-style separation, and (ii) a latent-space frequency-aware detail reconstruction stage that refines details via multi-scale frequency supervision. We further incorporate pixel-space reward learning to align latent objectives with perceptual quality after decoding. Experiments demonstrate improved content faithfulness, style alignment, and robustness in both settings.

搜索结果：Pixel-faithful

Toward Faithful Segmentation Attribution via Benchmarking and Dual-Evidence Fusion

UniCSG: Unified High-Fidelity Content-Constrained Style-Driven Generation via Staged Semantic and Frequency Disentanglement

From Local to Global to Mechanistic: An iERF-Centered Unified Framework for Interpreting Vision Models

Exploiting Semantic and Pixel Representations for Ultra-Low Bitrate Image Compression

Pixal3D: Pixel-Aligned 3D Generation from Images

GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution

Mitigating Content Shift and Hallucination in GenAI Image Editing via Structural Refinement

H-Sets: Hessian-Guided Discovery of Set-Level Feature Interactions in Image Classifiers

Mid-Infrared Single-Photon Compressive Spectroscopy

Segment Anything with Robust Uncertainty-Accuracy Correlation

When to Call an Apple Red: Humans Follow Introspective Rules, VLMs Don't

An Implementation of the Crack Topology Score with Extensions

FG-Portrait: 3D Flow Guided Editable Portrait Animation

EditTransfer++: Toward Faithful and Efficient Visual-Prompt-Guided Image Editing

On the Complexity-Faithfulness Trade-off of Gradient-Based Explanations

Pixel-level Certified Explanations via Randomized Smoothing

Markov-Renewal Single-Photon LiDAR Simulator

Faithful Counterfactual Visual Explanations (FCVE)

TraNCE: Transformative Non-linear Concept Explainer for CNNs

STR-Match: Matching SpatioTemporal Relevance Score for Training-Free Video Editing