搜索结果：Training-Free

共找到 20 条结果

高级筛选 ▾

iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.

PubMed2026-04-06作者：Sun L, Cao J, Xie J

Stable Diffusion has demonstrated strong image synthesis ability to given text descriptions, suggesting it to contain strong semantic clue for grouping objects. The researchers have explored employing Stable Diffusion for training-free segmentation. Most existing approaches refine cross-attention map by self-attention map once, demonstrating that self-attention map contains useful semantic information to improve segmentation. To fully utilize self-attention map, we present a deep experimental analysis on iteratively refining cross-attention map with self-attention map, and propose an effective iterative refinement framework for training-free segmentation, named iSeg. Our iSeg introduces an entropy-reduced self-attention module that utilizes a gradient descent scheme to reduce the entropy of self-attention map, thereby suppressing the weak responses corresponding to irrelevant global information. Leveraging the entropy-reduced self-attention module, our iSeg stably improves cross-attention map with iterative refinement. Further, we design a category-enhanced cross-attention module to generate accurate cross-attention map, providing a better initial input for iterative refinement. Extensive experiments across different datasets and diverse segmentation tasks (weakly-supervised semantic segmentation, open-vocabulary semantic segmentation, unsupervised segmentation, and mask generation on synthetic dataset) reveal the merits of proposed contributions, leading to promising performance. For unsupervised semantic segmentation on Cityscapes, our iSeg achieves an absolute gain of $3.8\%$ in terms of mIoU compared to the best existing training-free approach in literature. Moreover, our proposed iSeg can support segmentation with different kinds of images and interactions, and also be used as a post-processing, or in different frameworks, to improve training-free segmentation. The project is available at https://linsun449.github.io/iSeg.

IEEE transactions on pattern analysis and machine intelligence

搜索结果：Training-Free

iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.

Enhancing Zero-Shot Adversarial Robustness of Vision-Language Models With Training-Free Adaptive Feature Movement.

Training-free detection and 6D pose estimation of unseen surgical instruments.

VISTA-3D : Training-free unfolding for vision-based 3D object detection.

TF-VSF: A Novel Training-Free Visual-Semantic Fusion Rare Medical Morning Glory Syndrome Diseases Severity Assessment Method.

Flow Matching Posterior Sampling: A Training-free Conditional Generation for Flow Matching.

Training-Free Ultra Small Model for Universal Sparse Reconstruction in Compressed Sensing.

RSTFA: Efficient Training-Free Human-Preference Alignment via Rejection Sampling for Text-to-Image Diffusion Models.

A Training-Free Paradigm for Data-Scarce Maritime Scene Classification Using Vision-Language Models.

Training-Free Quantum Architecture Search Under Realistic Noise via Expressibility-Guided Evolution.

FreeMD: Training-free multi-domain text-to-image generation with any control.

Detail++: Training-Free Detail Enhancer for T2I Diffusion Models.

Defect-Intent Ambiguity Addressing for Training-Free Deterministic PCB Defect Localization via Template Selection and Dissimilarity Mapping.

SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation.

A high-performance training-free pipeline for robust random telegraph signal characterization via adaptive wavelet-based denoising and Bayesian digitization methods.

Training-Free Open-Set Domain Adaptation With Vision-Language Models.

MultiTabPFN: Codebook-based extensions of TabPFN for high-class-count tabular classification.

An artificial intelligence framework for universal landmark matching and morphometry in musculoskeletal radiography.

"Stones From Other Hills Can Polish Jade": Zero-Shot Anomaly Synthesis via Cross-Domain Anomaly Injection.

RA-COD: Retrieval-Augmented Camouflaged Object Detection.

搜索结果：Training-Free

iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.

Enhancing Zero-Shot Adversarial Robustness of Vision-Language Models With Training-Free Adaptive Feature Movement.

Training-free detection and 6D pose estimation of unseen surgical instruments.

VISTA-3D : Training-free unfolding for vision-based 3D object detection.

TF-VSF: A Novel Training-Free Visual-Semantic Fusion Rare Medical Morning Glory Syndrome Diseases Severity Assessment Method.

Flow Matching Posterior Sampling: A Training-free Conditional Generation for Flow Matching.

Training-Free Ultra Small Model for Universal Sparse Reconstruction in Compressed Sensing.

RSTFA: Efficient Training-Free Human-Preference Alignment via Rejection Sampling for Text-to-Image Diffusion Models.

A Training-Free Paradigm for Data-Scarce Maritime Scene Classification Using Vision-Language Models.

Training-Free Quantum Architecture Search Under Realistic Noise via Expressibility-Guided Evolution.

FreeMD: Training-free multi-domain text-to-image generation with any control.

Detail++: Training-Free Detail Enhancer for T2I Diffusion Models.

Defect-Intent Ambiguity Addressing for Training-Free Deterministic PCB Defect Localization via Template Selection and Dissimilarity Mapping.

SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation.

A high-performance training-free pipeline for robust random telegraph signal characterization via adaptive wavelet-based denoising and Bayesian digitization methods.

Training-Free Open-Set Domain Adaptation With Vision-Language Models.

MultiTabPFN: Codebook-based extensions of TabPFN for high-class-count tabular classification.

An artificial intelligence framework for universal landmark matching and morphometry in musculoskeletal radiography.

"Stones From Other Hills Can Polish Jade": Zero-Shot Anomaly Synthesis via Cross-Domain Anomaly Injection.

RA-COD: Retrieval-Augmented Camouflaged Object Detection.