搜索结果：Training-Free

共找到 20 条结果

高级筛选 ▾

Training-Free Image Editing with Visual Context Integration and Concept Alignment

arXiv2026-04-06作者：Rui Song, Guo-Hua Wang, Qing-Guo Chen

In image editing, it is essential to incorporate a context image to convey the user's precise requirements, such as subject appearance or image style. Existing training-based visual context-aware editing methods incur data collection effort and training cost. On the other hand, the training-free alternatives are typically established on diffusion inversion, which struggles with consistency and flexibility. In this work, we propose VicoEdit, a training-free and inversion-free method to inject the visual context into the pretrained text-prompted editing model. More specifically, VicoEdit directly transforms the source image into the target one based on the visual context, thereby eliminating the need for inversion that can lead to deviated trajectories. Moreover, we design a posterior sampling approach guided by concept alignment to enhance the editing consistency. Empirical results demonstrate that our training-free method achieves even better editing performance than the state-of-the-art training-based models.

DouC: Dual-Branch CLIP for Training-Free Open-Vocabulary Segmentation

arXiv2026-04-27作者：Mohamad Zamini, Diksha Shukla

Open-vocabulary semantic segmentation requires assigning pixel-level semantic labels while supporting an open and unrestricted set of categories. Training-free CLIP-based approaches preserve strong zero-shot generalization but typically rely on a single inference mechanism, limiting their ability to jointly address unreliable local tokens and insufficient spatial coherence. We propose DouC, a training-free dual-branch CLIP framework that decomposes dense prediction into two complementary components. OG-CLIP improves patch-level reliability via lightweight, inference-time token gating, while FADE-CLIP injects external structural priors through proxy attention guided by frozen vision foundation models. The two branches are fused at the logit level, enabling local token reliability and structure-aware patch interactions to jointly influence final predictions, with optional instance-aware correction applied as post-processing. DouC introduces no additional learnable parameters, requires no retraining, and preserves CLIP's zero-shot generalization. Extensive experiments across eight benchmarks and multiple CLIP backbones demonstrate that DouC consistently outperforms prior training-free

搜索结果：Training-Free

Training-Free Image Editing with Visual Context Integration and Concept Alignment

DouC: Dual-Branch CLIP for Training-Free Open-Vocabulary Segmentation

Robustifying and Boosting Training-Free Neural Architecture Search

Fast Training-free Perceptual Image Compression

CRoPS: A Training-Free Hallucination Mitigation Framework for Vision-Language Models

A Control Architecture for Training-Free Memory Use

FlowMotion: Training-Free Flow Guidance for Video Motion Transfer

Multilevel and Sequential Monte Carlo for Training-Free Diffusion Guidance

SCOPE: Deterministic and Training-Free 3D UAV Deployment via Perimeter-based Heuristics

The Geometry of Harmful Intent: Training-Free Anomaly Detection via Angular Deviation in LLM Residual Streams

ManifoldGD: Training-Free Hierarchical Manifold Guidance for Diffusion-Based Dataset Distillation

FreeOcc: Training-free Panoptic Occupancy Prediction via Foundation Models

Towards Training-Free Scene Text Editing

Training-Free In-Context Forensic Chain for Image Manipulation Detection and Localization

Training-free Spatially Grounded Geometric Shape Encoding (Technical Report)

Training-Free Multi-Concept Image Editing

ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation

Language-to-Space Programming for Training-Free 3D Visual Grounding

Training-Free Group Relative Policy Optimization

Teleportraits: Training-Free People Insertion into Any Scene