搜索 — ResearchTracker

A fundamental challenge in image editing lies in preserving spatial locality: edits should improve targeted content without inadvertently altering surrounding regions. However, most optimization-based editing approaches treat images as holistic entities, causing global policy updates that undermine locality and introduce undesired context changes. We observe that this issue stems from a mismatch between localized editing intent and globally applied optimization signals. Motivated by this insight, we propose Edit-GRPO, preserving Locality while optimizing image editing, a locality-preserving policy optimization framework that explicitly decouples editing and preservation objectives. By assigning region-specific optimization signals to edit and non-edit areas, Edit-GRPO aligns policy updates with the spatial structure of editing tasks, enabling localized improvements while maintaining global visual coherence. This design effectively suppresses common artifacts such as context distortion and boundary inconsistency. Extensive experiments across diverse image editing scenarios demonstrate that Edit-GRPO significantly improves locality preservation while maintaining strong editing perfor

NEP: Autoregressive Image Editing via Next Editing Token Prediction

arXiv2025-08-08作者：Huimin Wu, Xiaojian Ma, Haozhe Zhao

Text-guided image editing involves modifying a source image based on a language instruction and, typically, requires changes to only small local regions. However, existing approaches generate the entire target image rather than selectively regenerate only the intended editing areas. This results in (1) unnecessary computational costs and (2) a bias toward reconstructing non-editing regions, which compromises the quality of the intended edits. To resolve these limitations, we propose to formulate image editing as Next Editing-token Prediction (NEP) based on autoregressive image generation, where only regions that need to be edited are regenerated, thus avoiding unintended modification to the non-editing areas. To enable any-region editing, we propose to pre-train an any-order autoregressive text-to-image (T2I) model. Once trained, it is capable of zero-shot image editing and can be easily adapted to NEP for image editing, which achieves a new state-of-the-art on widely used image editing benchmarks. Moreover, our model naturally supports test-time scaling (TTS) through iteratively refining its generation in a zero-shot manner. The project page is: https://nep-bigai.github.io/

搜索结果：editing

Edit-GRPO: A Locality-Preserving Policy Optimization Framework for Image Editing

NEP: Autoregressive Image Editing via Next Editing Token Prediction

HP-Edit: A Human-Preference Post-Training Framework for Image Editing

TalkPhoto: A Versatile Training-Free Conversational Assistant for Intelligent Image Editing

PhotoAgent: Agentic Photo Editing with Exploratory Visual Aesthetic Planning

3D-Consistent Multi-View Editing by Correspondence Guidance

Constraining Sequential Model Editing with Editing Anchor Compression

h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform

FREE-Edit: Using Editing-aware Injection in Rectified Flow Models for Zero-shot Image-Driven Video Editing

FlowAnchor: Stabilizing the Editing Signal for Inversion-Free Video Editing

Beyond Local Edits: Embedding-Virtualized Knowledge for Broader Evaluation and Preservation of Model Editing

Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field

EVEDIT: Event-based Knowledge Editing with Deductive Editing Boundaries

MIND-Edit: MLLM Insight-Driven Editing via Language-Vision Projection

A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models

O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing

Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer

Are Watermarked Images Editable? SafeMark for Watermark-Preserving Text-Guided Image Editing