Search - ResearchTracker

Large-scale text-to-image foundation models have achieved remarkable visual realism, yet generating human images with correct anatomical structures remains challenging. Existing approaches enforce anatomical constraints through part-specific modules or localized loss weighting during supervised fine-tuning on high-quality human photos, but such datasets are limited and often provide ambiguous optimization signals due to confounding factors such as lighting, pose, and background. Preference-based alignment offers an alternative, but standard Direct Preference Optimization (DPO) treats all pixels equally and therefore fails to exploit the localized nature of anatomical artifacts. To address this, we propose the framework of Alignment via Synthetic Anatomical Preference (ASAP), which constructs controlled preference pairs through a localized degradation mechanism applied to high-fidelity human images. This mechanism performs a controlled experiment on images by introducing explicit anatomical errors in targeted regions while preserving the remaining content. With this mechanism, we create the Human Anatomical Preference (HAP) dataset with over 10K curated pairs for effective anatomica

Retrieval-Augmented Anatomical Guidance for Text-to-CT Generation

arXiv, 2026-03-09. Authors: Daniele Molino, Camillo Maria Caruso, Paolo Soda

Text-conditioned generative models for volumetric medical imaging provide semantic control but lack explicit anatomical guidance, often resulting in outputs that are spatially ambiguous or anatomically inconsistent. In contrast, structure-driven methods ensure strong anatomical consistency but typically assume access to ground-truth annotations, which are unavailable when the target image is to be synthesized. We propose a retrieval-augmented approach for Text-to-CT generation that integrates semantic and anatomical information under a realistic inference setting. Given a radiology report, our method retrieves a semantically related clinical case using a 3D vision-language encoder and leverages its associated anatomical annotation as a structural proxy. This proxy is injected into a text-conditioned latent diffusion model via a ControlNet branch, providing coarse anatomical guidance while maintaining semantic flexibility. Experiments on the CT-RATE dataset show that retrieval-augmented generation improves image fidelity and clinical consistency compared to text-only baselines, while additionally enabling explicit spatial controllability, a capability inherently absent in such appro

Search results: anatomic

Towards Anatomically Plausible Human Image Generation via Synthetic Localized Preferences

Retrieval-Augmented Anatomical Guidance for Text-to-CT Generation

Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models

Anatomically Constrained Transformers for Echocardiogram Analysis

Anatomical Consistency Distillation and Inconsistency Synthesis for Brain Tumor Segmentation with Missing Modalities

BioAtt: Anatomical Prior Driven Low-Dose CT Denoising

Anatomically Guided Motion Correction for Placental IVIM Parameter Estimation with Accelerated Sampling Method

Steerable Anatomical Shape Synthesis with Implicit Neural Representations

Integrating Anatomical Priors into a Causal Diffusion Model

Augmented Equivariant Mesh Networks for Anatomical Segmentation

Anatomical grounding pre-training for medical phrase grounding

$TrIND$: Representing Anatomical Trees by Denoising Diffusion of Implicit Neural Fields

Automated Dose-Based Anatomic Region Classification of Radiotherapy Treatment for Big Data Applications

Anatomically Constrained Transformers for Cardiac Amyloidosis Classification

Benchmark-Ready 3D Anatomical Shape Classification

Anatomical Positional Embeddings

Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query

Second Order Kinematic Surface Fitting in Anatomical Structures

Fully Automated Segmentation of Fiber Bundles in Anatomic Tracing Data

Anatomically Constrained Implicit Face Models