Limited-angle computed tomography (LACT) improves temporal resolution and reduces radiation dose, but suffers from severe artifacts due to missing projections. Clinical workflows record abundant patient- and acquisition-level metadata, yet such information remains underutilized in image reconstruction. To tackle the ill-posed LACT inverse problem, we propose a metadata-guided two-stage diffusion framework that leverages structured clinical contexts as semantic priors for robust reconstruction. In Stage-I, we learn a metadata-to-anatomy generative prior by conditioning a transformer-based diffusion model on clinical metadata (acquisition parameters, patient demographics, and diagnostic impressions), and sampling a coarse anatomical estimate from Gaussian noise. In Stage-II, a second conditional diffusion model performs coarse-to-fine refinement, using the Stage-I estimate as an image prior while re-injecting the same metadata to recover full-resolution anatomy. To preserve anatomical fidelity and suppress hallucinations, projection-domain data consistency is enforced periodically after denoising updates via an ADMM-based solver. Experiments on the public multimodal CTRATE dataset demonstrate that the proposed framework outperforms iterative, CNN-based, and diffusion-based baselines, with the greatest gains under severe truncation, e.g., up to 5.23%/11.21% higher SSIM/PSNR than the strongest metadata-free diffusion competitor at 90°. On real clinical cardiac CT, it yields coronary artery calcium scores closer to full-view references, indicating improved clinical utility. Furthermore, the proposed method generalizes to out-of-distribution angular ranges and projection geometries, and ablation results confirm complementary contributions from different metadata types under limited-angle conditions. Our results highlight clinical metadata as actionable semantic priors that synergize with physics-informed constraints to improve both reconstruction fidelity and clinical quantification in LACT.
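To make the periodic projection-domain data-consistency step more concrete, the sketch below shows one plausible form of an ADMM refinement applied to the denoised diffusion estimate. The operator handles (`forward_op`, `adjoint_op`), the inner gradient solver, and all hyperparameters are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch, assuming `forward_op`/`adjoint_op` stand in for the limited-angle
# projection operator and its adjoint. Every few sampling steps, the denoised estimate
# x0_hat is pulled toward the measured sinogram y by a few ADMM iterations on
#   min_x 0.5*||A x - y||^2 + (rho/2)*||x - z||^2  with split variable z.
import torch

def admm_data_consistency(x0_hat, y, forward_op, adjoint_op,
                          rho=1.0, n_admm=5, n_inner=10, lr=1e-3):
    """Refine the diffusion estimate x0_hat toward measurement consistency."""
    x = x0_hat.clone()
    z = x0_hat.clone()          # split variable anchored to the diffusion prior
    u = torch.zeros_like(x)     # scaled dual variable
    for _ in range(n_admm):
        # x-update: a few gradient steps on 0.5||Ax - y||^2 + rho/2 ||x - z + u||^2
        # (step size lr must be matched to the operator norm of A)
        for _ in range(n_inner):
            grad = adjoint_op(forward_op(x) - y) + rho * (x - z + u)
            x = x - lr * grad
        # z-update: keep z close to both x + u and the prior estimate x0_hat
        z = (rho * (x + u) + x0_hat) / (rho + 1.0)
        # dual update
        u = u + x - z
    return x
```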
Metadata-guided cross-modality 3D MRI synthesis aims to generate target-contrast volumes from source-modality data conditioned on clinically available metadata, which is important for enhancing clinical imaging flexibility. However, existing methods still suffer from two main limitations: 1) They neglect spatial dependencies within volumetric representations, yielding structurally ambiguous features that blur anatomical boundaries and hinder precise semantic integration. 2) They rely on conventional cross-attention between visual and textual features, limiting the precision of visual-semantic alignment, which reduces robustness across challenging conditions. To address these issues, we propose RTFSyn, a metadata-guided 3D MRI synthesis framework that achieves effective vision-language collaboration through a refine-then-fuse paradigm. The proposed RTFSyn offers several merits. First, we design an axis-aware visual refinement module that captures directional dependencies within volumetric features, enabling redundancy suppression and improved structural representation before fusion. Second, we propose a cross-modal adaptive fusion module that leverages pixel packing-recovery to realize efficient cross-attention for improved alignment, while text-conditioned dynamic convolution enables fine-grained semantic injection, together enhancing vision-language collaboration. Lastly, an implicit neural decoder reconstructs the target modality as a continuous function, enabling flexible high-fidelity synthesis. Under this synergistic paradigm, RTFSyn seamlessly unites robust spatial refinement with adaptive feature fusion to achieve highly precise cross-modal alignment. Extensive experiments across four multi-center datasets demonstrate that RTFSyn not only surpasses state-of-the-art methods quantitatively, but also exhibits robust performance under diverse imaging artifacts, zero-shot evaluations, and multi-dimensional clinical validations, all with favorable computational efficiency. The high fidelity, robustness, and efficiency of RTFSyn demonstrate its great potential for clinical applications.
3-D medical imaging modalities, including CT and MRI, provide high-resolution views essential for precision medicine. However, the increasing volume and complexity of 3-D medical images challenge manual analysis, particularly in classification and segmentation tasks. Although deep learning has shown considerable promise, it struggles to characterize small-scale, low-contrast anatomical structures, generalize across imaging domains, and mitigate annotation scarcity. Self-supervised learning (SSL) has emerged as an effective and annotation-efficient solution, yet existing methods, largely adapted from natural images, often fail to capture the anatomical heterogeneity and complex semantic dependencies inherent in 3-D medical data. To address these limitations, we propose AG-SSD (Anatomy-Guided Self-Supervised Distillation), a framework that explicitly incorporates anatomical priors into SSL. AG-SSD comprises three complementary modules: 1) cross-view anatomical consistency (CVAC), which generates multi-scale, anatomically consistent positive pairs via overlap-aware cropping; 2) edge-aware adaptive masking (EAAM), which prioritizes anatomy-sensitive, high-edge regions to enhance local feature learning and robust global representation; and 3) cross-view attention alignment (CVAA), which leverages attention-based fusion to achieve semantic compensation and alignment across views, mitigating semantic drift to stabilize distillation. These modules are optimized using a unified objective that combines intra-view patch distillation, inter-view [CLS] token distillation, and masked patch reconstruction. Extensive experiments on CT and MRI datasets demonstrate that AG-SSD consistently outperforms state-of-the-art SSL methods in both classification and segmentation under annotation-scarce scenarios, highlighting its potential as a scalable, label-efficient paradigm for 3-D medical image analysis and clinical applications.
Ultrasound molecular imaging (USMI) is an imaging approach that utilizes targeted microbubbles (MBs) to highlight biomarkers of disease. While differential targeted enhancement (DTE) is the current state-of-the-art for USMI, its reliance on destructive pulses hinders real-time clinical application. We have developed a neural network-based nondestructive USMI approach, validated in vivo using a transgenic mouse model of spontaneous breast cancer. To enhance training efficacy despite a limited animal number (N=14), we utilized several augmentation strategies, including the use of multiple targeted MB types for each animal to generate independent image and texture patterns, alternative DTE approaches (sham and injection DTE), and random patch selection, overall resulting in a total of 15,350 patches to train the network. The resulting nondestructive USMI produces a pixelwise map of the classification score for the presence of targeted MBs. Our nondestructive USMI achieved a correlation coefficient of 0.954 with DTE, a continuous dice coefficient of 0.863 for a molecular signal coverage of the lesion over 20%, and a higher AUC than DTE (0.954 vs. 0.845) compared to the reference image developed from the contrast-enhanced ultrasound (CEUS) image and manual lesion contour. Nondestructive imaging during continuous motion of the transducer under elevation sweeps yielded fewer artifacts and higher AUC than DTE (0.953 vs. 0.892), compared to the reference image. This demonstrates the potential of free-hand and real-time nondestructive imaging. Overall, nondestructive imaging showed comparable performance to DTE under stationary conditions and superior performance to DTE under transducer motion, indicating its clinical imaging potential.
Assessing the quality of automatic image segmentation is crucial in clinical practice, but often very challenging due to the limited availability of ground truth annotations. Reverse Classification Accuracy (RCA) is an approach that estimates the quality of new predictions on unseen samples by training a segmenter on those predictions, and then evaluating it against existing annotated images. In this work we introduce ConfIC-RCA (Conformal In-Context RCA), a novel method for automatically estimating segmentation quality with statistical guarantees in the absence of ground-truth annotations, which consists of two main innovations. First, we propose In-Context RCA, which leverages recent in-context learning models for image segmentation and incorporates retrieval-augmentation techniques to select the most relevant reference images. This approach enables efficient quality estimation with minimal reference data while avoiding the need to train additional models. Second, we introduce Conformal RCA, which extends both the original RCA framework and In-Context RCA to go beyond point estimation. Using tools from split conformal prediction, Conformal RCA produces prediction intervals for segmentation quality, providing statistical guarantees that the true score lies within the estimated interval with a user-specified probability. Validated across 10 different medical imaging tasks in various organs and modalities, our methods demonstrate robust performance and computational efficiency, offering a promising solution for automated quality control in clinical workflows, where fast and reliable segmentation assessment is essential. The code is available at https://github.com/mcosarinsky/Conformal-In-Context-RCA.
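Since Conformal RCA builds on split conformal prediction, the generic recipe for turning a calibration set of (estimated, true) quality scores into an interval with the stated coverage guarantee can be sketched as follows; the absolute-error nonconformity score and quality metric (e.g., Dice) are assumptions for illustration, not necessarily the paper's exact choices.

```python
# Sketch of a split-conformal interval for a new quality estimate (e.g., predicted Dice).
import numpy as np

def conformal_quality_interval(est_cal, true_cal, est_new, alpha=0.1):
    """Return an interval containing the true score with probability >= 1 - alpha."""
    scores = np.abs(np.asarray(true_cal) - np.asarray(est_cal))   # nonconformity scores
    n = len(scores)
    k = int(np.ceil((n + 1) * (1 - alpha)))                       # finite-sample corrected rank
    if k > n:                                                     # calibration set too small
        return -np.inf, np.inf
    q = np.sort(scores)[k - 1]                                    # conformal quantile
    return est_new - q, est_new + q
```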
Fully supervised polyp segmentation relies on costly pixel-level annotations. Although semi- and weakly supervised methods reduce annotation requirements, they still depend on partial mask supervision. Text-supervised segmentation is a promising alternative; however, for polyps, the key challenge is to ground instance-specific phrases to the correct lesion region under cluttered backgrounds and large appearance variations. Existing approaches often rely on coarse text-image alignment, limiting precise region-level semantic correspondence. In this paper, we propose Text-Image Co-Alignment (TICoA), a text-supervised framework for polyp segmentation. TICoA leverages structured clinical descriptions generated by large language models (LLMs) as weak supervision and formulates segmentation as a fine-grained phrase-region co-alignment problem. Through contrastive learning, TICoA explicitly associates query phrases with corresponding image regions to achieve robust semantic grounding under weak supervision. Architecturally, we adopt a State-Space Model (Mamba) to efficiently model long-range dependencies with linear computational complexity. To support effective cross-modal interaction, we further design a dedicated Mamba Fusion module with a Bi-Dimension Fusion (BiDF) strategy, which progressively propagates information along spatial and channel dimensions. Experiments on polyp datasets, with additional validation on skin lesion segmentation, demonstrate that TICoA is competitive with state-of-the-art weakly supervised methods. Our code and data are available at https://github.com/silentyuchen/TICoA.
Conventional registration approaches frequently underperform when applied to sparse feature alignment (e.g., retinal vessels and filamentous collagen fibers in second-harmonic generation (SHG) and bright-field (BF) images), as these tasks demand simultaneous handling of global affine registration and local deformation correction. End-to-end learning-based approaches struggle with the minimal effective gradients that such sparse features yield during loss back-propagation, while descriptor matching methods, though helpful, lack a fidelity loss and fail to adapt to local deformation. To address these issues, we propose Neural Affine Optimization (NeOn), which implicitly approximates discrete optimization using a few neural network layers, combined with a sampling-regression layer to handle affine transformations. NeOn allows iterative refinement with a fidelity loss and provides a flexible transition between a purely affine configuration and a linear weighted blend of affine and deformation fields. NeOn's performance was validated on four public datasets. In multi-modal SHG-BF microscopy registration, NeOn achieved top rankings on the validation leaderboard for Task 3 of the Learn2Reg Challenge 2024. For retinal image registration, NeOn outperformed existing methods on both mono-modal and multi-modal datasets, reducing target registration error from 6.3 to 2.1 pixels in mono-modal and from 2.6 to 1.8 pixels in multi-modal registration. Furthermore, NeOn demonstrates strong generalization and can be effectively extended to 3D multi-modality image registration scenarios.
To be adopted in safety-critical domains like medical image analysis, AI systems must provide human-interpretable decisions. Variational Information Pursuit (V-IP) offers an interpretable-by-design framework by sequentially querying input images for human-understandable concepts, using their presence or absence to make predictions. However, existing V-IP methods overlook sample-specific uncertainty in concept predictions, which can arise from ambiguous features or model limitations, leading to suboptimal query selection and reduced robustness. In this paper, we propose an interpretable and uncertainty-aware framework for medical imaging that addresses these limitations by accounting for upstream uncertainties in concept-based, interpretable-by-design models. Specifically, we introduce two uncertainty-aware models, EUAV-IP and IUAV-IP, that integrate uncertainty estimates into the V-IP querying process to prioritize more reliable concepts per sample. EUAV-IP skips uncertain concepts via masking, while IUAV-IP incorporates uncertainty into query selection implicitly for more informed and clinically aligned decisions. Our approach allows models to make reliable decisions based on a subset of concepts tailored to each individual sample, without human intervention, while maintaining overall interpretability. We evaluate our methods on five medical imaging datasets across four modalities: dermoscopy, X-ray, ultrasound, and blood cell imaging. The proposed IUAV-IP model achieves state-of-the-art accuracy among interpretable-by-design approaches on four of the five datasets, and generates more concise explanations by selecting fewer yet more informative concepts. These advances enable more reliable and clinically meaningful outcomes, enhancing model trustworthiness and supporting safer AI deployment in healthcare. Our code and models are available at: https://github.com/Nahiduzzaman09/UAV-IP.
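The explicit variant (EUAV-IP) is described as skipping uncertain concepts via masking before query selection. A minimal sketch of that idea is shown below; the informativeness scores produced by the querier and the uncertainty threshold `tau` are placeholders rather than the paper's exact quantities.

```python
# Sketch of uncertainty-masked query selection for a single sample.
import torch

def select_next_query(informativeness, concept_uncertainty, asked_mask, tau=0.5):
    """Pick the next concept to query, skipping already-asked and unreliable concepts."""
    scores = informativeness.clone()
    scores[asked_mask] = float("-inf")                  # never re-ask a concept
    scores[concept_uncertainty > tau] = float("-inf")   # mask uncertain concepts for this sample
    return int(torch.argmax(scores))
```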
Glaucoma is a leading cause of irreversible blindness worldwide, with asymptomatic early stages often delaying diagnosis and treatment. Early and accurate diagnosis requires integrating complementary information from multiple ocular imaging modalities. However, most existing studies rely on single- or dual-modality imaging, such as fundus and optical coherence tomography (OCT), for coarse binary classification, thereby restricting the exploitation of complementary information and hindering both early diagnosis and stage-specific treatment. To address these limitations, we propose glaucoma lesion evaluation and analysis with multimodal imaging (GLEAM), the first publicly available tri-modal glaucoma dataset comprising scanning laser ophthalmoscopy fundus images, circumpapillary OCT images, and visual field (VF) pattern deviation maps, annotated with four disease stages, enabling effective exploitation of multimodal complementary information and facilitating accurate diagnosis and treatment across disease stages. To effectively integrate cross-modal information, we propose hierarchical attentive masked modeling (HAMM) for multimodal glaucoma classification. Our framework employs hierarchical attentive encoders and light decoders to focus cross-modal representation learning on the encoder. The attention module, named multimodal-channel graph attention (MCGA), boosts glaucoma classification performance by emulating two key clinical reasoning steps: first, it uses a multi-head modality gating mechanism to replicate ophthalmologists' confidence scoring of fundus, OCT, and VF modalities; then, MCGA leverages a relational graph attention network to cross-examine structural-functional consistencies of weighted modalities. The experiments on GLEAM demonstrate that tri-modal fusion significantly outperforms single-modal and dual-modal configurations. Moreover, our proposed HAMM achieves superior performance compared with state-of-the-art multimodal learning methods. The dataset and code are publicly available via https://github.com/microewing/HAMM.
Sparse-View CT (SVCT) reconstruction improves temporal resolution and reduces radiation dose, yet its clinical use is hindered by artifacts due to view reduction and domain shifts from scanner, protocol, or anatomical variations, leading to performance degradation in out-of-distribution (OOD) scenarios. We propose a Cross-Distribution Diffusion Priors-Driven Iterative Reconstruction (CDPIR) framework to tackle the OOD problem in SVCT. CDPIR integrates cross-distribution diffusion priors, derived from a Scalable Interpolant Transformer (SiT), with model-based iterative reconstruction methods. Specifically, we train a SiT backbone, an extension of the Diffusion Transformer (DiT) architecture, to establish a unified stochastic interpolant framework, leveraging Classifier-Free Guidance (CFG) across multiple datasets. By randomly dropping the conditioning with a null embedding during training, the model learns a more transferable cross-distribution prior that encourages domain-invariant anatomical structures while allowing domain-specific appearance modulation. During sampling, the globally sensitive transformer-based diffusion model exploits the cross-distribution prior within the unified stochastic interpolant framework, enabling flexible and stable control over multi-distribution-to-noise interpolation paths and decoupled sampling strategies, thereby improving adaptation to OOD reconstruction. By alternating between data fidelity and sampling updates, our model achieves state-of-the-art performance with superior detail preservation in SVCT reconstructions. Extensive experimental results demonstrate that CDPIR significantly outperforms existing approaches, particularly under OOD conditions, highlighting its robustness and potential clinical value in challenging imaging scenarios. The code is available at https://github.com/Graeme-Lee/CDPIR.
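The classifier-free guidance mechanism mentioned above (randomly replacing the dataset/domain condition with a null embedding during training, then combining conditional and unconditional predictions at sampling time) follows a standard recipe, sketched below; the shape of the condition embedding and the guidance scale are assumptions for illustration.

```python
# Minimal sketch of classifier-free guidance with null-embedding dropout.
import torch

def cfg_training_step(model, x_noisy, t, cond_emb, null_emb, p_drop=0.1):
    """Randomly replace the condition with a learned null embedding during training."""
    drop = torch.rand(cond_emb.shape[0], device=cond_emb.device) < p_drop
    cond = torch.where(drop[:, None], null_emb.expand_as(cond_emb), cond_emb)
    return model(x_noisy, t, cond)                       # predicted noise / velocity

def cfg_predict(model, x_noisy, t, cond_emb, null_emb, guidance_scale=2.0):
    """Guided prediction at sampling time: uncond + w * (cond - uncond)."""
    pred_cond = model(x_noisy, t, cond_emb)
    pred_uncond = model(x_noisy, t, null_emb.expand_as(cond_emb))
    return pred_uncond + guidance_scale * (pred_cond - pred_uncond)
```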
Coronary computed tomography angiography (CCTA) is a pivotal non-invasive imaging modality for diagnosing cardiac disease. However, due to temporal resolution limitations, cardiac structures, specifically coronary arteries, may suffer from motion artifacts when CCTA is applied to patients with arrhythmias or high heart rates. Limited-angle CT (LA-CT) emerges as a promising alternative by significantly reducing the acquisition time, thereby mitigating the motion artifacts. Yet, LA-CT unavoidably leads to severe wedge artifacts, posing a significant challenge. Therefore, to reconstruct motion-free cardiac CT images while suppressing the wedge artifacts, we propose a Segmentation-Guided Accelerating Diffusion Model (SGADM) tailored for LA-CT imaging. While diffusion models have demonstrated exceptional performance in medical imaging, their extensive sampling procedures impose high computational costs, hindering clinical applicability. To address this issue, SGADM employs an accelerated diffusion model that directly generates high-quality CT images. Moreover, SGADM adopts a diffusion perceptual loss to ensure data distribution consistency between two successive sampling steps. As a result, SGADM can provide satisfactory results in fewer than 10 steps. Additionally, SGADM incorporates segmentation guidance to explicitly enhance the spatial-positional accuracy of generated coronary arteries. Both quantitative and qualitative evaluations on simulated and real datasets reveal that SGADM effectively restores high-quality CCTA images with minimal motion artifacts, highlighting its potential for clinical applications.
Deep learning (DL) methods can reconstruct highly accelerated magnetic resonance imaging (MRI) scans, but they rely on application-specific large training datasets and often generalize poorly to out-of-distribution data. Self-supervised deep learning algorithms perform scan-specific reconstructions, but still require complicated hyperparameter tuning based on the acquisition and often offer limited acceleration. This work develops a bilevel-optimized implicit neural representation (INR) approach for scan-specific MRI reconstruction. The method explicitly formulates the undersampled MRI reconstruction problem as a bilevel optimization problem and automatically optimizes the multidimensional hyperparameters of the reconstruction method for a given acquisition protocol, enabling a tailored reconstruction without training data. The proposed algorithm uses Gaussian process regression to optimize INR hyperparameters, accommodating various acquisitions. The INR includes a trainable positional encoder for high-dimensional feature embedding and a small multilayer perceptron for decoding. The bilevel optimization is computationally efficient, requiring only a few minutes per typical 2D Cartesian scan. On the scanner hardware, the subsequent scan-specific reconstruction, using offline-optimized hyperparameters, is completed within seconds, while achieving comparable or improved image quality compared to previous model-based and self-supervised learning methods.
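As an illustration of how the outer (hyperparameter) level of such a bilevel scheme could be driven by Gaussian process regression, the sketch below uses scikit-optimize's `gp_minimize` as a stand-in. The hyperparameter space and the helper `train_and_eval_inr`, which would train the scan-specific INR on a retrospectively undersampled calibration scan and return a validation error, are hypothetical and chosen only to convey the structure.

```python
# Sketch of GP-based hyperparameter search over an INR reconstruction pipeline.
from skopt import gp_minimize
from skopt.space import Real, Integer

space = [
    Real(1e-4, 1e-2, prior="log-uniform", name="lr"),          # INR learning rate
    Real(1e-6, 1e-1, prior="log-uniform", name="reg_weight"),  # regularization weight
    Integer(4, 10, name="encoder_levels"),                     # positional-encoder size
]

def objective(hparams):
    lr, reg_weight, encoder_levels = hparams
    # Inner level: train the INR with these hyperparameters and return a
    # validation error on held-out k-space (hypothetical helper, lower is better).
    return train_and_eval_inr(lr=lr, reg_weight=reg_weight,
                              encoder_levels=int(encoder_levels))

result = gp_minimize(objective, space, n_calls=25, random_state=0)
best_hparams = result.x   # reused at scan time for the same acquisition protocol
```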
Contrast-enhanced CT (CECT) is essential for clinical evaluation of vessel structures and function. However, high contrast agent dose increases the risk of renal injury. Reducing contrast agent dose decreases the contrast between vessels and surrounding tissues, which complicates diagnosis. Despite their potential in CECT synthesis, existing methods often suffer from edge blurring, contrast anomalies, and texture distortion, limiting their clinical applicability. This paper proposes a novel Multi-granularity Adversarial Generation Integrated Consistency Representation (MAGIC) for high-quality synthesis from low-contrast-enhanced CT to clinically usable CECT. MAGIC addresses current problems through four innovations: 1) Multi-Granularity Refined Booster (MRB) introduces contextual refinement and cross-granularity boosting mechanisms for mining multi-granularity contextual information to enhance feature representation capability, thus improving tissue edge clarity. 2) Supervised Contrast Enhancement Module (SCEM) imbues MAGIC with the ability to enhance tissue contrast, leveraging supervised images to adaptively adjust the contrast information of soft tissue structures and vessels, effectively overcoming the challenge of contrast anomalies. 3) Hierarchical Harmonized Consistency Representation (HHCR) utilizes domain consistency to construct a novel auxiliary loss for harmonizing the semantic and content relationships of multi-level hierarchical features to improve tissue texture performance, ensuring accurate restoration of real textures. 4) Dual-path Dynamic Collaborative Discriminator (DDCD) is designed with complementary strategies and injects content priors to dynamically coordinate the discrimination process, thereby comprehensively evaluating the fidelity of the synthesized results. Qualitative and quantitative results demonstrate that MAGIC significantly outperforms existing methods in edge clarity, image contrast, and texture restoration, underscoring its substantial clinical potential.
Atrial fibrillation, characterized by high prevalence and poor prognosis, presents a significant global health burden. Accurate segmentation and measurement of left ventricular and left atrial appendage morphology and function are essential for reliable risk assessment. However, these tasks are hindered by ambiguous boundaries, complex cardiac motion, and sparse annotations. To address these challenges, we propose a Keypoint-Guided Medical Video Segmentation Model with Spatiotemporal Feature Fusion (KG-STS). First, we propose a shape-constrained point encoder that explicitly encodes boundary points to improve the representation of ambiguous boundaries. Next, we introduce a motion-aware alignment module that models cardiac motion by forming coherent motion information across frames. Building on these two modules, we develop a keypoint-guided spatiotemporal feature fusion module that integrates spatial boundary representations with temporal motion cues to enhance decoding features under sparse annotations, enabling temporally consistent segmentation and supporting morphological measurement. We evaluate the segmentation and measurement performance of our method on a self-constructed multi-view transesophageal echocardiography dataset and two publicly available transthoracic echocardiography datasets. The results demonstrate that KG-STS achieves superior temporal consistency in segmentation and higher accuracy in morphological measurements compared to competing methods.
Aggregating features of tens of thousands of patches into Whole Slide Images (WSIs) representations via aggregators is a crucial step in computational pathology. However, existing aggregation strategies overlook the morphological variability of tissue regions in WSIs stemming from differences in clinical procedures and tumor characteristics, leading to two critical limitations: 1) attention collapse in long sequences caused by significant variation in patch numbers across WSIs (ranging from thousands to tens of thousands per WSI); 2) attention misallocation due to under-trained positional embeddings resulting from the non-uniform spatial coordinates introduced by irregular patch distributions. Consequently, current attention-based methods struggle to generalize across this morphological variability, resulting in inconsistent aggregation performance and compromised model reliability in clinical settings. To address these issues, we propose an Entropy-Stabilized Attention-based Multiple Instance Learning (StableMIL) framework, which incorporates an entropy-stabilized attention mechanism to ensure consistent aggregation across WSIs with varying patch numbers and a Randomly Projected 2D rotary position embedding to enhance spatial representation robustness across irregular patch distributions. Extensive theoretical and experimental analyses on nine WSI datasets spanning diverse cancer types, across both classification and survival prediction tasks, demonstrate that StableMIL effectively overcomes the challenges of handling long instance sequences and out-of-distribution spatial coordinates. Our framework consistently outperforms representative baselines, particularly in survival prediction, with stable improvements observed across all evaluated cancer types and morphological scenarios, highlighting its potential for real-world clinical applications. Our source code is available at https://github.com/theeeqi/stableMIL.
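The abstract does not spell out the form of the entropy-stabilized attention. One widely used recipe for keeping attention from flattening (entropy growing) as the number of patches increases is to scale the logits by a log-length factor, sketched below purely as an illustration of the general idea rather than StableMIL's actual mechanism; `base_len` is an assumed reference sequence length.

```python
# Length-scaled attention sketch: logits scaled by log(n)/log(base_len) so that
# attention entropy stays roughly constant as the bag size n grows.
import math
import torch.nn.functional as F

def length_scaled_attention(q, k, v, base_len=4096):
    """Single-head attention whose sharpness adapts to the number of instances."""
    n, d = k.shape[-2], q.shape[-1]
    scale = math.log(max(n, 2)) / (math.log(base_len) * math.sqrt(d))
    attn = F.softmax(q @ k.transpose(-2, -1) * scale, dim=-1)
    return attn @ v
```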
Spatial Transcriptomics (ST) technology detects gene expression from tissue biopsies, playing an emerging role in cancer diagnosis and precision medicine. However, the high cost of ST technology limits its broader application. Recently, deep learning approaches have provided insight into predicting gene expression based on H&E-stained histopathology images. Nevertheless, the relationship between morphological features and gene expression is highly complex. To address these challenges, we propose DiffBulk, a novel two-stage framework that leverages conditional diffusion models to learn expressive image representations enriched with gene expression information. In the first stage, we introduce a gene-to-image conditional diffusion model equipped with a permutation-invariant open-embedding gene encoder, which enables unified training across diverse gene panels. In the second stage, diffusion-derived features are fused with representations from a pathology foundation model, effectively bridging the domain gap and improving downstream gene expression prediction. We evaluate DiffBulk on high-quality Xenium ST data curated from the HEST dataset and the CrunchDAO challenge, constructing tile-level pseudo-bulk datasets for training and evaluation. Extensive experiments demonstrate that DiffBulk consistently outperforms state-of-the-art baselines across all metrics for gene expression prediction. These findings highlight the potential of diffusion-based gene-image representation learning and suggest promising directions for future research.
Longitudinal brain analysis is essential for understanding healthy aging and identifying pathological deviations. Longitudinal registration of sequential brain MRI underpins such analyses. However, existing methods are limited by reliance on densely sampled time series, a trade-off between accuracy and temporal smoothness, and an inability to prospectively forecast future brain states. To overcome these challenges, we introduce TimeFlow, a learning-based framework for longitudinal brain MRI registration. TimeFlow uses a U-Net backbone with temporal conditioning to model neuroanatomy as a continuous function of age. Given only two scans from an individual, TimeFlow estimates accurate and temporally coherent deformation fields, enabling non-linear extrapolation to predict future brain states. This is achieved by our proposed inter-/extrapolation consistency constraints applied to both the deformation fields and deformed images. Remarkably, these constraints preserve temporal consistency and continuity without requiring explicit smoothness regularizers or densely sampled sequential data. Extensive experiments demonstrate that TimeFlow outperforms state-of-the-art methods in terms of both future timepoint forecasting and registration accuracy. Moreover, TimeFlow supports novel biological brain aging analyses by differentiating neurodegenerative trajectories from normal aging without requiring segmentation, thereby eliminating the need for labor-intensive annotations and mitigating segmentation inconsistency. TimeFlow offers an accurate, data-efficient, and annotation-free framework for longitudinal analysis of brain aging and chronic diseases, capable of forecasting brain changes beyond the observed study period.
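To convey what an image-level interpolation consistency constraint can look like, the 2D sketch below penalizes disagreement between the direct warp from age t0 to t1 and a two-hop warp through an intermediate age. The `model(img, t_src, t_tgt)` signature returning a pixel-unit displacement field is an assumption for illustration; the paper applies its constraints to both the deformation fields and the deformed images, and also to extrapolated ages.

```python
# Sketch of an interpolation-consistency loss for age-conditioned deformation fields.
import torch
import torch.nn.functional as F

def warp(img, flow):
    """Warp an image (B, C, H, W) with a displacement field (B, 2, H, W) in pixels."""
    b, _, h, w = img.shape
    ys, xs = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
    base = torch.stack((xs, ys), dim=0).float().to(img).unsqueeze(0)   # identity grid (1, 2, H, W)
    coords = base + flow                                               # sampled pixel locations
    gx = 2.0 * coords[:, 0] / (w - 1) - 1.0                            # normalize to [-1, 1]
    gy = 2.0 * coords[:, 1] / (h - 1) - 1.0
    return F.grid_sample(img, torch.stack((gx, gy), dim=-1), align_corners=True)

def interp_consistency_loss(model, img0, t0, t_mid, t1):
    """Warping through an intermediate age should agree with the direct warp t0 -> t1."""
    direct = warp(img0, model(img0, t0, t1))
    hop = warp(img0, model(img0, t0, t_mid))
    two_hop = warp(hop, model(hop, t_mid, t1))
    return F.l1_loss(two_hop, direct)
```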
Colonoscopy video generation delivers dynamic, information-rich data critical for diagnosing intestinal diseases, particularly in data-scarce scenarios. High-quality video generation demands temporal consistency and precise control over clinical attributes, but faces challenges from irregular intestinal structures, diverse disease representations, and various imaging modalities. To this end, we propose ColoDiff, a diffusion-based framework that generates dynamically consistent and content-aware colonoscopy videos, aiming to alleviate data shortage and assist clinical analysis. At the inter-frame level, our TimeStream module decouples temporal dependency from video sequences through a cross-frame tokenization mechanism, enabling intricate dynamic modeling despite irregular intestinal structures. At the intra-frame level, our Content-Aware module incorporates noise-injected embeddings and learnable prototypes to realize precise control over clinical attributes, going beyond the coarse guidance of standard diffusion models. Additionally, ColoDiff employs a non-Markovian sampling strategy that cuts steps by over 90% for real-time generation. ColoDiff is evaluated across three public datasets and one hospital database, based on both generation metrics and downstream tasks including disease diagnosis, modality discrimination, bowel preparation scoring, and lesion segmentation. Extensive experiments show ColoDiff generates videos with smooth transitions and rich dynamics. ColoDiff also produces customized contents tailored for diverse tasks, e.g., colitis, polyps, and adenomas for diagnosis. Incorporating synthetic videos into training promotes discriminative representation learning and improves diagnosis accuracy by 7.1%. ColoDiff represents a step toward controllable colonoscopy video generation, revealing the potential of synthetic videos in complementing authentic data and mitigating data scarcity in clinical settings.
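Non-Markovian sampling strategies that cut the number of denoising steps are typified by DDIM-style samplers, which jump along a sparse subset of timesteps with a deterministic update. The sketch below shows that generic update only as an illustration of the family of samplers, not ColoDiff's exact procedure; the `model`/`cond` signature and noise schedule are assumptions.

```python
# Generic DDIM-style (non-Markovian, deterministic) sampling over a sparse timestep subset.
import torch

@torch.no_grad()
def ddim_like_sampling(model, shape, alphas_cumprod, n_steps=20, cond=None):
    """Sample with n_steps kept timesteps instead of the full training schedule."""
    T = len(alphas_cumprod)
    timesteps = torch.linspace(T - 1, 0, n_steps).long()            # e.g., 20 of 1000 steps
    x = torch.randn(shape)
    for i, t in enumerate(timesteps):
        t_batch = torch.full((shape[0],), int(t), dtype=torch.long)
        a_t = alphas_cumprod[t]
        a_prev = alphas_cumprod[timesteps[i + 1]] if i + 1 < n_steps else torch.tensor(1.0)
        eps = model(x, t_batch, cond)                                # predicted noise
        x0_hat = (x - (1 - a_t).sqrt() * eps) / a_t.sqrt()           # clean-frame estimate
        x = a_prev.sqrt() * x0_hat + (1 - a_prev).sqrt() * eps       # jump to next kept step
    return x
```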
Whole Slide Images (WSIs) have been widely used in computational pathology (CPath) for various tasks. However, obtaining high-quality annotations remains a major bottleneck. Task-aware unsupervised anomaly detection models offer a promising alternative, as they are trained solely on task-specific normal data and can be adapted to clinically defined objectives, such as cancer detection, depending on the problem formulation. Despite this potential, anomaly detection models have not been thoroughly explored in the context of WSIs. Existing approaches often directly adopt techniques from other domains, leading to suboptimal performance due to domain discrepancies and the unique characteristics of WSIs. Given that feature reconstruction-based methods have become popular in anomaly detection research, this study first analyzes the designs of such models in the context of conditional reconstruction, revealing potential directions to adapt and further improve them. Based on our analysis, we revisit and refine these models to better accommodate the distinct properties of WSIs. Moreover, we propose an Explicit Conditional Reconstruction framework, termed ECR4AD, which significantly enhances model performance. Our method is comprehensively evaluated on four datasets covering breast and prostate cancer metastasis detection, as well as Gleason grading of prostate cancer, all conducted at the tile level on images extracted from WSIs. The experimental results show that ECR4AD consistently achieves substantial improvements in AUROC across all datasets, demonstrating its effectiveness for tile-level task-aware unsupervised anomaly detection in CPath. The code can be found at https://github.com/uobinxiao/wsi_anomaly_detection.git.
Universal medical image registration through a single model handling various registration tasks has attracted increasing interest. However, existing deep learning-based methods face two major challenges in adapting to universal registration tasks: 1) they lack generalizable feature representation capabilities for cross-task registration; 2) they rely solely on model architectures with fixed parameters, which limits their flexibility to dynamically adapt to different registration tasks and inherently compromises their generalization capability for zero-shot performance on unseen tasks. To address these limitations, we propose CIM-VTP, a novel two-stage universal registration framework. In the first stage, our proposed Correlation-guided Image Modeling (CIM)-based pretraining strategy leverages cross-image correlation to guide the masked modeling process, which facilitates capturing the spatial correspondences essential for registration and provides universal representation capabilities as a foundation for registration learning. In the second stage, we introduce a registration task classifier to identify the type of a given input task, which explicitly quantifies the similarity between current inputs and previously seen tasks. The obtained task similarity scores are then fed as prior information into our carefully designed multi-resolution Visual-Textual Task Prompt (VTP) modules, which integrate task-relevant knowledge through prompt learning to adaptively adjust decoder parameters for different input domains. Extensive experiments across six different registration tasks demonstrate that the proposed CIM-VTP exhibits superior universal image registration performance. The code will be released at https://github.com/xiehousheng/CIM-VTP.