搜索 — ResearchTracker

The integration of artificial intelligence (AI) with radiology marks a transformative era in medicine. Vision foundation models have been adopted to enhance radiologic imaging analysis. However, the distinct complexities of radiologic 2D and 3D radiologic data pose unique challenges that existing models, pre-trained on general non-medical images, fail to address adequately. To bridge this gap and capitalize on the diagnostic precision required in radiologic imaging, we introduce Radiologic Contrastive Language-Image Pre-training (RadCLIP): a cross-modal vision-language foundational model that harnesses Vision Language Pre-training (VLP) framework to improve radiologic image analysis. Building upon Contrastive Language-Image Pre-training (CLIP), RadCLIP incorporates a slice pooling mechanism tailored for volumetric image analysis and is pre-trained using a large and diverse dataset of radiologic image-text pairs. The RadCLIP was pre-trained to effectively align radiologic images with their corresponding text annotations, creating a robust vision backbone for radiologic images. Extensive experiments demonstrate RadCLIP's superior performance in both uni-modal radiologic image classif

D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions

arXiv2024-07-02作者：Hareem Nisar, Syed Muhammad Anwar, Zhifan Jiang

Large vision language models (VLMs) have progressed incredibly from research to applicability for general-purpose use cases. LLaVA-Med, a pioneering large language and vision assistant for biomedicine, can perform multi-modal biomedical image and data analysis to provide a natural language interface for radiologists. While it is highly generalizable and works with multi-modal data, it is currently limited by well-known challenges that exist in the large language model space. Hallucinations and imprecision in responses can lead to misdiagnosis which currently hinder the clinical adaptability of VLMs. To create precise, user-friendly models in healthcare, we propose D-Rax -- a domain-specific, conversational, radiologic assistance tool that can be used to gain insights about a particular radiologic image. In this study, we enhance the conversational analysis of chest X-ray (CXR) images to support radiological reporting, offering comprehensive insights from medical imaging and aiding in the formulation of accurate diagnosis. D-Rax is achieved by fine-tuning the LLaVA-Med architecture on our curated enhanced instruction-following data, comprising of images, instructions, as well as dis

搜索结果：radiologic

RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training

D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions

RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

RadTimeline: Timeline Summarization for Longitudinal Radiological Lung Findings

The Intrinsic Manifolds of Radiological Images and their Role in Deep Learning

Evaluation of the Management of Hospital Radiological Protection

Learning Diagnosis of COVID-19 from a Single Radiological Image

Radiological images and machine learning: trends, perspectives, and prospects

Dual-Modal Lung Cancer AI: Interpretable Radiology and Microscopy with Clinical Risk Integration

Curia: A Multi-Modal Foundation Model for Radiology

Calibrated Confidence Expression for Radiology Report Generation

Automated Structured Radiology Report Generation

Learning Segmentation from Radiology Reports

MAARTA:Multi-Agentic Adaptive Radiology Teaching Assistant

VLM-KG: Multimodal Radiology Knowledge Graph Generation

RadGame: An AI-Powered Platform for Radiology Education

Pillar-0: A New Frontier for Radiology Foundation Models

PARROT: An Open Multilingual Radiology Reports Dataset

RadioRAG: Online Retrieval-augmented Generation for Radiology Question Answering

RadEval: A framework for radiology text evaluation