Search - ResearchTracker

Vision-Language Models (VLMs) have demonstrated significant potential in medical image analysis, yet their application in intraoral photography remains largely underexplored due to the lack of fine-grained, annotated datasets and comprehensive benchmarks. To address this, we present MetaDent, a comprehensive resource that includes (1) a novel and large-scale dentistry image dataset collected from clinical, public, and web sources; (2) a semi-structured annotation framework designed to capture the hierarchical and clinically nuanced nature of dental photography; and (3) comprehensive benchmark suites for evaluating state-of-the-art VLMs on clinical image understanding. Our labeling approach combines a high-level image summary with point-by-point, free-text descriptions of abnormalities. This method enables rich, scalable, and task-agnostic representations. We curated 60,669 dental images from diverse sources and annotated a representative subset of 2,588 images using this meta-labeling scheme. Leveraging Large Language Models (LLMs), we derive standardized benchmarks: approximately 15K Visual Question Answering (VQA) pairs and an 18-class multi-label classification dataset, which we

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

arXiv · 2025-12-12 · Authors: Zhenyang Cai, Jiaming Zhang, Junjie Zhao

Reliable interpretation of multimodal data in dentistry is essential for automated oral healthcare, yet current multimodal large language models (MLLMs) struggle to capture fine-grained dental visual details and lack sufficient reasoning ability for precise diagnosis. To address these limitations, we present DentalGPT, a specialized dental MLLM developed through high-quality domain knowledge injection and reinforcement learning. Specifically, we constructed the largest annotated multimodal dataset for dentistry to date by aggregating over 120k dental images paired with detailed descriptions that highlight diagnostically relevant visual features. Training on this dataset significantly enhances the MLLM's visual understanding of dental conditions, while the subsequent reinforcement learning stage further strengthens its capability for multimodal complex reasoning. Comprehensive evaluations on intraoral and panoramic benchmarks, along with dental subsets of medical VQA benchmarks, show that DentalGPT achieves superior performance in disease classification and dental VQA tasks, outperforming

Search results: Dentistry

MetaDent: Labeling Clinical Images for Vision-Language Models in Dentistry

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

DentalBench: Benchmarking and Advancing LLMs Capability for Bilingual Dentistry Understanding

Towards Generalist Intelligence in Dentistry: Vision Foundation Models for Oral and Maxillofacial Radiology

A step-by-step guide to generalized estimating equations using SPSS in the field of dentistry

Generative artificial intelligence in dentistry: Current approaches and future challenges

Leveraging Point Transformers for Detecting Anatomical Landmarks in Digital Dentistry

AI Techniques for Cone Beam Computed Tomography in Dentistry: Trends and Practices

ChatGPT for Shaping the Future of Dentistry: The Potential of Multi-Modal Large Language Model

Construction of unbiased dental template and parametric dental model for precision digital dentistry

3D and 4D printing in dentistry and maxillofacial surgery: Recent advances and future perspectives

SemiTooth: a Generalizable Semi-supervised Framework for Multi-Source Tooth Segmentation

High-Fidelity 3D Tooth Reconstruction by Fusing Intraoral Scans and CBCT Data via a Deep Implicit Representation

TCATSeg: A Tooth Center-Wise Attention Network for 3D Dental Model Semantic Segmentation

OralGPT-Omni: A Versatile Dental Multimodal Large Language Model

Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis

Meta-analysis in dental research

CrownGen: Patient-customized Crown Generation via Point Diffusion Model

ToothForge: Automatic Dental Shape Generation using Synchronized Spectral Embeddings

3D Dental Model Segmentation with Geometrical Boundary Preserving