搜索 — ResearchTracker

A large number of embeddings trained on medical data have emerged, but it remains unclear how well they represent medical terminology, in particular whether the close relationship of semantically similar medical terms is encoded in these embeddings. To date, only small datasets for testing medical term similarity are available, not allowing to draw conclusions about the generalisability of embeddings to the enormous amount of medical terms used by doctors. We present multiple automatically created large-scale medical term similarity datasets and confirm their high quality in an annotation study with doctors. We evaluate state-of-the-art word and contextual embeddings on our new datasets, comparing multiple vector similarity metrics and word vector aggregation techniques. Our results show that current embeddings are limited in their ability to adequately encode medical terms. The novel datasets thus form a challenging new benchmark for the development of medical embeddings able to accurately represent the whole medical terminology.

Terminology-aware Medical Dialogue Generation

arXiv2022-10-27作者：Chen Tang, Hongbo Zhang, Tyler Loakman

Medical dialogue generation aims to generate responses according to a history of dialogue turns between doctors and patients. Unlike open-domain dialogue generation, this requires background knowledge specific to the medical domain. Existing generative frameworks for medical dialogue generation fall short of incorporating domain-specific knowledge, especially with regard to medical terminology. In this paper, we propose a novel framework to improve medical dialogue generation by considering features centered on domain-specific terminology. We leverage an attention mechanism to incorporate terminologically centred features, and fill in the semantic gap between medical background knowledge and common utterances by enforcing language models to learn terminology representations with an auxiliary terminology recognition task. Experimental results demonstrate the effectiveness of our approach, in which our proposed framework outperforms SOTA language models. Additionally, we provide a new dataset with medical terminology annotations to support the research on medical dialogue generation. Our dataset and code are available at https://github.com/tangg555/meddialog.

搜索结果：Medical Terminology

Can Embeddings Adequately Represent Medical Terminology? New Large-Scale Medical Term Similarity Datasets Have the Answer!

Terminology-aware Medical Dialogue Generation

M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models

Medical Knowledge Intervention Prompt Tuning for Medical Image Classification

Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation

Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting

Adaptive Differential Privacy for Federated Medical Image Segmentation Across Diverse Modalities

GAN-GA: A Generative Model based on Genetic Algorithm for Medical Image Generation

Instruction-tuned Large Language Models for Machine Translation in the Medical Domain

Ambient-Pix2PixGAN for Translating Medical Images from Noisy Data

A comprehensive survey on deep active learning in medical image analysis

Introduction of Medical Imaging Modalities

Invariant Scattering Transform for Medical Imaging

MedIAnomaly: A comparative study of anomaly detection in medical images

Fréchet Radiomic Distance (FRD): A Versatile Metric for Comparing Medical Imaging Datasets

Test-time generative augmentation for medical image segmentation

MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation

The Need for Medically Aware Video Compression in Gastroenterology

HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation

Towards objective and systematic evaluation of bias in artificial intelligence for medical imaging