搜索 — ResearchTracker

Due to the significant resemblance in visual appearance, pill misuse is prevalent and has become a critical issue, responsible for one-third of all deaths worldwide. Pill identification, thus, is a crucial concern needed to be investigated thoroughly. Recently, several attempts have been made to exploit deep learning to tackle the pill identification problem. However, most published works consider only single-pill identification and fail to distinguish hard samples with identical appearances. Also, most existing pill image datasets only feature single pill images captured in carefully controlled environments under ideal lighting conditions and clean backgrounds. In this work, we are the first to tackle the multi-pill detection problem in real-world settings, aiming at localizing and identifying pills captured by users in a pill intake. Moreover, we also introduce a multi-pill image dataset taken in unconstrained conditions. To handle hard samples, we propose a novel method for constructing heterogeneous a priori graphs incorporating three forms of inter-pill relationships, including co-occurrence likelihood, relative size, and visual semantic correlation. We then offer a framework

Evaluating Few-Shot Pill Recognition Under Visual Domain Shift

arXiv2026-03-11作者：W. I. Chu, G. Tarroni, L. Li

Adverse drug events are a significant source of preventable harm, which has led to the development of automated pill recognition systems to enhance medication safety. Real-world deployment of these systems is hindered by visually complex conditions, including cluttered scenes, overlapping pills, reflections, and diverse acquisition environments. This study investigates few-shot pill recognition from a deployment-oriented perspective, prioritizing generalization under realistic cross-dataset domain shifts over architectural innovation. A two-stage object detection framework is employed, involving base training followed by few-shot fine-tuning. Models are adapted to novel pill classes using one, five, or ten labeled examples per class and are evaluated on a separate deployment dataset featuring multi-object, cluttered scenes. The evaluation focuses on classification-centric and error-based metrics to address heterogeneous annotation strategies. Findings indicate that semantic pill recognition adapts rapidly with few-shot supervision, with classification performance reaching saturation even with a single labeled example. However, stress testing under overlapping and occluded condition

搜索结果：pill

High Accurate and Explainable Multi-Pill Detection Framework with Graph Neural Network-Assisted Multimodal Data Fusion

Evaluating Few-Shot Pill Recognition Under Visual Domain Shift

PILL-CoDe: Inverse Design of Polypills via Automatic Differentiation for Prescribed Drug-Release Kinetics

Image-based Contextual Pill Recognition with Medical Knowledge Graph Assistance

A Novel Approach for Pill-Prescription Matching with GNN Assistance and Contrastive Learning

Real-Time Pill Identification for the Visually Impaired Using Deep Learning

VeriMedi: Pill Identification using Proxy-based Deep Metric Learning and Exact Solution

A Forward and Backward Compatible Framework for Few-shot Class-incremental Pill Recognition

Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs

Multi-stream Fusion for Class Incremental Learning in Pill Image Classification

GPT-based Textile Pilling Classification Using 3D Point Cloud Data

Bifurcation in the angular velocity of a circular disk propelled by symmetrically distributed camphor pills

ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill Identification

Network pharmacology on the mechanism of Yi Qi Tong Qiao Pill inhibiting allergic rhinitis

PILL: Plug Into LLM with Adapter Expert and Attention Gate

Poisoning with A Pill: Circumventing Detection in Federated Learning

A dataset of medication images with instance segmentation masks for preventing adverse drug events

Physics-Informed Tracking (PIT)

Global pluripotential theory for adelic line bundles

Generative Muscle Stimulation: Providing Users with Physical Assistance by Constraining Multimodal-AI with Embodied Knowledge