搜索结果：silent

共找到 20 条结果

高级筛选 ▾

Ekka: Automated Diagnosis of Silent Errors in LLM Inference

arXiv2026-06-03

LLM serving frameworks are quickly evolving with a complex software stack and a vast number of optimizations. The rapid development process can introduce silent errors where output quality silently degrades without any explicit error signals. Diagnosing silent errors is notoriously difficult due to the substantial semantic gap between the high-level symptoms and the low-level root causes. We observe that diagnosis of silent errors can be effectively framed as a differential debugging problem by leveraging the existence of semantically correct reference implementations. We propose Ekka, an automated diagnosis system that identifies root causes by systematically aligning and comparing intermediate execution states between a target and a reference framework. We constructed a benchmark of real-world silent errors from popular serving frameworks, where Ekka shows 80% pass@1 diagnosis accuracy and 88% pass@5 diagnosis accuracy, outperforming state-of-the-art systems. Ekka also diagnoses 4 new silent errors from serving frameworks, all of which have been confirmed by the developers.

LLM-Powered Silent Bug Fuzzing in Deep Learning Libraries via Versatile and Controlled Bug Transfer

arXiv2026-02-26作者：Kunpeng Zhang, Dongwei Xiao, Daoyuan Wu

Deep learning (DL) libraries are widely used in critical applications, where even subtle silent bugs can lead to serious consequences. While existing DL fuzzing techniques have made progress in detecting crashes, they inherently struggle to detect silent bugs due to the lack of effective test programs and corresponding oracles. Building on the observation that historical bug reports contain rich, underutilized information about silent bugs, we leverage large language models (LLMs) to perform versatile yet controlled bug transfer for silent bug fuzzing. Specifically, our approach uses LLMs to extract context-aware bug patterns from historical issues, match semantically related Application Programming Interfaces (APIs) using functionality-based embeddings, and synthesize test cases with customized oracles. This enables proactive detection of silent bugs by transferring high-risk contexts and oracle designs from known buggy APIs to functionally similar target APIs. To ensure the reliability of our context-aware bug transfer, we introduce an LLM-powered self-validation module that systematically evaluates the validity of each transferred bug instance. We implement this methodology in a

搜索结果：silent

Ekka: Automated Diagnosis of Silent Errors in LLM Inference

LLM-Powered Silent Bug Fuzzing in Deep Learning Libraries via Versatile and Controlled Bug Transfer

An Introduction to Silent Paralinguistics

Silent Abandonment in Text-Based Contact Centers: Identifying, Quantifying, and Mitigating its Operational Impacts

Silent Speech Sentence Recognition with Six-Axis Accelerometers using Conformer and CTC Algorithm

Training with Confidence: Catching Silent Errors in Deep Learning Training with Automated Proactive Checks

Silent Failures in Federated Personalization of Foundation Models

The Quiet Contributions: Insights into AI-Generated Silent Pull Requests

The Silent Brush: Evaluating Artistic Style Leakage in AI Art Generation

MuteSwap: Visual-informed Silent Video Identity Conversion

A Parallel Ultra-Low Power Silent Speech Interface based on a Wearable, Fully-dry EMG Neckband

DESIL: Detecting Silent Bugs in MLIR Compiler Infrastructure

Pretraining Large Brain Language Model for Active BCI: Silent Speech

Silent Collapse in Recursive Learning Systems

A Silent Speech Decoding System from EEG and EMG with Heterogenous Electrode Configurations

Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models

DART: Distilling Autoregressive Reasoning to Silent Thought

A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition

Predicting the Silent Majority on Graphs: Knowledge Transferable Graph Neural Network

Taming Silent Failures: A Framework for Verifiable AI Reliability