搜索 — ResearchTracker

The Linux kernel is a critical system, serving as the foundation for numerous systems. Bugs in the Linux kernel can cause serious consequences, affecting billions of users. Fault localization (FL), which aims at identifying the buggy code elements in software, plays an essential role in software quality assurance. While recent LLM agents have achieved promising accuracy in FL on recent benchmarks like SWE-bench, it remains unclear how well these methods perform in the Linux kernel, where FL is much more challenging due to the large-scale code base, limited observability, and diverse impact factors. In this paper, we introduce LinuxFLBench, a FL benchmark constructed from real-world Linux kernel bugs. We conduct an empirical study to assess the performance of state-of-the-art LLM agents on the Linux kernel. Our initial results reveal that existing agents struggle with this task, achieving a best top-1 accuracy of only 41.6% at file level. To address this challenge, we propose LinuxFL$^+$, an enhancement framework designed to improve FL effectiveness of LLM agents for the Linux kernel. LinuxFL$^+$ substantially improves the FL accuracy of all studied agents (e.g., 7.2% - 11.2% accura

Patch-to-PoC: A Systematic Study of Agentic LLM Systems for Linux Kernel N-Day Reproduction

arXiv2026-02-07作者：Juefei Pu, Xingyu Li, Zhengchuan Liang

Autonomous large language model (LLM) based systems have recently shown promising results across a range of cybersecurity tasks. However, there is no systematic study on their effectiveness in autonomously reproducing Linux kernel vulnerabilities with concrete proofs-of-concept (PoCs). Owing to the size, complexity, and low-level nature of the Linux kernel, such tasks are widely regarded as particularly challenging for current LLM-based approaches. In this paper, we present the first large-scale study of LLM-based Linux kernel vulnerability reproduction. For this purpose, we develop K-Repro, an LLM-based agentic system equipped with controlled code-browsing, virtual machine management, interaction, and debugging capabilities. Using kernel security patches as input, K-Repro automates end-to-end bug reproduction of N-day vulnerabilities in the Linux kernel. On a dataset of 100 real-world exploitable Linux kernel vulnerabilities collected from KernelCTF, our results show that K-Repro can generate PoCs that reproduce over 50\% of the cases with practical time and monetary cost. Beyond aggregate success rates, we perform an extensive study of effectiveness, efficiency, stability, and im

搜索结果：Linux

Benchmarking and Enhancing LLM Agents in Localizing Linux Kernel Bugs

Patch-to-PoC: A Systematic Study of Agentic LLM Systems for Linux Kernel N-Day Reproduction

Machine Learning (ML) library in Linux kernel

An Investigation of Patch Porting Practices of the Linux Kernel Ecosystem

Linux for Everyone: Can Standardization Drive Mainstream Adoption?

When Radiation Meets Linux: Analyzing Soft Errors in Linux on COTS SoCs under Proton Irradiation

Characteristics, Root Causes, and Detection of Incomplete Security Bug Fixes in the Linux Kernel

A Study of Malware Prevention in Linux Distributions

A Security Analysis of CheriBSD and Morello Linux

CrashFixer: A crash resolution agent for the Linux kernel

Ransomware: Analysis and Evaluation of Live Forensic Techniques and the Impact on Linux based IoT Systems

A First Look at Package-to-Group Mechanism: An Empirical Study of the Linux Distributions

KGym: A Platform and Dataset to Benchmark Large Language Models on Linux Kernel Crash Resolution

Got Root? A Linux Priv-Esc Benchmark

Comparing Security and Efficiency of WebAssembly and Linux Containers in Kubernetes Cloud Computing

Joint Time-and Event-Triggered Scheduling in the Linux Kernel

Fuzzing the Latest NTFS in Linux with Papora: An Empirical Study

eBPF-mm: Userspace-guided memory management in Linux with eBPF

The Sense of Logging in the Linux Kernel

The eBPF Runtime in the Linux Kernel