搜索 — ResearchTracker

Training large language models (LLMs) on Python execution traces grounds them in code execution and enables the line-by-line execution prediction of whole Python programs, effectively turning them into neural interpreters (FAIR CodeGen Team et al., 2025). However, developers rarely execute programs step by step; instead, they use debuggers to stop execution at certain breakpoints and step through relevant portions only while inspecting or modifying program variables. Existing neural interpreter approaches lack such interactive control. To address this limitation, we introduce neural debuggers: language models that emulate traditional debuggers, supporting operations such as stepping into, over, or out of functions, as well as setting breakpoints at specific source lines. We show that neural debuggers -- obtained via fine-tuning large LLMs or pre-training smaller models from scratch -- can reliably model both forward execution (predicting future states and outputs) and inverse execution (inferring prior states or inputs) conditioned on debugger actions. Evaluated on CruxEval, our models achieve strong performance on both output and input prediction tasks, demonstrating robust condit

A Novel Interactive-Guided Differential Testing Approach for FPGA Simulation Debugger Tools

arXiv2025-03-03作者：Shikai Guo, Xiaoyu Wang, Xiaochen Li

Field-Programmable Gate Array (FPGA) development tool chains are widely used in FPGA design, simulation, and verification in critical areas like communications, automotive electronics, and aerospace. Commercial FPGA tool chains such as Xilinx' Vivado aids developers in swiftly identifying and rectifying bugs and issues in FPGA designs through a robust built-in debugger, ensuring the correctness and development efficiency of the FPGA design. Hardening such FPGA chip debugger tools by testing is crucial since engineers might misinterpret code and introduce incorrect fixes, leading to security risks. However, FPGA chip debugger tools are challenging to test as they require assessing both RTL designs and a series of debugging actions, including setting breakpoints and stepping through the code. To address this issue, we propose a interactive differential testing approach called DB-Hunter to detect bugs in Vivado's FPGA chip debugger tools. Specifically, DB-Hunter consists of three components: RTL design transformation component, debug action transformation component, and interactive differential testing component. By performing RTL design and debug action transformations, DB-Hunter gen

搜索结果：debugger

Towards a Neural Debugger for Python

A Novel Interactive-Guided Differential Testing Approach for FPGA Simulation Debugger Tools

The Visual Debugger: Past, Present, and Future

Designing for Novice Debuggers: A Pilot Study on an AI-Assisted Debugging Tool

Poster: libdebug, Build Your Own Debugger for a Better (Hello) World

InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration

LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models

An Interactive Debugger for Rust Trait Errors

RGD: Multi-LLM Based Agent Debugger via Refinement and Generation Guidance

MIP-DD: A Delta Debugger for Mixed Integer Programming Solvers

Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step

GoTcha: An Interactive Debugger for GoT-Based Distributed Systems

Chrowned by an Extension: Abusing the Chrome DevTools Protocol through the Debugger API

The Chameleon Type Debugger (Tool Demonstration)

The Visual Debugger Tool

How Generation Architecture Shapes Code Complexity in Multi-Agent LLM Systems: A Paired Study on HumanEval

Remote Concolic Multiverse Debugging -- Extended Version with Additional Appendices

Moldable Exceptions

Debugging Functional Programs by Interpretation

MIO: Multiverse Debugging in the Face of Input/Output -- Extended Version with Additional Appendices