搜索 — ResearchTracker

Code summarization has emerged as a fundamental technique in the field of program comprehension. While code language models have shown significant advancements, the current models and benchmarks are confined to high-readability code, which contains sufficient semantic cues such as function and variable names. In the real world, however, code is often poorly structured or obfuscated, significantly degrading model performance. In this paper, we first empirically evaluate the robustness of state-of-the-art language models on poor-readability code for the task of code summarization, focusing on (1) their effectiveness, (2) the impact of prompt engineering, and (3) the robustness of different variants. Experimental results reveal that state-of-the-art models-including GPT-4o and DeepSeek-V3 experience a substantial performance drop when faced with poorly readable code, and that prompt engineering and reasoning-enhanced models offer limited improvements. Motivated by these findings, we propose RoFTCodeSum, a novel fine-tuning method that enhances the robustness of code summarization against poorly readable code. RoFTCodeSum marries the concepts of curriculum learning and meta-learning: b

Code vs Serialized AST Inputs for LLM-Based Code Summarization: An Empirical Study

arXiv2026-02-06作者：Shijia Dong, Haoruo Zhao, Paul Harvey

Summarizing source code into natural language descriptions (code summarization) helps developers better understand program functionality and reduce the burden of software maintenance. Abstract Syntax Trees (ASTs), as opposed to source code, have been shown to improve summarization quality in traditional encoder-decoder-based code summarization models. However, most large language model (LLM)-based code summarization methods rely on raw code or only incorporate partial AST signals, meaning that the potential of complete AST representation has not been fully explored for LLMs. This paper presents AST(NIT), an AST augmentation and serialization method that preserves lexical details and encodes structural information into LLM-compatible sequences. Experiments with the LLaMA-3.1-8B model on the CodeXGLUE Python dataset show that the proposed serialized ASTs reduce the length of LLM inputs, require shorter training times, and achieve summarization quality comparable to existing approaches.

搜索结果：code

Readability-Robust Code Summarization via Meta Curriculum Learning

Code vs Serialized AST Inputs for LLM-Based Code Summarization: An Empirical Study

LoRACode: LoRA Adapters for Code Embeddings

Robust Learning of Diverse Code Edits

Automorphism Ensemble Decoding of Quantum LDPC Codes

Quantum Tanner codes

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

PrivCode: When Code Generation Meets Differential Privacy

Infinite families of constacyclic codes supporting 3-designs and their applications in coding theory

Case2Code: Scalable Synthetic Data for Code Generation

DeepCode: Open Agentic Coding

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Code as Agent Harness

Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective

Your Code Agent Can Grow Alongside You with Structured Memory

Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text

On Decoding High-Order Interleaved Sum-Rank-Metric Codes

Storage and Retrieval Codes in PIR Schemes with Colluding Servers

DeCoMa: Detecting and Purifying Code Dataset Watermarks through Dual Channel Code Abstraction

Thinking with Spatial Code for Physical-World Video Reasoning