ResearchTracker科研与行业发展动态追踪平台

搜索结果：Code

共找到 20 条结果

来源：全部

Readability-Robust Code Summarization via Meta Curriculum Learning

arXiv2026-01-09作者：Wenhao Zeng, Yitian Chai, Hao Zhou

Code summarization has emerged as a fundamental technique in the field of program comprehension. While code language models have shown significant advancements, the current models and benchmarks are confined to high-readability code, which contains sufficient semantic cues such as function and variable names. In the real world, however, code is often poorly structured or obfuscated, significantly degrading model performance. In this paper, we first empirically evaluate the robustness of state-of-the-art language models on poor-readability code for the task of code summarization, focusing on (1) their effectiveness, (2) the impact of prompt engineering, and (3) the robustness of different variants. Experimental results reveal that state-of-the-art models-including GPT-4o and DeepSeek-V3 experience a substantial performance drop when faced with poorly readable code, and that prompt engineering and reasoning-enhanced models offer limited improvements. Motivated by these findings, we propose RoFTCodeSum, a novel fine-tuning method that enhances the robustness of code summarization against poorly readable code. RoFTCodeSum marries the concepts of curriculum learning and meta-learning: b

查看原文 ↗

Code vs Serialized AST Inputs for LLM-Based Code Summarization: An Empirical Study

arXiv2026-02-06作者：Shijia Dong, Haoruo Zhao, Paul Harvey

Summarizing source code into natural language descriptions (code summarization) helps developers better understand program functionality and reduce the burden of software maintenance. Abstract Syntax Trees (ASTs), as opposed to source code, have been shown to improve summarization quality in traditional encoder-decoder-based code summarization models. However, most large language model (LLM)-based code summarization methods rely on raw code or only incorporate partial AST signals, meaning that the potential of complete AST representation has not been fully explored for LLMs. This paper presents AST(NIT), an AST augmentation and serialization method that preserves lexical details and encodes structural information into LLM-compatible sequences. Experiments with the LLaMA-3.1-8B model on the CodeXGLUE Python dataset show that the proposed serialized ASTs reduce the length of LLM inputs, require shorter training times, and achieve summarization quality comparable to existing approaches.

LoRACode: LoRA Adapters for Code Embeddings

arXiv2025-03-07作者：Saumya Chaturvedi, Aman Chadha, Laurent Bindschaedler

Automorphism Ensemble Decoding of Quantum LDPC Codes

arXiv2025-03-03作者：Stergios Koutsioumpas, Hasan Sayginel, Mark Webster

We introduce AutDEC, a fast and accurate decoder for quantum error-correcting codes with large automorphism groups. Our decoder employs a set of automorphisms of the quantum code and an ensemble of belief propagation (BP) decoders. Each BP decoder is given a syndrome which is transformed by one of the automorphisms, and is run in parallel. For quantum codes, the accuracy of BP decoders is limited because short cycles occur in the Tanner graph and our approach mitigates this effect. We demonstrate decoding accuracy comparable to BP-OSD-0 with a lower time overhead for Quantum Reed-Muller (QRM) codes in the code capacity setting, and Bivariate Bicycle (BB) codes under circuit level noise. We provide a Python repository for use by the community and the results of our simulations.

Robust Learning of Diverse Code Edits

arXiv

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

arXiv

Build Code is Still Code: Finding the Antidote for Pipeline Poisoning

arXiv

Transversal dimension jump for product qLDPC codes

arXiv

PrivCode: When Code Generation Meets Differential Privacy

arXiv

Quantum Tanner codes

arXiv

On Decoding High-Order Interleaved Sum-Rank-Metric Codes

arXiv

DeepCode: Open Agentic Coding

arXiv2025-12-08

Thinking with Spatial Code for Physical-World Video Reasoning

arXiv2026-03-05作者：Jieneng Chen, Wenxin Ma, Ruisheng Yuan

We introduce Thinking with Spatial Code, a framework that transforms RGB video into explicit, temporally coherent 3D representations for physical-world visual question answering. We highlight the empirical finding that our proposed spatial encoder can parse videos into structured spatial code with explicit 3D oriented bounding boxes and semantic labels, enabling large language models (LLMs) to reason directly over explicit spatial variables. Specifically, we propose the spatial encoder that encodes image and geometric features by unifying 6D object parsing and tracking backbones with geometric prediction, and we further finetuning LLMs with reinforcement learning using a spatial rubric reward that encourages perspective-aware, geometrically grounded inference. As a result, our model outperforms proprietary vision-language models on VSI-Bench, setting a new state-of-the-art. Code is available at https://github.com/Beckschen/spatialcode.

Case2Code: Scalable Synthetic Data for Code Generation

arXiv

DeCoMa: Detecting and Purifying Code Dataset Watermarks through Dual Channel Code Abstraction

arXiv

Your Code Agent Can Grow Alongside You with Structured Memory

arXiv

Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective

arXiv

Storage and Retrieval Codes in PIR Schemes with Colluding Servers

arXiv

Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text

arXiv

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

arXiv2023-06-14作者：Ziyang Luo, Can Xu, Pu Zhao

Code Large Language Models (Code LLMs), such as StarCoder, have demonstrated exceptional performance in code-related tasks. However, most existing models are solely pre-trained on extensive raw code data without instruction fine-tuning. In this paper, we introduce WizardCoder, which empowers Code LLMs with complex instruction fine-tuning, by adapting the Evol-Instruct method to the domain of code. Through comprehensive experiments on four prominent code generation benchmarks, namely HumanEval, HumanEval+, MBPP, and DS-1000, we unveil the exceptional capabilities of our model. It surpasses all other open-source Code LLMs by a substantial margin. Moreover, our model even outperforms the largest closed LLMs, Anthropic's Claude and Google's Bard, on HumanEval and HumanEval+. Our code, model weights, and data are public at https://github.com/nlpxucan/WizardLM