Search - ResearchTracker

Code summarization has emerged as a fundamental technique in the field of program comprehension. While code language models have shown significant advancements, the current models and benchmarks are confined to high-readability code, which contains sufficient semantic cues such as function and variable names. In the real world, however, code is often poorly structured or obfuscated, significantly degrading model performance. In this paper, we first empirically evaluate the robustness of state-of-the-art language models on poor-readability code for the task of code summarization, focusing on (1) their effectiveness, (2) the impact of prompt engineering, and (3) the robustness of different variants. Experimental results reveal that state-of-the-art models, including GPT-4o and DeepSeek-V3, experience a substantial performance drop when faced with poorly readable code, and that prompt engineering and reasoning-enhanced models offer limited improvements. Motivated by these findings, we propose RoFTCodeSum, a novel fine-tuning method that enhances the robustness of code summarization against poorly readable code. RoFTCodeSum marries the concepts of curriculum learning and meta-learning: b

LoRACode: LoRA Adapters for Code Embeddings

arXiv, 2025-03-07. Authors: Saumya Chaturvedi, Aman Chadha, Laurent Bindschaedler

Code embeddings are essential for semantic code search; however, current approaches often struggle to capture the precise syntactic and contextual nuances inherent in code. Open-source models such as CodeBERT and UniXcoder exhibit limitations in scalability and efficiency, while high-performing proprietary systems impose substantial computational costs. We introduce a parameter-efficient fine-tuning method based on Low-Rank Adaptation (LoRA) to construct task-specific adapters for code retrieval. Our approach reduces the number of trainable parameters to less than two percent of the base model, enabling rapid fine-tuning on extensive code corpora (2 million samples in 25 minutes on two H100 GPUs). Experiments demonstrate an increase of up to 9.1% in Mean Reciprocal Rank (MRR) for Code2Code search, and up to 86.69% for Text2Code search tasks across multiple programming languages. Distinction in task-wise and language-wise adaptation helps explore the sensitivity of code retrieval for syntactical and linguistic variations. To foster research in this area, we make our code and pre-trained models publicly available.
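
The abstract above reports its gains in Mean Reciprocal Rank (MRR). As a reading aid, here is a minimal sketch of the standard MRR definition: the average, over queries, of 1/rank of the first relevant result. The function name, the toy identifiers, and the single-relevant-item assumption are all illustrative and are not taken from the LoRACode code release.

```python
from typing import Hashable, Sequence


def mean_reciprocal_rank(
    rankings: Sequence[Sequence[Hashable]],
    relevant: Sequence[Hashable],
) -> float:
    """Average of 1/rank of the relevant item for each query.

    Assumption (illustrative): each query has exactly one relevant item.
    A query whose relevant item is absent from its ranking contributes 0.
    """
    if not rankings:
        return 0.0
    total = 0.0
    for ranked, target in zip(rankings, relevant):
        for position, item in enumerate(ranked, start=1):
            if item == target:
                total += 1.0 / position
                break
    return total / len(rankings)


# Toy retrieval results: one ranked candidate list per query (hypothetical ids).
rankings = [["a", "b", "c"], ["x", "y", "z"], ["p", "q", "r"]]
relevant = ["a", "y", "missing"]

# Reciprocal ranks are 1, 1/2 and 0, so the mean is 0.5.
score = mean_reciprocal_rank(rankings, relevant)
print(score)
```

Because each query contributes at most 1, MRR rewards placing the correct snippet at the very top far more than moving it from, say, rank 10 to rank 5.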

Search results: Code

Readability-Robust Code Summarization via Meta Curriculum Learning

LoRACode: LoRA Adapters for Code Embeddings

Code vs Serialized AST Inputs for LLM-Based Code Summarization: An Empirical Study

PrivCode: When Code Generation Meets Differential Privacy

Automorphism Ensemble Decoding of Quantum LDPC Codes

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Robust Learning of Diverse Code Edits

Quantum Tanner codes

On Decoding High-Order Interleaved Sum-Rank-Metric Codes

Transversal dimension jump for product qLDPC codes

Build Code is Still Code: Finding the Antidote for Pipeline Poisoning

Case2Code: Scalable Synthetic Data for Code Generation

Thinking with Spatial Code for Physical-World Video Reasoning

DeepCode: Open Agentic Coding

Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text

Storage and Retrieval Codes in PIR Schemes with Colluding Servers

Your Code Agent Can Grow Alongside You with Structured Memory

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation

Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective