Search results: smaller

20 results found

How Software Engineering Research Overlooks Local Industry: A Smaller Economy Perspective

arXiv · 2026-01-28 · Authors: Klara Borowa, Andrzej Zalewski, Lech Madeyski

The software engineering researchers from countries with smaller economies, particularly non-English speaking ones, represent valuable minorities within the software engineering community. As researchers from Poland, we represent such a country. We analyzed the ICSE FOSE (Future of Software Engineering) community survey through reflexive thematic analysis to show our viewpoint on key software community issues. We believe that the main problem is the growing research-industry gap, which particularly impacts smaller communities and small local companies. Based on this analysis and our experiences, we present a set of recommendations for improvements that would enhance software engineering research and industrial collaborations in smaller economies.

Enhancing Generalization in Chain of Thought Reasoning for Smaller Models

arXiv · 2025-01-16 · Authors: Maxwell J. Yin, Dingyi Jiang, Yongbing Chen

Chain-of-Thought (CoT) reasoning in smaller language models is a challenging natural language processing problem, yet highly desirable in many real-life applications. Existing CoT knowledge distillation methods often suffer from overly conservative memorization in smaller LLMs, leading to low generalization confidence. As fully preserving the CoT ability of the teacher model is impossible, we hypothesize that adversarial CoT fine-tuning is crucial for developing smaller LLMs with robust CoT generalization. To this end, we propose PRompt-Assisted Domain-Adversarial fine-tuning (PRADA), a principled fine-tuning framework that integrates diverse CoT domains. Specifically, PRADA pioneers two CoT improvements in smaller LLMs: (1) recovering the domain-invariant feature insight that is typically lost during distillation, using domain-adversarial fine-tuning; (2) enhancing the domain adaptability of CoT prompt engineering by employing domain-adversarial approaches. We theoretically demonstrate the effectiveness of our approach and empirically show that it significantly outperforms the state of the art in a wide range of tasks. Moreover, our empirical findings reveal that the smaller LLM, when

How Software Engineering Research Overlooks Local Industry: A Smaller Economy Perspective

Enhancing Generalization in Chain of Thought Reasoning for Smaller Models

GOLFer: Smaller LM-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information Retrieval

Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet

DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers

Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models

Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs

On Importance of Layer Pruning for Smaller BERT Models and Low Resource Languages

Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs

Fine-tuning Smaller Language Models for Question Answering over Financial Documents

Can Smaller Large Language Models Evaluate Research Quality?

Rejection-Sampled Universal Quantization for Smaller Quantization Errors

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Smaller Language Models Are Better Instruction Evolvers

Mixed Distillation Helps Smaller Language Model Better Reasoning

Teaching Smaller Language Models To Generalise To Unseen Compositional Questions

Turning block-sequential automata networks into smaller parallel networks with isomorphic limit dynamics

Specializing Smaller Language Models towards Multi-Step Reasoning

Late-time data require smaller sound horizon at recombination

Blar-SQL: Faster, Stronger, Smaller NL2SQL
