搜索 — ResearchTracker

Reinforcement learning (RL) has emerged as a key paradigm for aligning and optimizing large language models (LLMs). Standard approaches treat the LLM as the policy and apply RL directly over the full vocabulary space. However, this formulation includes the massive tail of contextually irrelevant tokens in the action space, which could distract the policy from focusing on decision-making among the truly reasonable tokens. In this work, we verify that valid reasoning paths could inherently concentrate within a low-rank subspace. Based on this insight, we introduce Reinforcement Learning with Promising Tokens (RLPT), a framework that mitigates the action space issue by decoupling strategic decision-making from token generation. Specifically, RLPT leverages the semantic priors of the base model to identify a dynamic set of promising tokens and constrains policy optimization exclusively to this refined subset via masking. Theoretical analysis and empirical results demonstrate that RLPT effectively reduces gradient variance, stabilizes the training process, and improves sample efficiency. Experiment results on math, coding, and telecom reasoning show that RLPT outperforms standard RL bas

Sustainability and Artificial Intelligence: Necessary, Challenging, and Promising Intersections

arXiv2026-06-08作者：Han-Teng Liao, Zijia Wang

Both digital economy and digital technology researchers increasingly recognize the need to better address the role that artificial intelligence (AI) plays in shaping the evolution of the environmental, social and governance aspects of development. It appears that sustainability and AI research converge on the features of wicked problems that are complex, interconnected and dynamic. Building off such convergence, this article aims to map out the necessary, challenging, and promising intersections by providing an overview of the state of art research. Based on 541 bibliographic data collected from the Web of Science (WoS) database, the findings reveal the increasingly central body of work on green and sustainable science and technology in bridging various disciplines, main journals and key topics and concepts. The findings reveal how such interactions can be necessary, challenging, and promising. The article concludes with few general arguments regarding how to diversify and expand the community of practice regarding AI for sustainable development, especially in the areas of expected AI application areas and institutions.

搜索结果：Promising

Reinforcement Learning with Promising Tokens for Large Language Models

Sustainability and Artificial Intelligence: Necessary, Challenging, and Promising Intersections

BaCd2P2: a promising impurity-tolerant counterpart of GaAs for photovoltaics

PromiseTune: Unveiling Causally Promising and Explainable Configuration Tuning

Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought

The promising potential of vision language models for the generation of textual weather forecasts

LLM Social Simulations Are a Promising Research Method

RIS-Empowered LEO Satellite Networks for 6G: Promising Usage Scenarios and Future Directions

Enhance Connectivity of Promising Regions for Sampling-based Path Planning

Spectral distortions from promising single and multifield inflationary models

Organic Electronics in Biosensing: A Promising Frontier for Medical and Environmental Applications

Theoretical investigation of delafossite-Cu2ZnSnO4 as a promising photovoltaic absorber

Unmanned Aerial Vehicle Swarm-Enabled Edge Computing: Potentials, Promising Technologies, and Challenges

Consensusless Blockchain: A Promising High-Performance Blockchain without Consensus

DNA Storage: A Promising Large Scale Archival Storage?

Screening Promising Thermoelectric Materials in Binary Chalcogenides through High-Throughput Computations

Singlet Fission Photovoltaics: Progress and Promising Pathways

Massive MIMO for Cellular-Connected UAV: Challenges and Promising Solutions

Cheap Talk, Empty Promise: Frontier LLMs easily break public promises for self-interest

On the Usefulness of Promises