Search - ResearchTracker

Large Reasoning Models (LRMs) achieve impressive performance on complex reasoning tasks via Chain-of-Thought (CoT) reasoning, which enables them to generate intermediate thinking tokens before arriving at the final answer. However, LRMs often suffer from significant overthinking, spending excessive compute time even after the answer is generated early on. Prior work has identified the existence of an optimal reasoning length such that truncating reasoning at this point significantly shortens CoT outputs with virtually no change in performance. However, determining optimal CoT lengths for practical datasets is highly non-trivial as they are fully task and model-dependent. In this paper, we precisely address this and design TERMINATOR, an early-exit strategy for LRMs at inference to mitigate overthinking. The central idea underpinning TERMINATOR is that the first arrival of an LRM's final answer is often predictable, and we leverage these first answer positions to create a novel dataset of optimal reasoning lengths to train TERMINATOR. Powered by this approach, TERMINATOR achieves significant reductions in CoT lengths of 14%-55% on average across four challenging practical datasets:

Inhomogeneous terminators on the exoplanet WASP-39 b

arXiv, 2024-07-14. Authors: Néstor Espinoza, Maria E. Steinrueck, James Kirk

Transmission spectroscopy has been a workhorse technique over the past two decades to constrain the physical and chemical properties of exoplanet atmospheres. One of its classical key assumptions is that the portion of the atmosphere it probes -- the terminator region -- is homogeneous. Several works in the past decade, however, have put this into question for highly irradiated, hot ($T_{eq}\gtrsim 1000$ K) gas giant exoplanets both empirically and via 3-dimensional modelling. While models predict clear differences between the evening (day-to-night) and morning (night-to-day) terminators, direct morning/evening transmission spectra in a wide wavelength range have not been reported for an exoplanet to date. Under the assumption of precise and accurate orbital parameters on WASP-39 b, here we report the detection of inhomogeneous terminators on the exoplanet WASP-39 b, which allows us to retrieve its morning and evening transmission spectra in the near-infrared ($2-5\ \mu$m) using JWST. We observe larger transit depths in the evening which are, on average, $405 \pm 88$ ppm larger than the morning ones, also having qualitatively larger features than the morning spectrum. The spectra are

Search results: terminator

TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning

Inhomogeneous terminators on the exoplanet WASP-39 b

"Trust me on this" Explaining Agent Behavior to a Human Terminator

Atmospheric waves disturbances from the solar terminator according to the VLF radio stations data

Timing Terminators: Forecasting Sunspot Cycle 25 Onset

Terminator Habitability: the Case for Limited Water Availability on M-dwarf Planets

Deciphering Solar Magnetic Activity: The (Solar) Hale Cycle Terminator of 2021

TERMinator: A system for scientific texts processing

TERMinator: A Neural Framework for Structure-Based Protein Design using Tertiary Repeating Motifs

On Atmospheric Retrievals of Exoplanets with Inhomogeneous Terminators

Reinforcement Learning with a Terminator

Folding Kinetics of Riboswitch Transcriptional Terminators and Sequesterers

Modified SIS model applied to a Zombie apocalypse with terminators

Why does the Moon's terminator not appear orthogonal to the direction of the Sun?

Neutral Solar Wind Generated by Lunar Exospheric Dust at the Terminator

Endless Terminals: Scaling RL Environments for Terminal Agents

Transformers for Program Termination

Terminal Lucidity: Envisioning the Future of the Terminal

On 3-terminal positions in Hex

A framework for joint assessment of a terminal event and a score existing only in the absence of the terminal event