搜索结果：hundreds

共找到 20 条结果

高级筛选 ▾

Evaluating Collective Behaviour of Hundreds of LLM Agents

arXiv2026-02-18作者：Richard Willis, Jianing Zhao, Yali Du

As autonomous agents powered by LLM are increasingly deployed in society, understanding their collective behaviour in social dilemmas becomes critical. We introduce an evaluation framework where LLMs generate strategies encoded as algorithms, enabling inspection prior to deployment and scaling to populations of hundreds of agents -- substantially larger than in previous work. We find that more recent models tend to produce worse societal outcomes compared to older models when agents prioritise individual gain over collective benefits. Using cultural evolution to model user selection of agents, our simulations reveal a significant risk of convergence to poor societal equilibria, particularly when the relative benefit of cooperation diminishes and population sizes increase. We release our code as an evaluation suite for developers to assess the emergent collective behaviour of their models.

Hundreds of TESS exoplanets might be larger than we thought

arXiv2025-06-24作者：Te Han, Paul Robertson, Timothy D. Brandt

The radius of a planet is a fundamental parameter that probes its composition and habitability. Precise radius measurements are typically derived from the fraction of starlight blocked when a planet transits its host star. The wide-field Transiting Exoplanet Survey Satellite (TESS) has discovered hundreds of new exoplanets, but its low angular resolution means that the light from a star hosting a transiting exoplanet can be blended with the light from background stars. If not fully corrected, this extra light can dilute the transit signal and result in a smaller measured planet radius. In a study of hundreds of TESS planet discoveries using deblended light curves from our validated methodology, we show that systematically incorrect planet radii are common in the literature: studies using various public TESS photometry pipelines have underestimated the planet radius by a weighted median of $6.1\% \pm 0.3\%$, leading to a $\sim20\%$ overestimation of planet density. The widespread presence of these biases in the literature has profoundly shaped-and potentially misrepresented-our understanding of the exoplanet population. Addressing these biases will refine the exoplanet mass-radius r

搜索结果：hundreds

Evaluating Collective Behaviour of Hundreds of LLM Agents

Hundreds of TESS exoplanets might be larger than we thought

ArtifactLens: Hundreds of Labels Are Enough for Artifact Detection with VLMs

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

Massive Memorization with Hundreds of Trillions of Parameters for Sequential Transducer Generative Recommenders

Obelia: Scaling DAG-Based Blockchains to Hundreds of Validators

A mathematical language for linking fine-scale structure in spikes from hundreds to thousands of neurons with behaviour

Estimating Channels With Hundreds of Sub-Paths for MU-MIMO Uplink: A Structured High-Rank Tensor Approach

How to Build a Quantum Supercomputer: Scaling from Hundreds to Millions of Qubits

Strategies for running the QAOA at hundreds of qubits

Multiple overspill flood channels from young craters require surface melting and hundreds of meters of mid-latitude ice late in Mars history

Evolution of the electron distribution function during gas ionization by a sub-nanosecond microwave pulse of hundreds MW power

Crowd3D: Towards Hundreds of People Reconstruction from a Single Image

Camera-Based Remote Physiology Sensing for Hundreds of Subjects Across Skin Tones

A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation

Optimized detector tomography for photon-number resolving detectors with hundreds of pixels

XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters

A Site-Resolved 2D Quantum Simulator with Hundreds of Trapped Ions

GRAVITY+ Wide: Towards hundreds of z $\sim$ 2 AGN

From Images to Dark Matter: End-To-End Inference of Substructure From Hundreds of Strong Gravitational Lenses