搜索 — ResearchTracker

With Large Language Models (LLMs) rapidly approaching and potentially surpassing human-level performance, it has become imperative to develop approaches capable of effectively supervising and enhancing these powerful models using smaller, human-level models exposed to only human-level data. We address this critical weak-to-strong (W2S) generalization challenge by proposing a novel method aimed at improving weak experts, by training on the same limited human-level data, enabling them to generalize to complex, super-human-level tasks. Our approach, called **EnsemW2S**, employs a token-level ensemble strategy that iteratively combines multiple weak experts, systematically addressing the shortcomings identified in preceding iterations. By continuously refining these weak models, we significantly enhance their collective ability to supervise stronger student models. We extensively evaluate the generalization performance of both the ensemble of weak experts and the subsequent strong student model across in-distribution (ID) and out-of-distribution (OOD) datasets. For OOD, we specifically introduce question difficulty as an additional dimension for defining distributional shifts. Our empi

Uniform-in-time weak propagation of chaos for consensus-based optimization

arXiv2025-02-01作者：Erhan Bayraktar, Ibrahim Ekren, Hongyi Zhou

We study the uniform-in-time weak propagation of chaos for the consensus-based optimization (CBO) method on a bounded searching domain. We apply the methodology for studying long-time behaviors of interacting particle systems developed in the work of Delarue and Tse (ArXiv:2104.14973). Our work shows that the weak error has order $O(N^{-1})$ uniformly in time, where $N$ denotes the number of particles. The main strategy behind the proofs are the decomposition of the weak errors using the linearized Fokker-Planck equations and the exponential decay of their Sobolev norms. Consequently, our result leads to the joint convergence of the empirical distribution of the CBO particle system to the Dirac-delta distribution at the global minimizer in population size and running time in Wasserstein-type metrics.

搜索结果：weak

EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles

Uniform-in-time weak propagation of chaos for consensus-based optimization

Weak approximation of kinetic SDEs: closing the criticality gap

Spatio-Temporal Weak Measurement of Chiral Ultra short Laser Pulse

What do quantum "weak" measurements actually measure?

Weak type $(2,H)$ and weak cotype $(2,H)$ of operator spaces

Weak Proregularity, Weak Stability, and the Noncommutative MGM Equivalence

Neural style transfer of weak lensing mass maps

Metacirculants and split weak metacirculants

Improving Weak PINNs for Hyperbolic Conservation Laws: Dual Norm Computation, Boundary Conditions and Systems

Suppression of one-dimensional weak localization by band asymmetry

Solving stochastic weak Minty variational inequalities without increasing batch size

Direct observation of temporal coherence by weak projective measurements of photon arrival time

Full counting statistics of weak measurement

Differential operators for harmonic weak Maass forms and the vanishing of Hecke eigenvalues

Chiral Symmetry and Weak Decay of Hypernuclei

Weak bimonads and weak Hopf monads

Weak Gravitational Lensing

Scatter and bias in weak lensing selected clusters

How weak values emerge in joint measurements on cloned quantum systems