搜索 — ResearchTracker

In financial predictions, the performance of machine learning models is often assessed by Rank IC, which is the Spearman rank correlation between the model predictions and the realized asset returns. Despite its wide adoption, most existing models are trained using regression losses or ranking objectives that may not align with Rank IC. We propose LambdaRankIC, a novel learning-to-rank approach that directly optimizes Rank IC. We circumvent the non-differentiability of the ranking operator by deriving the closed-form expression for the lambda gradients induced by the pairwise rank swaps, which enables efficient gradient-based optimization within the LambdaRank framework. We implement LambdaRankIC as a custom objective in XGBoost. Theoretically, we show that our approach optimizes an upper bound on Rank IC. We evaluate the proposed approach on both simulated and real-world financial data. In simulation studies, LambdaRankIC accurately recovers the true ranking structure in noiseless settings and consistently outperforms regression-based and NDCG-oriented ranking methods under low signal-to-noise ratios and heavy-tailed noise regimes. In empirical experiments using real market data,

Directly Forecasting Belief for Reinforcement Learning with Delays

arXiv2025-05-01作者：Qingyuan Wu, Yuhui Wang, Simon Sinong Zhan

Reinforcement learning (RL) with delays is challenging as sensory perceptions lag behind the actual events: the RL agent needs to estimate the real state of its environment based on past observations. State-of-the-art (SOTA) methods typically employ recursive, step-by-step forecasting of states. This can cause the accumulation of compounding errors. To tackle this problem, our novel belief estimation method, named Directly Forecasting Belief Transformer (DFBT), directly forecasts states from observations without incrementally estimating intermediate states step-by-step. We theoretically demonstrate that DFBT greatly reduces compounding errors of existing recursively forecasting methods, yielding stronger performance guarantees. In experiments with D4RL offline datasets, DFBT reduces compounding errors with remarkable prediction accuracy. DFBT's capability to forecast state sequences also facilitates multi-step bootstrapping, thus greatly improving learning efficiency. On the MuJoCo benchmark, our DFBT-based method substantially outperforms SOTA baselines. Code is available at https://github.com/QingyuanWuNothing/DFBT.

搜索结果：directly

LambdaRankIC: Directly Optimizing Rank IC for Financial Prediction

Directly Forecasting Belief for Reinforcement Learning with Delays

Event2Vec: Processing Neuromorphic Events Directly by Representations in Vector Space

Tackling the Zero-Shot Reinforcement Learning Loss Directly

ELemental abundances of Planets and brown dwarfs Imaged around Stars (ELPIS): II. The Jupiter-like Inhomogeneous Atmosphere of the First Directly Imaged Planetary-Mass Companion 2MASS 1207 b

Directly observing relativistic Bohmian mechanics

On Some Hereditary and Super Classes of Directly Finite Abelian Groups

Directly Optimizing for Synthesizability in Generative Molecular Design using Retrosynthesis Models

Deep Directly-Trained Spiking Neural Networks for Object Detection

Directly Denoising Diffusion Models

Directly Estimating Mixed-State Entanglement with Bell Measurement Assistance

Directly Attention Loss Adjusted Prioritized Experience Replay

Generate Coherent Rays Directly

Quantifying Mn diffusion through transferred versus directly-grown graphene barriers

Directly accessible entangling gates for capacitively coupled singlet-triplet qubits

Can the Existence of Dark Energy Be Directly Detected?

Detecting Exomoons Via Doppler Monitoring of Directly Imaged Exoplanets

On the method of directly defining inverse mapping for nonlinear differential equations

Directly finite algebras of pseudofunctions on locally compact groups

Learning 4DVAR inversion directly from observations