搜索 — ResearchTracker

The recent advancement of Large Language Models (LLMs) offers new opportunities to generate user preference data to warm-start bandits. Recent studies on contextual bandits with LLM initialization (CBLI) have shown that these synthetic priors can significantly lower early regret. However, these findings assume that LLM-generated choices are reasonably aligned with actual user preferences. In this paper, we systematically examine how LLM-generated preferences perform when random and label-flipping noise is injected into the synthetic training data. For aligned domains, we find that warm-starting remains effective up to 30% corruption, loses its advantage around 40%, and degrades performance beyond 50%. When there is systematic misalignment, even without added noise, LLM-generated priors can lead to higher regret than a cold-start bandit. To explain these behaviors, we develop a theoretical analysis that decomposes the effect of random label noise and systematic misalignment on the prior error driving the bandit's regret, and derive a sufficient condition under which LLM-based warm starts are provably better than a cold-start bandit. We validate these results across multiple conjoint

HydraServe: Minimizing Cold Start Latency for Serverless LLM Serving in Public Clouds

arXiv2025-02-21作者：Chiheng Lou, Sheng Qi, Chao Jin

With the proliferation of large language model (LLM) variants, developers are turning to serverless computing for cost-efficient LLM deployment. However, public cloud providers often struggle to provide performance guarantees for serverless LLM serving due to significant cold start latency caused by substantial model sizes and complex runtime dependencies. To address this problem, we present HydraServe, a serverless LLM serving system designed to minimize cold start latency in public clouds. HydraServe proactively distributes models across servers to quickly fetch them, and overlaps cold-start stages within workers to reduce startup latency. Additionally, HydraServe strategically places workers across GPUs to avoid network contention among cold-start instances. To minimize resource consumption during cold starts, HydraServe further introduces pipeline consolidation that can merge groups of workers into individual serving endpoints. Our comprehensive evaluations under diverse settings demonstrate that HydraServe reduces the cold start latency by 1.7$\times$-- 4.7$\times$ and improves service level objective attainment by 1.43$\times$--1.74$\times$ compared to baselines.

搜索结果：Start

Jump Start or False Start? A Theoretical and Empirical Evaluation of LLM-initialized Bandits

HydraServe: Minimizing Cold Start Latency for Serverless LLM Serving in Public Clouds

Algorithmic warm starts for Hamiltonian Monte Carlo

Serverless Cold Starts and Where to Find Them

Starting vortex strength in an impulsively started airfoil

SSthreshless Start: A Sender-Side TCP Intelligence for Long Fat Network

Optimizing Start Locations in Ergodic Search for Disaster Response

Warm Starts Accelerate Conditional Diffusion

A Black Start Strategy for Hydrogen-integrated Renewable Grids with Energy Storage Systems

Transformer-Based Model for Cold Start Mitigation in FaaS Architecture

Diagnosing LLM-based Rerankers in Cold-Start Recommender Systems: Coverage, Exposure and Practical Mitigations

Cold Start Latency in Serverless Computing: A Systematic Review, Taxonomy, and Future Directions

Shallow AutoEncoding Recommender with Cold Start Handling via Side Features

Contrastive Learning for Cold Start Recommendation with Adaptive Feature Fusion

Start Stop Bit Method for Efficient Data Communication in 6G Mobile Radio Systems

Verification and Validation of the Stakeholder Tool for Assessing Radioactive Transportation (START)

START: Self-taught Reasoner with Tools

Dynamic Interval Scheduling with Random Start and End Times

START: Traversing Sparse Footholds with Terrain Reconstruction

Graph Reasoning for Explainable Cold Start Recommendation