Search - ResearchTracker

Vision-language models (VLMs) have broad potential in privacy-sensitive domains such as healthcare and finance, yet strict data-sharing constraints render centralized training infeasible. Federated learning (FL) mitigates this issue by enabling decentralized training, but practical deployments face challenges due to client heterogeneity in computational resources, application requirements, and model architectures. We argue that while replacing data with model parameters characterizes the present of FL, replacing parameters with preferences represents a more scalable and privacy-preserving future. Motivated by this perspective, we propose MoR, a federated alignment framework based on Group Relative Policy Optimization (GRPO) with Mixture-of-Rewards for heterogeneous VLMs. MoR initializes a visual foundation model as a KL-regularized reference, while each client locally trains a reward model from local preference annotations, capturing specific evaluation signals without exposing raw data. To reconcile heterogeneous rewards, we introduce a routing-based fusion mechanism that adaptively aggregates client reward signals. Finally, the server performs GRPO with this mixed reward to optimize the base VLM. Experiments on three public VQA benchmarks demonstrate that MoR c

LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing

arXiv, 2026-03-13. Authors: Jiawei Hao, Zhiwei Hao, Jianyuan Guo

Mixture-of-Experts (MoE) based Large Language Models (LLMs) have demonstrated impressive performance and computational efficiency. However, their deployment is often constrained by substantial memory demands, primarily due to the need to load numerous expert modules. While existing expert compression techniques like pruning or merging attempt to mitigate this, they often suffer from irreversible knowledge loss or high training overhead. In this paper, we propose a novel expert compression paradigm termed expert replacing, which replaces redundant experts with parameter-efficient modules and recovers their capabilities with low training costs. We find that even a straightforward baseline of this paradigm yields promising performance. Building on this foundation, we introduce LightMoE, a framework that enhances the paradigm by introducing adaptive expert selection, hierarchical expert construction, and an annealed recovery strategy. Experimental results show that LightMoE matches the performance of LoRA fine-tuning at a 30% compression ratio. Even under a more aggressive 50% compression rate, it outperforms existing methods and achieves average performance improvements of 5.6% across

Search results: Replacing

Replacing Parameters with Preferences: Federated Alignment of Heterogeneous Vision-Language Models

LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing

TabAgent: A Framework for Replacing Agentic Generative Components with Tabular-Textual Classifiers

ReTok: Replacing Tokenizer to Enhance Representation Efficiency in Large Language Model

Replacing CAPTCHA with XNO micropayments

Is deeper always better? Replacing linear mappings with deep learning networks in the Discriminative Lexicon Model

The asymptotic distribution of maxima of stationary random sequences under random replacing

Replacing softmax with ReLU in Vision Transformers

Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization

Replacing bar-like resolutions in a simplicial setting

Replacing Language Model for Style Transfer

Compressing Models with Few Samples: Mimicking then Replacing

Improving the Diversity of Bootstrapped DQN by Replacing Priors With Noise

From Sparsity to Simplicity: Enabling Simpler Sequential Replacements via Sparse Attention Distillation

Memory-Modular Classification: Learning to Generalize with Memory Replacement

Deterministic Continuous Replacement: Fast and Stable Module Replacement in Pretrained Transformers

MERGE: Minimal Expression-Replacement GEneralization Test for Natural Language Inference

Tangle replacement on spatial graphs

Replacement-Type Quantum Gates

Replacement and Reputation