搜索 — ResearchTracker

Recent Mixture-of-Experts (MoE)-based large language models (LLMs) such as Qwen-MoE and DeepSeek-MoE are transforming generative AI in natural language processing. However, these models require vast and diverse training data. Federated learning (FL) addresses this challenge by leveraging private data from heterogeneous edge devices for privacy-preserving MoE training. Nonetheless, traditional FL approaches require devices to host local MoE models, which is impractical for resource-constrained devices due to large model sizes. To address this, we propose DeepFusion, the first scalable federated MoE training framework that enables the fusion of heterogeneous on-device LLM knowledge via federated knowledge distillation, yielding a knowledge-abundant global MoE model. Specifically, DeepFusion features each device to independently configure and train an on-device LLM tailored to its own needs and hardware limitations. Furthermore, we propose a novel View-Aligned Attention (VAA) module that integrates multi-stage feature representations from the global MoE model to construct a predictive perspective aligned with on-device LLMs, thereby enabling effective cross-architecture knowledge dist

Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge

arXiv2025-05-03作者：Florian Schmid, Paul Primus, Toni Heittola

This paper presents the Low-Complexity Acoustic Scene Classification with Device Information Task of the DCASE 2025 Challenge, along with its baseline system. Continuing the focus on low-complexity models, data efficiency, and device mismatch from previous editions (2022-2024), this year's task introduces a key change: recording device information is now provided at inference time. This enables the development of device-specific models that leverage device characteristics-reflecting real-world deployment scenarios in which a model is designed with awareness of the underlying hardware. The training set matches the 25% subset used in the corresponding DCASE 2024 challenge, with no restrictions on external data use, highlighting transfer learning as a central topic. The baseline achieves 50.72% accuracy with a device-agnostic model, improving to 51.89% when incorporating device-specific fine-tuning. The task attracted 31 submissions from 12 teams, with 11 teams outperforming the baseline. The top-performing submission achieved an accuracy gain of more than 8 percentage points over the baseline on the evaluation set.

搜索结果：Device

DeepFusion: Accelerating MoE Training via Federated Knowledge Distillation from Heterogeneous Edge Devices

Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge

The Effects of Electronic and Photonic Coupling on the Performance of a Photothermionic-Photovoltaic Hybrid Solar Device

TensorSLM: Energy-efficient Embedding Compression of Sub-billion Parameter Language Models on Low-end Devices

On-Device Vision Training, Deployment, and Inference on a Thumb-Sized Microcontroller

IoTSense: Behavioral Fingerprinting of IoT Devices

Device Context Protocol: A Compact, Safety-First Architecture for LLM-Driven Control of Constrained Devices

Scaling On-Device GPU Inference for Large Generative Models

Physics-Informed Neural Networks for Device and Circuit Modeling: A Case Study of NeuroSPICE

HEP digital micromirror devices for precision solar spectroscopy

End-to-end Recording Device Identification Based on Deep Representation Learning

Bridging On-Device and Cloud LLMs for Collaborative Reasoning: A Unified Methodology for Local Routing and Post-Training

WhisperKit: On-device Real-time ASR with Billion-Scale Transformers

Memory Attacks on Device-Independent Quantum Cryptography

Multi-User Multi-IoT-Device Symbiotic Radio: A Novel Massive Access Scheme for Cellular IoT

EdgeFusion: On-Device Text-to-Image Generation

A novel device for controlling the flow of information based on Weyl fermions and some interesting remarks regarding the electromagnetic interactions of high energy particles

QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models -- Extended Version

Improvements on Device Independent and Semi-Device Independent Protocols of Randomness Expansion

Optimization of state parameters in displacement assisted photon subtracted measurement-device-independent quantum key distribution