搜索结果：Commander

共找到 20 条结果

高级筛选 ▾

Commander-GPT: Dividing and Routing for Multimodal Sarcasm Detection

arXiv2025-06-24作者：Yazhou Zhang, Chunwang Zou, Bo Wang

Multimodal sarcasm understanding is a high-order cognitive task. Although large language models (LLMs) have shown impressive performance on many downstream NLP tasks, growing evidence suggests that they struggle with sarcasm understanding. In this paper, we propose Commander-GPT, a modular decision routing framework inspired by military command theory. Rather than relying on a single LLM's capability, Commander-GPT orchestrates a team of specialized LLM agents where each agent will be selectively assigned to a focused sub-task such as keyword extraction, sentiment analysis, etc. Their outputs are then routed back to the commander, which integrates the information and performs the final sarcasm judgment. To coordinate these agents, we introduce three types of centralized commanders: (1) a trained lightweight encoder-based commander (e.g., multi-modal BERT); (2) four small autoregressive language models, serving as moderately capable commanders (e.g., DeepSeek-VL); (3) two large LLM-based commander (Gemini Pro and GPT-4o) that performs task routing, output aggregation, and sarcasm decision-making in a zero-shot fashion. We evaluate Commander-GPT on the MMSD and MMSD 2.0 benchmarks, c

搜索结果：Commander

Commander-GPT: Dividing and Routing for Multimodal Sarcasm Detection

Smart Commander: A Hierarchical Reinforcement Learning Framework for Fleet-Level PHM Decision Optimization

Global Commander and Local Operative: A Dual-Agent Framework for Scene Navigation

Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models

Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander

Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation

Predicting Enemy's Actions Improves Commander Decision-Making

Information-Theoretic Aggregation of Ethical Attributes in Simulated-Command

One Goal, Many Commands: Characterizing Denylist Fragility in AI Agents

Improving Pretrained YAMNet for Enhanced Speech Command Detection via Transfer Learning

The Command Line GUIde: Graphical Interfaces from Man Pages via AI

Hello Afrika: Speech Commands in Kinyarwanda

Command A: An Enterprise-Ready Large Language Model

Speech Command + Speech Emotion: Exploring Emotional Speech Commands as a Compound and Playful Modality

Multimodal Deep Learning for ATCO Command Lifecycle Modeling and Workload Prediction

Analyzing Multimodal Features of Spontaneous Voice Assistant Commands for Mild Cognitive Impairment Detection

Handling abort commands for household kitchen robots

RACONTEUR: A Knowledgeable, Insightful, and Portable LLM-Powered Shell Command Explainer

CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research

ge_gravity2: a command for solving universal gravity models

搜索结果：Commander

Commander-GPT: Dividing and Routing for Multimodal Sarcasm Detection

Smart Commander: A Hierarchical Reinforcement Learning Framework for Fleet-Level PHM Decision Optimization

Global Commander and Local Operative: A Dual-Agent Framework for Scene Navigation

Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models

Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander

Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation

Predicting Enemy's Actions Improves Commander Decision-Making

Information-Theoretic Aggregation of Ethical Attributes in Simulated-Command

One Goal, Many Commands: Characterizing Denylist Fragility in AI Agents

Improving Pretrained YAMNet for Enhanced Speech Command Detection via Transfer Learning

The Command Line GUIde: Graphical Interfaces from Man Pages via AI

Hello Afrika: Speech Commands in Kinyarwanda

Command A: An Enterprise-Ready Large Language Model

Speech Command + Speech Emotion: Exploring Emotional Speech Commands as a Compound and Playful Modality

Multimodal Deep Learning for ATCO Command Lifecycle Modeling and Workload Prediction

Analyzing Multimodal Features of Spontaneous Voice Assistant Commands for Mild Cognitive Impairment Detection

Handling abort commands for household kitchen robots

RACONTEUR: A Knowledgeable, Insightful, and Portable LLM-Powered Shell Command Explainer

CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research

ge_gravity2: a command for solving universal gravity models