搜索 — ResearchTracker

Understanding real-world videos such as movies requires integrating visual and dialogue cues. Yet existing VideoQA benchmarks struggle to capture this multimodal reasoning and, given the difficulty of evaluating free-form answers, largely resort to simple multiple choice questions. We introduce a novel open-ended multimodal VideoQA benchmark, MovieRecapsQA, created using movie recap videos -- a distinctive type of YouTube content that summarizes a film via a voiceover description of key clips from the movie (recap video). From the transcribed voiceover (recap summary) of 60 recap videos, we generate $\approx$8.2K questions along with the necessary ``facts'' expected in each answer; the former facilitates the creation of questions that require mutimodal reasoning and the latter allow the construction of a reference-free evaluation metric that can be applied to open-ended responses. To our knowledge, this is the first reference-free open-ended VideoQA benchmark. The benchmark allows each question to be evaluated in different input video settings: given (a) the full-length movie, (b) the full ($\approx$11 min) recap video (visual only), (c) $\approx$14 min of aligned movie scenes, i.e

RECAP: Resistance Capture in Text-based Mental Health Counseling with Large Language Models

arXiv2026-01-21作者：Anqi Li, Yuqian Chen, Yu Lu

Recognizing and navigating client resistance is critical for effective mental health counseling, yet detecting such behaviors is particularly challenging in text-based interactions. Existing NLP approaches oversimplify resistance categories, ignore the sequential dynamics of therapeutic interventions, and offer limited interpretability. To address these limitations, we propose PsyFIRE, a theoretically grounded framework capturing 13 fine-grained resistance behaviors alongside collaborative interactions. Based on PsyFIRE, we construct the ClientResistance corpus with 23,930 annotated utterances from real-world Chinese text-based counseling, each supported by context-specific rationales. Leveraging this dataset, we develop RECAP, a two-stage framework that detects resistance and fine-grained resistance types with explanations. RECAP achieves 91.25% F1 for distinguishing collaboration and resistance and 66.58% macro-F1 for fine-grained resistance categories classification, outperforming leading prompt-based LLM baselines by over 20 points. Applied to a separate counseling dataset and a pilot study with 62 counselors, RECAP reveals the prevalence of resistance, its negative impact on t

搜索结果：recap

MovieRecapsQA: A Multimodal Open-Ended Video Question-Answering Benchmark

RECAP: Resistance Capture in Text-based Mental Health Counseling with Large Language Models

RECAP: Local Hebbian Prototype Learning as a Self-Organizing Readout for Reservoir Dynamics

ReCap: Lightweight Referential Grounding for Coherent Story Visualization

RECAP: An End-to-End Platform for Capturing, Replaying, and Analyzing AI-Assisted Programming Interactions

ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization

RECAP: Transparent Inference-Time Emotion Alignment for Medical Dialogue Systems

ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents

Towards robust long-context understanding of large language model via active recap learning

RECAP: Reproducing Copyrighted Data from LLMs Training with an Agentic Pipeline

RECAP: REwriting Conversations for Intent Understanding in Agentic Planning

RECAP: A Resource-Efficient Method for Adversarial Prompting in Large Language Models

RECAP Framework v1.0: A Multi-Layer Inheritance Architecture for Evidence Synthesis

"Previously on ..." From Recaps to Story Summarization

RECAP: Regression Evaluation for Continual Adaptation of Prompts

ReCap: Better Gaussian Relighting with Cross-Environment Captures

Summaries, Highlights, and Action items: Design, implementation and evaluation of an LLM-powered meeting recap system

Previously on the Stories: Recap Snippet Identification for Story Reading

RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding

RECAP: Retrieval-Augmented Audio Captioning