搜索 — ResearchTracker

Human language production exhibits remarkable richness and variation, reflecting diverse communication styles and intents. However, this variation is often overlooked in summarization evaluation. While having multiple reference summaries is known to improve correlation with human judgments, the impact of the reference set on reference-based metrics has not been systematically investigated. This work examines the sensitivity of widely used reference-based metrics in relation to the choice of reference sets, analyzing three diverse multi-reference summarization datasets: SummEval, GUMSum, and DUC2004. We demonstrate that many popular metrics exhibit significant instability. This instability is particularly concerning for n-gram-based metrics like ROUGE, where model rankings vary depending on the reference sets, undermining the reliability of model comparisons. We also collect human judgments on LLM outputs for genre-diverse data and examine their correlation with metrics to supplement existing findings beyond newswire summaries, finding weak-to-no correlation. Taken together, we recommend incorporating reference set variation into summarization evaluation to enhance consistency along

Enhancing Reference-based Sketch Colorization via Separating Reference Representations

arXiv2025-08-25作者：Dingkun Yan, Xinrui Wang, Zhuoru Li

Reference-based sketch colorization methods have garnered significant attention for the potential application in animation and digital illustration production. However, most existing methods are trained with image triplets of sketch, reference, and ground truth that are semantically and spatially similar, while real-world references and sketches often exhibit substantial misalignment. This mismatch in data distribution between training and inference leads to overfitting, consequently resulting in artifacts and signif- icant quality degradation in colorization results. To address this issue, we conduct an in-depth analysis of the reference representations, defined as the intermedium to transfer information from reference to sketch. Building on this analysis, we introduce a novel framework that leverages distinct reference representations to optimize different aspects of the colorization process. Our approach decomposes colorization into modular stages, al- lowing region-specific reference injection to enhance visual quality and reference similarity while mitigating spatial artifacts. Specifically, we first train a backbone network guided by high-level semantic embeddings. We then in

搜索结果：reference

References Matter: Investigating the Impact of Reference Set Variation on Summarization Evaluation

Enhancing Reference-based Sketch Colorization via Separating Reference Representations

Responses of any arbitrary initially stressed reference and the stress-free reference

References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation

Lunar Reference Timescale

RefOnce: Distilling References into a Prototype Memory for Referring Camouflaged Object Detection

TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting

The Cascade Log: Reference-Stable Windowing over Tiered Append Sequences

Can No-reference features help in Full-reference image quality estimation?

Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics

Evidence-Linked Radiology Reporting: A Human-Supervised Reference Architecture for Structured Imaging Intelligence

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

A Five-Plane Reference Architecture for Runtime Governance of Production AI Agents

Bilateral Reference for High-Resolution Dichotomous Image Segmentation

LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors

Less is More: Learning Reference Knowledge Using No-Reference Image Quality Assessment

Reference-guided Controllable Inpainting of Neural Radiance Fields

Guarding against artificial intelligence--hallucinated citations: the case for full-text reference deposit

On the choice of reference in offset calibration

LLM Agents for Interactive Workflow Provenance: Reference Architecture and Evaluation Methodology