As large language models (LLMs) are more frequently used in retrieval-augmented generation pipelines, it is increasingly relevant to study their behavior under knowledge conflicts. Thus far, the role of the source of the retrieved information has gone unexamined. We address this gap with a novel framework to investigate how source preferences affect LLM resolution of inter-context knowledge conflicts in English, motivated by interdisciplinary research on credibility. By using synthetic sources, we study preferences for different types of sources without inheriting the biases of specific real-world sources. With a comprehensive, tightly-controlled evaluation of 13 open-weight LLMs, we find that LLMs prefer institutionally-corroborated information (e.g., government or newspaper sources) over information from people and social media. However, these source preferences can be reversed by simply repeating information from less credible sources. To mitigate repetition effects and maintain consistent preferences, we propose a novel method that reduces repetition bias by up to 79.2%, while also maintaining at least 72.5% of original preferences. We release all data and code to encourage future research.
Audio synthesis has broad applications in multimedia. Recent advancements have made it possible to generate relevant audio from inputs describing an audio scene, such as images or text. However, the immersiveness and expressiveness of the generation remain limited. One possible problem is that existing methods rely solely on the global scene and overlook details of local sounding objects (i.e., sound sources). To address this issue, we propose a Sound Source-Aware Audio (SS2A) generator. SS2A is able to locally perceive multimodal sound sources from a scene with visual detection and cross-modality translation. It then contrastively learns a Cross-Modal Sound Source (CMSS) Manifold to semantically disambiguate each source. Finally, we attentively mix their CMSS semantics into a rich audio representation, from which a pretrained audio generator outputs the sound. To model the CMSS manifold, we curate a novel single-sound-source visual-audio dataset, VGGS3, from VGGSound. We also design a Sound Source Matching Score to clearly measure localized audio relevance. Owing to the effectiveness of explicit sound source modeling, SS2A achieves state-of-the-art performance in extensive image-to-audio generation experiments.
We present starkiller, an open-source Python package for forward-modeling flux retrieval from integral field unit spectrograph (IFU) datacubes. Starkiller simultaneously provides stellar spectral classification, relative velocity, and line-of-sight extinction for all sources in a catalog, alongside a source-subtracted datacube. It performs synthetic difference imaging by simulating all catalog sources in the field of view, using the catalog for positions and fluxes to scale stellar models, independent of the datacube. This differencing method is particularly powerful for subtracting both point sources and trailed or even streaked sources from extended astronomical objects. We demonstrate starkiller's effectiveness on VLT/MUSE observations of extended sources in dense stellar fields, including comets, asteroids, and nebulae. We also show that starkiller can treat satellite-impacted VLT/MUSE observations. The package could be applied to tasks as varied as dust extinction in clusters and stellar variability; the stellar modeling using Gaia fluxes is provided as a standalone function. The techniques can be expanded to imagers and to other IFUs.
Private randomness is a fundamental resource for cryptography, security proofs, and information processing. Quantum devices offer a unique advantage by amplifying weak randomness sources in regimes unattainable by classical means. A central theoretical model for such sources is the Santha-Vazirani (SV) model, yet identifying natural processes that satisfy this model remains a major challenge. Here we take three steps toward addressing this problem. First, we introduce an axiomatic framework for quantifying weak randomness, providing a unified basis for estimating an SV-type source. Second, we develop SVTest, a general-purpose software tool for estimating the SV parameter of an arbitrary data sequence. Third, we apply this framework to both engineered and natural sources. Using data from a self-certifying commercial quantum random number generator with guaranteed min-entropy as a benchmark, we validate the accuracy and limitations of our estimation method. We then analyze geophysical signals associated with seismic activity and find that, depending on the discretization, both earthquakes and local seismic noise can exhibit SV-type randomness. Our results indicate that geophysical phenomena may serve as natural sources of SV-type randomness.
We define and investigate source-modality monitoring -- the ability of multimodal models to track and communicate the input source from which pieces of information originate. We consider source-modality monitoring as an instance of the more general binding problem, and evaluate the extent to which models exploit syntactic vs. semantic signals to bind words like "image" in a user-provided prompt to specific components of their input and context (i.e., actual images). Across experiments spanning 11 vision-language models (VLMs) performing target-modality information retrieval tasks, we find that both syntactic and semantic signals play an important role, but that the latter tend to outweigh the former when modalities are highly distinct distributionally. We discuss the implications of these findings for model robustness, and in the context of increasingly multimodal agentic systems.
In this paper, we investigate Source-free Open-partial Domain Adaptation (SF-OPDA), which addresses situations where both domain and category shifts exist between the source and target domains. Under the SF-OPDA setting, which aims to address data privacy concerns, the model can no longer access source data during target adaptation. We propose a novel training scheme to learn an (n+1)-way classifier that predicts the n source classes and the unknown class, where only samples of known source categories are available for training. Furthermore, for target adaptation, we simply adopt weighted entropy minimization to adapt the source-pretrained model to the unlabeled target domain without source data. In experiments, we show that our simple method surpasses current OPDA approaches that demand source data during adaptation. When augmented with a closed-set domain adaptation approach during target adaptation, our source-free method further outperforms the current state-of-the-art OPDA method by 2.5%, 7.2%, and 13% on Office-31, Office-Home, and VisDA, respectively.
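The weighted entropy minimization objective mentioned above can be sketched in a few lines; the particular weighting scheme below (generic per-sample weights over the (n+1)-way softmax output) is an illustrative assumption, not the paper's exact formulation.

```python
import numpy as np

def weighted_entropy(probs, weights):
    """Weighted entropy over a batch of (n+1)-way predictions.

    probs:   (B, n+1) softmax outputs of the source-pretrained classifier
    weights: (B,) per-sample weights (e.g., down-weighting likely-unknown
             samples; the actual weighting rule is a modeling choice)

    Minimizing this value over the unlabeled target data sharpens the
    model's predictions without requiring any source data.
    """
    eps = 1e-12
    ent = -np.sum(probs * np.log(probs + eps), axis=1)  # per-sample entropy
    return np.sum(weights * ent) / np.sum(weights)
```

A uniform prediction contributes entropy log(n+1), while a confident one-hot prediction contributes zero, so the objective pushes target samples toward confident class assignments.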
In this work, we define a diffusion-based generative model capable of both music synthesis and source separation by learning the score of the joint probability density of sources sharing a context. Alongside the classic total inference tasks (i.e., generating a mixture, separating the sources), we also introduce and experiment on the partial generation task of source imputation, where we generate a subset of the sources given the others (e.g., play a piano track that goes well with the drums). Additionally, we introduce a novel inference method for the separation task based on Dirac likelihood functions. We train our model on Slakh2100, a standard dataset for musical source separation, provide qualitative results in the generation settings, and showcase competitive quantitative results in the source separation setting. Our method is the first example of a single model that can handle both generation and separation tasks, thus representing a step toward general audio models.
Recent gravitational wave detections from black hole mergers have underscored the critical role that black hole perturbation theory and the Teukolsky equation play in understanding the behaviour of black holes. The separable nature of the Teukolsky equation has long been leveraged to study the vacuum linear Teukolsky equation; however, as theory and measurements advance, solving the sourced Teukolsky equation is becoming a frontier of research. In particular, second-order calculations, such as in quasi-normal mode and self-force problems, have extended sources. This paper presents a novel method for analytically separating the Teukolsky equation's source, aimed at improving efficiency. Separating the source is a non-trivial problem due to the angular and radial mixing of generic quantities in Kerr spacetime. We provide a proof-of-concept demonstration of our method and show that it is accurate, separating the Teukolsky source produced by the stress-energy tensor of an ideal gas cloud surrounding a Kerr black hole. The detailed application of our method is provided in an accompanying \textit{Mathematica} notebook. Our approach opens up a new avenue for accurate black hole perturbation theory.
We detect a highly significant excess of X-ray (2RXS) and radio (NVSS, GMRT, VLSSr) catalog sources when stacked around MCXC galaxy clusters and groups, narrowly confined within $\lesssim100\mathrm{\,kpc}$ of the $\sim2.4 R_{500}$ virial shock radius (inferred from previous continuum stacking), with similar X-ray ($\sim4\sigma$ for $443$ clusters) and radio ($\sim4\sigma$ for $485$ clusters) characteristics ($>5\sigma$ joint). The excess sources show $10-100$ kpc scales, $L_X(0.1-2.4\mbox{ keV})\simeq10^{42-43}\mathrm{\,erg\,s^{-1}}$ or $\nu L_\nu(\nu=1.4\mbox{ GHz}) \simeq 10^{40-41}\mathrm{\,erg\,s^{-1}}$ luminosities, and a preferentially radial radio polarization. The narrow localization and properties of the excess identify these sources not as AGN, often invoked speculatively for excess X-ray sources at cluster outskirts, but rather as infalling gaseous clumps interacting with the virial shock, probably galactic halos and possibly outflow remnants. The local excess of such discrete, radio-to-$\gamma$-ray sources around an object can probe its virial shock also at high redshifts and sub-cluster scales.
Existing research on source tracing of audio deepfake systems has focused primarily on the closed-set scenario, while studies that evaluate open-set performance are limited to a small number of unseen systems. Given the large number of emerging audio deepfake systems, robust open-set source tracing is critical. We leverage the protocol of the Interspeech 2025 special session on source tracing to evaluate methods for improving open-set source tracing performance. We introduce a novel adaptation of the energy score for out-of-distribution (OOD) detection: softmax energy (SME). We find that replacing the typical temperature-scaled energy score with SME provides a relative average improvement of 31% in the standard FPR95 measure (false positive rate at a true positive rate of 95%). We further explore SME-guided training as well as copy synthesis, codec, and reverberation augmentations, yielding an FPR95 of 8.3%.
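For context, the conventional temperature-scaled energy score that SME replaces, and the FPR95 metric used to compare them, can be sketched as follows; SME's exact definition is given in the paper and is not reproduced here.

```python
import numpy as np

def energy_score(logits, T=1.0):
    """Standard temperature-scaled energy score for OOD detection:
    E(x) = -T * logsumexp(logits / T).  Lower energy suggests a more
    in-distribution (i.e., seen deepfake system) sample.  (The paper's
    softmax-energy variant modifies this baseline.)"""
    z = np.asarray(logits) / T
    m = z.max(axis=-1, keepdims=True)               # for numerical stability
    return -T * (m.squeeze(-1) + np.log(np.exp(z - m).sum(axis=-1)))

def fpr_at_95_tpr(scores_id, scores_ood):
    """FPR95: fraction of OOD samples accepted at the threshold where 95%
    of in-distribution samples are accepted (lower score = more ID)."""
    thr = np.percentile(scores_id, 95)              # 95% ID acceptance
    return float(np.mean(np.asarray(scores_ood) <= thr))
```

Lower FPR95 is better: it means fewer unseen systems slip past the detector while keeping 95% of known-system samples correctly accepted.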
Music Source Restoration (MSR) targets the recovery of original, unprocessed instrument stems from fully mixed and mastered audio, where production effects and distribution artifacts violate common linear-mixture assumptions. This technical report presents the CP-JKU team's system for the MSR ICASSP Challenge 2025. Our approach decomposes MSR into separation and restoration. First, a single BandSplit-RoFormer separator predicts eight stems plus an auxiliary "other" stem, and is trained with a three-stage curriculum that progresses from 4-stem warm-start fine-tuning (with LoRA) to 8-stem extension via head expansion. Second, we apply a HiFi++ GAN waveform restorer trained as a generalist and then specialized into eight instrument-specific experts.
Domain adaptive semantic segmentation enables robust pixel-wise understanding in real-world driving scenes. Source-free domain adaptation, as a more practical technique, addresses the data privacy and storage concerns of typical unsupervised domain adaptation methods, making it especially relevant in the context of intelligent vehicles. It uses a well-trained source model and unlabeled target data to achieve adaptation in the target domain. However, in the absence of source data and target labels, current solutions cannot sufficiently reduce the impact of domain shift or fully leverage the information in the target data. In this paper, we propose an end-to-end source-free domain adaptation method for semantic segmentation via Importance-Aware and Prototype-Contrast (IAPC) learning. The proposed IAPC framework effectively extracts domain-invariant knowledge from the well-trained source model and learns domain-specific knowledge from the unlabeled target domain. Specifically, to address the domain shift in the source model's predictions on the target domain, we put forward an importance-aware mechanism for the biased target prediction probability distribution.
Ambisonics is a scene-based spatial audio format that has several useful features compared to object-based formats, such as efficient whole-scene rotation and versatility. However, it does not provide direct access to the individual source signals, so these have to be separated from the mixture when required. Typically, this is done with linear spherical harmonics (SH) beamforming. In this paper, we explore deep-learning-based source separation on static Ambisonics mixtures. In contrast to most source separation approaches, which separate a fixed number of sources of specific sound types, we focus on separating arbitrary sounds from specific directions. Specifically, we propose three operating modes that combine a source separation neural network with SH beamforming: refinement, implicit, and mixed mode. We show that a neural network can implicitly associate conditioning directions with the spatial information contained in the Ambisonics scene to extract specific sources. We evaluate the performance of the three proposed approaches and compare them to SH beamforming on musical mixtures generated with the musdb18 dataset, as well as on mixtures generated with the FUSS dataset.
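As a reference point, the linear SH beamforming baseline can be sketched for a first-order Ambisonics signal; the ACN channel ordering, SN3D normalization, and cardioid weighting below are assumptions of this sketch rather than details from the paper.

```python
import numpy as np

def foa_beam(b_format, azimuth, elevation):
    """Basic linear beamformer on a first-order Ambisonics (FOA) mixture.

    b_format: (4, T) array of FOA channels in ACN order [W, Y, Z, X]
              with SN3D normalization (assumptions of this sketch).
    Returns a cardioid-like beam signal steered at (azimuth, elevation),
    i.e., a fixed linear combination of the SH channels.
    """
    w, y, z, x = b_format
    # Real spherical harmonics of order <= 1 at the steering direction.
    sy = np.sin(azimuth) * np.cos(elevation)
    sz = np.sin(elevation)
    sx = np.cos(azimuth) * np.cos(elevation)
    # Cardioid weighting: unit gain on-axis, a null in the opposite direction.
    return 0.5 * (w + sy * y + sz * z + sx * x)
```

A plane wave encoded from the steering direction passes with unit gain, while one from the opposite direction is cancelled, which is exactly the spatial selectivity the learned separators above refine or replace.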
In this paper, we introduce a novel numerical method for reconstructing the trajectory of a moving point source in three-dimensional space, where both the emission moment and the spatial location of the point source are unknown. Our approach relies solely on measuring the time of arrival at five or seven properly chosen observation points. By exploiting the distinctive geometric configuration of these observation points, we establish the uniqueness of the trajectory and emission moment of the point source through rigorous mathematical proofs. Moreover, we analyze the stability of the proposed method. Its effectiveness is also verified by numerical experiments.
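The arrival-time model underlying this setup can be illustrated with a generic nonlinear least-squares solver. The Gauss-Newton sketch below is not the paper's method (which establishes uniqueness for specially chosen observation points); it only shows how the unknown position and emission moment enter the arrival times.

```python
import numpy as np

def locate_source(obs, times, c=1.0, iters=50):
    """Recover a source position p and emission time t0 from arrival times
    t_i = t0 + |p - x_i| / c at observation points x_i, via Gauss-Newton
    on the nonlinear least-squares residual (illustrative solver only).

    obs:   (N, 3) observation points, N >= 5
    times: (N,)   measured arrival times
    """
    p = obs.mean(axis=0)                # initial guess: observer centroid
    t0 = times.min() - 1.0              # crude initial emission time
    for _ in range(iters):
        d = np.linalg.norm(obs - p, axis=1)
        r = t0 + d / c - times          # residuals of the arrival-time model
        # Jacobian w.r.t. (p, t0): d r_i / dp = (p - x_i)/(c d_i), d r_i / dt0 = 1
        J = np.hstack([(p - obs) / (c * d[:, None]), np.ones((len(obs), 1))])
        step, *_ = np.linalg.lstsq(J, -r, rcond=None)
        p, t0 = p + step[:3], t0 + step[3]
    return p, t0
```

With five well-spread observers and noise-free times, the four unknowns (three coordinates plus the emission moment) are overdetermined, mirroring the uniqueness question the paper answers rigorously.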
We aim to evaluate the possibility of improving the ICRS realization, starting from the ICRF2 catalogue, by investigating the coordinate time series of radio sources observed by VLBI between 1979 and 2016. Sources with long observational histories are selected as candidates, and least squares fits with special handling of the weights are performed to derive the linear drifts of the source coordinates. The sources are then sorted based on the normalized linear drift (i) over the whole sky and (ii) in four homolographic areas divided by declination. The axial stability of the reference system and the sky distribution defined by the selected sources are evaluated, and these serve as the criteria for the final source lists. With our improved source selection scheme, two groups of sources are proposed and considered suitable for defining a more stable and homogeneous celestial reference system compared to the current ICRF2. The final lists contain 323 and 294 sources, respectively, and the global rotation of the axes derived from the apparent motion of the sources is about two times better than for the ICRF2.
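The weighted least squares fit for the linear drift of a coordinate time series can be sketched as follows; the paper's "special handling of the weights" is not reproduced, and this minimal version simply weights each epoch by the inverse variance.

```python
import numpy as np

def linear_drift(epochs, coords, sigmas):
    """Weighted least-squares linear fit to a source coordinate time series.

    epochs: (N,) observation epochs (e.g., in years)
    coords: (N,) coordinate offsets at each epoch
    sigmas: (N,) per-epoch formal uncertainties

    Returns (offset, drift, drift_uncertainty).  The drift estimate is the
    slope used to rank candidate defining sources by apparent motion.
    """
    w = 1.0 / np.asarray(sigmas) ** 2
    A = np.vstack([np.ones_like(epochs), epochs]).T   # design matrix [1, t]
    N = A.T @ (w[:, None] * A)                        # weighted normal matrix
    sol = np.linalg.solve(N, A.T @ (w * coords))
    cov = np.linalg.inv(N)                            # parameter covariance
    return sol[0], sol[1], np.sqrt(cov[1, 1])
```

Sorting sources by the normalized drift (slope divided by its uncertainty) then yields the stability ranking described above.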
Two key contributions presented in this paper are: i) a method for building a dataset containing source code features extracted from source files taken from Open Source Software (OSS) and associated bug reports, and ii) a predictive model for estimating the defectiveness of a given source code file. These artifacts can be useful for building tools and techniques in several automated software engineering areas, such as bug localization, code review and recommendation, and program repair. To achieve our goal, we first extract coding style information (e.g., related to programming language constructs used in the source code) for source code files present on GitHub. Then the information available in the bug reports (if any) associated with these source code files is extracted. The fetched unstructured (or semi-structured) information is then transformed into a structured knowledge base. We considered more than 30,400 source code files from 20 different GitHub repositories, with about 14,950 associated bug reports across 4 bug tracking portals. The source code files considered are written in four programming languages (viz., C, C++, Java, and Python) and belong to different types of applications.
Quantum wells (QWs) are of great importance in optoelectronic devices such as LEDs and lasers, where they serve as the emissive layers. Simulating quantum particles in different QW topologies, such as rectangular finite potential wells, multiple potential wells, and triangular biased potential well heterojunctions, enables faster modeling, theoretical characterization, and more. QVNTVS performs energy level and wavefunction calculations, as well as recombination probability, transition energy, and optical emission computations, quickly and accurately. In contrast to existing simulators, QVNTVS is an open-source project and can produce solutions for niche problems such as potential wells under an electric field, heterojunctions, recombination, and transition matrices. QVNTVS simulates QWs by solving the time-independent Schrödinger equation for different potential profiles in a discretized space using the finite-difference method, and computes the properties of the device from the information extracted from the solution. The results align with analytical calculations and experimental data.
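The core finite-difference step described above can be sketched in one dimension; this illustrative version uses natural units and Dirichlet boundaries, and is not QVNTVS's actual implementation, whose unit system and boundary handling may differ.

```python
import numpy as np

def solve_tise(potential, dx, n_states=3, hbar=1.0, m=1.0):
    """Finite-difference solution of the 1-D time-independent Schroedinger
    equation on a uniform grid (a minimal sketch of the method QWs
    simulators of this kind use).

    potential: (N,) potential energy sampled at grid points of spacing dx
    Returns the n_states lowest energies and grid-normalized wavefunctions.
    """
    N = len(potential)
    t = hbar**2 / (2.0 * m * dx**2)
    # Tridiagonal Hamiltonian: kinetic term from the 3-point Laplacian
    # plus the potential on the diagonal; Dirichlet (hard-wall) boundaries.
    H = (np.diag(2.0 * t + potential)
         - t * np.eye(N, k=1) - t * np.eye(N, k=-1))
    E, psi = np.linalg.eigh(H)
    psi = psi[:, :n_states] / np.sqrt(dx)   # normalize so sum |psi|^2 dx = 1
    return E[:n_states], psi
```

Feeding in a tilted potential (V(x) = eFx inside the well) reproduces the biased-well case mentioned above, since only the diagonal of the Hamiltonian changes.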
The extended source effect on microlensing magnification is non-negligible and must be taken into account in an analysis of microlensing. However, evaluating the extended source magnification is numerically expensive because it involves a two-dimensional integral over the source profile. Various studies have developed methods to reduce this integral to a one-dimensional integral or an integral-free form, but these adopt approximations or depend on the exact form of the source profile, e.g. a disk or a linear/quadratic limb-darkening profile. In this paper, we develop a new method to evaluate the extended source magnification based on the fast Fourier transform (FFT), which adopts no approximations and is applicable to any source profile. Our implementation of the FFT-based method enables evaluation of the extended source magnification as fast as $\sim1$ msec (CPU time on a laptop) and guarantees an accuracy better than 0.3%. The FFT-based method can be used for template fitting to the huge data sets of light curves from existing and upcoming surveys.
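The central idea, that the extended-source magnification is the point-source magnification convolved with the source profile, can be sketched with a plain FFT convolution; grid construction, zero-padding, and the accuracy control from the paper are omitted in this minimal version.

```python
import numpy as np

def extended_source_mag(mag_map, profile):
    """Extended-source magnification as the 2-D (circular) convolution of a
    point-source magnification map with a normalized source brightness
    profile, evaluated with FFTs via the convolution theorem.

    mag_map: (N, N) point-source magnification sampled on a uniform grid
    profile: (N, N) source profile on the same grid (normalized internally)
    """
    prof = profile / profile.sum()          # unit total flux
    # Convolution theorem: conv(A, S) = IFFT( FFT(A) * FFT(S) )
    conv = np.fft.ifft2(np.fft.fft2(mag_map) * np.fft.fft2(prof))
    return np.real(conv)
```

Because the FFT pair costs O(N^2 log N) regardless of the profile shape, the same routine handles disk, limb-darkened, or arbitrary profiles, which is the generality claimed above.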
While Unsupervised Domain Adaptation (UDA), in which labeled data are available only from source domains, has been actively studied in recent years, most algorithms and theoretical results focus on Single-source Unsupervised Domain Adaptation (SUDA). In practical scenarios, however, labeled data can typically be collected from multiple diverse sources, which might differ not only from the target domain but also from each other. Thus, domain adapters from multiple sources should not be modeled in the same way. Recent deep-learning-based Multi-source Unsupervised Domain Adaptation (MUDA) algorithms focus on extracting common domain-invariant representations for all domains by aligning the distributions of all pairs of source and target domains in a common feature space. However, it is often very hard to extract the same domain-invariant representations for all domains in MUDA. In addition, these methods match distributions without considering the domain-specific decision boundaries between classes. To solve these problems, we propose a new framework with two alignment stages for MUDA, which separately aligns the distributions of each pair of source and target domains.
Free/Open Source Software (FOSS) enables large-scale reuse of preexisting software components. The main drawback is increased complexity in software supply chain management. A common approach to tame such complexity is automated open source compliance, which consists in automating the verification of adherence to various open source management best practices regarding license obligation fulfillment, vulnerability tracking, software composition analysis, and nearby concerns. We consider the problem of auditing a source code base to determine which of its parts have been published before, which is an important building block of automated open source compliance toolchains. Indeed, if source code allegedly developed in house is recognized as having been previously published elsewhere, alerts should be raised to investigate where it comes from and whether this entails that additional obligations must be fulfilled before product shipment. We propose an efficient approach for prior publication identification that relies on a knowledge base of known source code artifacts linked together in a global Merkle directed acyclic graph and a dedicated discovery protocol. We introduce swh-scanner, a source code scanner implementing this approach in practice.
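The leaves of such a Merkle DAG are content-addressed file identifiers, so the basic prior-publication check reduces to hashing local files and looking the hashes up in the knowledge base. The sketch below computes a SWHID-style content identifier (Git's blob hashing convention, as used by Software Heritage) and runs a toy scan against an in-memory set; the real discovery protocol queries the archive incrementally rather than a local set.

```python
import hashlib
import pathlib

def swhid_content(path):
    """SWHID-style content identifier ("swh:1:cnt:<sha1_git>") for a file,
    using Git's blob convention: sha1 of b"blob <len>\\0" + file bytes."""
    data = pathlib.Path(path).read_bytes()
    digest = hashlib.sha1(b"blob %d\0" % len(data) + data).hexdigest()
    return f"swh:1:cnt:{digest}"

def scan(root, known_ids):
    """Toy prior-publication scan: flag every file under root whose content
    identifier already appears in the knowledge base of known artifacts."""
    return {p: swhid_content(p) in known_ids
            for p in pathlib.Path(root).rglob("*") if p.is_file()}
```

A file flagged True has byte-identical content already archived, which is exactly the condition that should trigger a compliance alert for allegedly in-house code.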