Large language models (LLMs) have shown high agreement with human raters across a variety of tasks, demonstrating potential to ease the challenges of human data collection. In computational social science (CSS), researchers are increasingly leveraging LLM annotations to complement slow and expensive human annotations. Still, guidelines for collecting and using LLM annotations, without compromising the validity of downstream conclusions, remain limited. We introduce Confidence-Driven Inference: a method that combines LLM annotations and LLM confidence indicators to strategically select which human annotations should be collected, with the goal of producing accurate statistical estimates and provably valid confidence intervals while reducing the number of human annotations needed. Our approach comes with safeguards against LLM annotations of poor quality, guaranteeing that the conclusions will be both valid and no less accurate than if we only relied on human annotations. We demonstrate the effectiveness of Confidence-Driven Inference over baselines in statistical estimation tasks across three CSS settings--text politeness, stance, and bias--reducing the needed number of human annotations.
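The estimator family the abstract describes can be illustrated with a simplified sketch (not the paper's exact procedure; all names here are illustrative): spend the human-annotation budget preferentially where the LLM reports low confidence, then debias the cheap LLM mean with an inverse-probability-weighted correction so the result is unbiased for the human-label mean regardless of LLM quality.

```python
import numpy as np

def confidence_driven_mean(llm_labels, human_labels, confidences, budget, seed=0):
    """Illustrative confidence-driven estimator of a population mean.

    The human budget is spent preferentially on items where the LLM
    reports low confidence; the LLM mean is then debiased with an
    inverse-probability-weighted (Horvitz-Thompson style) correction
    on the sampled items. Simplified sketch, not the paper's method.
    """
    llm = np.asarray(llm_labels, float)
    human = np.asarray(human_labels, float)
    conf = np.asarray(confidences, float)
    rng = np.random.default_rng(seed)
    # sampling probabilities: larger where the LLM is less confident
    p = (1.0 - conf) + 1e-6
    p = np.minimum(1.0, budget * p / p.sum())
    sampled = rng.random(len(llm)) < p
    # E[corr_i] = human_i - llm_i, so the estimator is unbiased
    corr = np.zeros(len(llm))
    corr[sampled] = (human[sampled] - llm[sampled]) / p[sampled]
    return llm.mean() + corr.mean()
```

In a real deployment `human_labels` would only be queried for the sampled items; here the full array is passed purely to keep the simulation self-contained.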
Quantum process tomography of each directly implementable quantum gate used in the IBM quantum processors is performed to compute gate error, in order to check the viability of complex quantum operations in the superconductivity-based quantum computers introduced by IBM and to compare the quality of these gates with the corresponding gates implemented using other technologies. Quantum process tomography (QPT) of C-NOT gates has been performed for three configurations available in the IBM QX4 processor. For all the other allowed gates, QPT has been performed for every allowed position (i.e., by placing the gates in different qubit lines) for the IBM QX4 architecture, and thus, gate fidelities are obtained for both single-qubit and 2-qubit gates. Gate fidelities are observed to be lower than the corresponding values obtained in other technologies, like NMR. Further, gate fidelities for all the single-qubit gates are obtained for the IBM QX2 architecture by placing the gates in the third qubit line ($q[2]$). It is observed that the IBM QX4 architecture yields better gate fidelity compared to IBM QX2 in all cases except the case of the $\operatorname{Y}$ gate, as far as the gate fidelity corresponding to this qubit line is concerned.
The presented algorithms for segmentation and tracking follow a 3-step approach where we detect, track and finally segment nuclei. In the preprocessing phase, we detect centroids of the cell nuclei using a convolutional neural network (CNN) for the 2D images and a Laplacian-of-Gaussian Scale Space Maximum Projection approach for the 3D data sets. Tracking was performed in a backwards fashion on the predicted seed points, i.e., starting at the last frame and sequentially connecting corresponding objects until the first frame was reached. Correspondences were identified by propagating detections of a frame t to its preceding frame t-1 and by combining redundant detections using a hierarchical clustering approach. The tracked centroids were then used as input to variants of the seeded watershed algorithm to obtain the final segmentation.
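The backward-linking step described above can be sketched with a minimal nearest-neighbour version (the helper names and the single-linkage merge threshold are illustrative assumptions, not the authors' exact implementation):

```python
import numpy as np
from scipy.spatial.distance import cdist
from scipy.cluster.hierarchy import linkage, fcluster

def merge_redundant(points, merge_dist):
    """Fuse redundant detections closer than merge_dist using
    single-linkage hierarchical clustering (cluster means are kept)."""
    points = np.asarray(points, float)
    if len(points) < 2:
        return points
    labels = fcluster(linkage(points, method="single"),
                      merge_dist, criterion="distance")
    return np.array([points[labels == l].mean(axis=0)
                     for l in np.unique(labels)])

def backward_track(frames, merge_dist=3.0):
    """Link nuclei centroids backwards in time: start at the last frame
    and propagate each detection to its nearest neighbour in the
    preceding frame. frames: list of (N_t, 2) centroid arrays.
    Returns tracks as lists of (frame_index, centroid) pairs."""
    frames = [merge_redundant(f, merge_dist) for f in frames]
    tracks = [[(len(frames) - 1, c)] for c in frames[-1]]
    for t in range(len(frames) - 1, 0, -1):
        prev = frames[t - 1]
        if len(prev) == 0:
            break
        for tr in tracks:
            f_idx, c = tr[-1]
            if f_idx != t:
                continue  # track not active at this frame
            j = int(np.argmin(cdist([c], prev)))
            tr.append((t - 1, prev[j]))
    return tracks
```

The tracked centroids returned here would then seed the watershed segmentation mentioned above.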
A tree view or tree navigator is used to display hierarchical data organized in the form of a tree. In a tree structure there are parent and child nodes. The child nodes may further have descendants to n levels. There are many methods to make navigation easy, such as expanding and collapsing branches, splitting the tree, displaying a parent node in a separate tree, zooming branches, and scrolling in various directions. It is still a difficult exercise to handle large trees efficiently, and efforts continue to manage large numbers of nodes with faster speed, greater control, user friendliness and aesthetics. This article illustrates five inventions on tree navigators selected from the US patent database. Each of them tries to solve various problems relating to the tree navigator in different ways. Each invention is also analyzed from a TRIZ perspective.
Software engineering is knowledge-intensive work, and how to manage software engineering knowledge has received much attention. This systematic review identifies empirical studies of knowledge management initiatives in software engineering, and discusses the concepts studied, the major findings, and the research methods used. Seven hundred and sixty-two articles were identified, of which 68 were studies in an industry context. Of these, 29 were empirical studies and 39 reports of lessons learned. More than half of the empirical studies were case studies. The majority of empirical studies relate to technocratic and behavioural aspects of knowledge management, while there are few studies relating to economic, spatial and cartographic approaches. A finding reported across multiple papers was the need to not focus exclusively on explicit knowledge, but also consider tacit knowledge. We also describe implications for research and for practice.
Dialog boxes are useful for displaying warnings, errors, confirmations, etc. in special situations. A typical dialog box is displayed in a small window with a text message along with a few options for the user to select. However, there are certain difficulties associated with programming and implementing a conventional dialog box, such as severe programming effort, the rigidity of hard-coded messages, obscuring of screen space, and so on. There is a need to overcome these difficulties to make dialog boxes more efficient and useful. The modality of dialog boxes also creates some limitations. While modal dialog boxes need to be closed explicitly by the user, modeless dialog boxes can grow in number and become difficult to control. Thus, an ideal dialog box should be free of all the above-mentioned drawbacks. The dialog box should not obscure the screen, and the user should be able to open multiple dialog boxes without obscuring the screen. This article analyses 5 interesting inventions on dialog boxes selected from the US Patent database. Each invention tries to overcome some limitations of a conventional dialog box and provides some innovative features. Each solution is also analyzed from a TRIZ perspective.
Searches for phrases and word sets in large text arrays by means of additional indexes are considered. Their use may reduce the query-processing time by an order of magnitude in comparison with standard inverted files.
We present a new technique to constrain the gravitational potential of a galaxy from the observed stellar mass surface density alone, under a number of assumptions. It uses the classical Eddington Inversion Method to compute the phase-space distribution function (DF) needed for the stars to reside in a given gravitational potential. In essence, each potential defines a set of density profiles, and it is the expansion of the observed profile in this database that provides the DF. If the required DF becomes negative, the potential is inconsistent with the observed stars and can be discarded. The technique is particularly well suited to analyzing low-mass, low surface brightness galaxies, where photometric but not spectroscopic data can be obtained. The recently discovered low surface brightness galaxy 'Nube' is used to showcase its application. For the observed stellar core of Nube to be reproduced with a non-negative DF, cuspy NFW (Navarro, Frenk, and White) potentials are highly disfavored compared with potentials having cores (Schuster-Plummer or rho-230). The method assumes the stellar system to have spherical symmetry and an isotropic velocity distribution; however, we discuss simple extensions to relax these assumptions.
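For reference, the classical Eddington inversion that the technique builds on gives the isotropic DF from the density $\rho$ expressed as a function of the relative potential $\Psi$, with $\mathcal{E}$ the relative energy (standard textbook form):

```latex
f(\mathcal{E}) \;=\; \frac{1}{\sqrt{8}\,\pi^{2}}
\left[
\int_{0}^{\mathcal{E}} \frac{d^{2}\rho}{d\Psi^{2}}\,
\frac{d\Psi}{\sqrt{\mathcal{E}-\Psi}}
\;+\;
\frac{1}{\sqrt{\mathcal{E}}}
\left(\frac{d\rho}{d\Psi}\right)_{\Psi=0}
\right]
```

The physical-consistency test is simply $f(\mathcal{E}) \ge 0$ for all $\mathcal{E}$; any trial potential whose implied DF goes negative is discarded, as described above.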
As NASA's New Horizons spacecraft exits the Solar System bound for interstellar space, it has traveled so far that the nearest stars have shifted markedly from their positions seen from Earth. We demonstrated this by imaging the Proxima Centauri and Wolf 359 fields from Earth and New Horizons on 2020 April 23, when the spacecraft was 47.1 au distant. The observed parallaxes for Proxima Centauri and Wolf 359 are $32.4''$ and $15.7'',$ respectively. These measurements are not of research grade, but directly seeing large stellar parallaxes between two widely separated simultaneous observers is vividly educational. Using the New Horizons positions of the two stars alone, referenced to the three-dimensional model of the solar neighborhood constructed from Gaia DR3 astrometry, further provides the spacecraft spatial position relative to nearby stars with 0.44 au accuracy. The range to New Horizons from the Solar System barycenter is recovered to 0.27 au accuracy, and its angular direction to $0.4^\circ$ accuracy, when compared to the precise values from NASA Deep Space Network tracking. This is the first time optical stellar astrometry has been used to determine the three-dimensional location of a spacecraft.
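As a rough consistency check, the size of the shift follows from the small-angle parallax relation (distances below are approximate published values, used only for illustration):

```latex
\theta \;\approx\; \frac{b_{\perp}/\mathrm{au}}{d/\mathrm{pc}}\;\text{arcsec}
```

For Proxima Centauri ($d \approx 1.30$ pc) a 47.1 au baseline gives at most $47.1/1.30 \approx 36''$, and for Wolf 359 ($d \approx 2.41$ pc) at most $47.1/2.41 \approx 20''$; the observed $32.4''$ and $15.7''$ are somewhat smaller because only the baseline component perpendicular to each line of sight ($b_{\perp}$) contributes.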
In this work, we demonstrate that the Hindmarsh-Rose model subjected to additive white noise exhibits birhythmicity. Specifically, the system fluctuates between two distinct bursting attractors characterized by different numbers of spikes. This behavior is observed not only within the bistable region bounded by two saddle-node bifurcations of limit cycles but also beyond these boundaries. This phenomenon is associated with the ghost effect, typically observed near deterministic saddle-node bifurcations. We map the region of stochastic birhythmicity in terms of the noise intensity and a key deterministic parameter that controls the dynamics of fast ion channels. To provide an analytical foundation, we introduce a simple stochastic model with a single saddle-node bifurcation. In this model, stochastic birhythmicity is similarly characterized as a function of noise intensity and the control parameter.
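The noisy dynamics can be reproduced with a straightforward Euler-Maruyama integration of the Hindmarsh-Rose equations; a minimal sketch, where the parameter values are the standard textbook set and are assumptions rather than the paper's exact choices:

```python
import numpy as np

def simulate_hr(T=1000.0, dt=0.005, sigma=0.05, I=3.25, b=3.0, seed=0):
    """Euler-Maruyama integration of the Hindmarsh-Rose neuron with
    additive white noise on the membrane variable x.
    a=1, c=1, d=5, s=4, x_R=-1.6, r=0.006 are standard values
    (assumed); b controls the fast-subsystem dynamics."""
    a, c, d, s, x_r, r = 1.0, 1.0, 5.0, 4.0, -1.6, 0.006
    rng = np.random.default_rng(seed)
    n = int(T / dt)
    x = np.empty(n); y = np.empty(n); z = np.empty(n)
    x[0], y[0], z[0] = -1.6, 0.0, 2.0
    sqdt = np.sqrt(dt)
    for k in range(n - 1):
        x[k + 1] = x[k] + dt * (y[k] - a * x[k]**3 + b * x[k]**2 - z[k] + I) \
                   + sigma * sqdt * rng.standard_normal()
        y[k + 1] = y[k] + dt * (c - d * x[k]**2 - y[k])
        z[k + 1] = z[k] + dt * r * (s * (x[k] - x_r) - z[k])
    return x, y, z

def count_spikes(x, thresh=1.0):
    """Number of upward threshold crossings of the membrane variable;
    counting spikes per burst is how the two attractors are told apart."""
    up = (x[:-1] < thresh) & (x[1:] >= thresh)
    return int(up.sum())
```

Sweeping `sigma` and `b` over a grid and tallying the per-burst spike counts is one way to map the birhythmic region described above.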
As video games continue to evolve, understanding what drives player enjoyment remains a key challenge. Player reviews provide valuable insights, but their unstructured nature makes large-scale analysis difficult. This study applies generative AI and machine learning, leveraging the Microsoft Phi-4 small language model (SLM) and Google Cloud, to quantify and analyze game reviews from Steam and Meta Quest stores. The approach converts qualitative feedback into structured data, enabling comprehensive evaluation of key game design elements, monetization models, and platform-specific trends. The findings reveal distinct patterns in player preferences across PC and VR games, highlighting factors that contribute to higher player enjoyment. By using Google Cloud for large-scale data storage and processing, this study establishes a scalable framework for game review analysis. The study's insights offer actionable guidance for game developers, helping optimize game mechanics, pricing strategies, and player engagement.
Manifold visualisation techniques are commonly used to visualise high-dimensional datasets in physical sciences. In this paper we apply a recently introduced manifold visualisation method, called Slisemap, on datasets from physics and chemistry. Slisemap combines manifold visualisation with explainable artificial intelligence. Explainable artificial intelligence is used to investigate the decision processes of black box machine learning models and complex simulators. With Slisemap we find an embedding such that data items with similar local explanations are grouped together. Hence, Slisemap gives us an overview of the different behaviours of a black box model. This makes Slisemap into a supervised manifold visualisation method, where the patterns in the embedding reflect a target property. In this paper we show how Slisemap can be used and evaluated on physical data and that Slisemap is helpful in finding meaningful information on classification and regression models trained on these datasets.
Quantum Candies, or Qandies, provide a lucid way of understanding the concepts of quantum information and quantum science in the language of candies. The key idea of qandies is to depict quantum science intuitively to the general public, which is fitting given that most research in this domain is funded by taxpayers. The qandies model has already been used to explain essential concepts of quantum science and quantum cryptography; however, teleportation and related concepts are yet to be explained. Motivated by this fact, we investigate and extend the ideas of Jacobs and Lin-Mor-Shapira to explain teleportation using qandies. Here, we explicitly design the teleportation protocol and construct a circuit model using qandy gates. The protocol is successful when the correlated qandies are appropriately pre-shared and local operations are applied at both ends. The model we develop can be a valuable tool for science and engineering educators who want to help the general public gain more insight into quantum science and technology.
Brain age is an estimate of biological age derived from neuroimaging datasets using machine learning algorithms. Increasing brain age with respect to chronological age can reflect increased vulnerability to neurodegeneration and cognitive decline. In this paper, we study NeuroVNN, based on coVariance neural networks, as a paradigm for a foundation model for the brain age prediction application. NeuroVNN is pre-trained as a regression model on a healthy population to predict chronological age using cortical thickness features, and fine-tuned to estimate brain age in different neurological contexts. Importantly, NeuroVNN adds anatomical interpretability to brain age and has a `scale-free' characteristic that allows its transference to datasets curated according to any arbitrary brain atlas. Our results demonstrate that NeuroVNN can extract biologically plausible brain age estimates in different populations, as well as transfer successfully to datasets of dimensionalities distinct from that of the dataset used to train NeuroVNN.
Popular frameworks for self-supervised learning of speech representations have largely focused on frame-level masked prediction of speech regions. While this has shown promising downstream task performance for speech recognition and related tasks, it has largely ignored factors of speech that are encoded at a coarser level, such as characteristics of the speaker or channel that remain consistent throughout a speech utterance. In this work, we propose a framework for Learning Disentangled Self-Supervised (termed Learn2Diss) representations of speech, which consists of a frame-level and an utterance-level encoder module. The two encoders are initially learned independently, where the frame-level model is largely inspired by existing self-supervision techniques, thereby learning pseudo-phonemic representations, while the utterance-level encoder is inspired by contrastive learning of pooled embeddings, thereby learning pseudo-speaker representations. The joint learning of these two modules consists of disentangling the two encoders using a mutual information based criterion. With several downstream evaluation experiments, we show that the proposed Learn2Diss achieves state-of-the-art performance.
Sample selection models are a widely used approach for correcting bias caused by data that are missing not at random. Their formulation requires specifying the variables that influence the outcome and those that drive the selection process. This specification is often based on expert knowledge, which can result in the inclusion of irrelevant variables or the omission of important ones. Moreover, to avoid inferential problems such as practical non-identifiability, practitioners frequently impose exclusion restrictions, that is, model specifications in which certain variables predict selection but have no effect on the outcome of interest. A recent proposal employs adaptive LASSO to select the variables that enter into the outcome and selection equations, but its performance depends on the so-called covariance assumption, which can be violated in small to moderate samples. To address these challenges, we propose two families of spike-and-slab priors to conduct Bayesian variable selection in sample selection models. These prior structures allow for constructing a Gibbs sampler with tractable conditionals, which is scalable to the dimensions of practical interest. We illustrate the performance of the proposed approach.
Synchronization dynamics is a phenomenon of great interest in many fields of science. One of the most important is neuron dynamics, as synchronization in certain regions of the brain is related to some of the most common mental illnesses. To study the impact of network heterogeneity on neuronal synchronization, we analyze a small-world network of non-identical Chialvo neurons that are electrically coupled. We introduce a mismatch in one of the model parameters to create the heterogeneity of the network. Our study examines the effects of this parameter mismatch, the noise intensity in the stochastic model, and the coupling strength between neurons on synchronization and firing frequency. We have identified critical values of noise intensity, parameter mismatch, and rewiring probability that facilitate effective synchronization within the network. Furthermore, we observe that the balance between excitatory and inhibitory connections plays a crucial role in achieving global synchronization. Our findings offer insights into the mechanisms driving synchronization dynamics in complex neuron networks.
We investigate the synchronization between two neurons using the stochastic version of the map-based Chialvo model. To simulate non-identical neurons, a mismatch is introduced in one of the main parameters of the model. Subsequently, the synchronization of the neurons is studied as a function of this mismatch, the noise introduced in the stochastic model, and the coupling strength between the neurons. We propose the simplest neuron network for study, as its analysis is more straightforward and does not compromise generality. Within this network, two non-identical neuron maps are electrically coupled. In order to understand whether specific behaviors affect the global behavior of the system, we consider different cases related to the behavior of the neurons (chaotic or periodic). Furthermore, we study how variations in model parameters affect the firing frequency in all cases. Additionally, we consider that the two neurons have both excitatory and inhibitory couplings. Consequently, we identify critical values of noise and mismatch for achieving satisfactory synchronization between the neurons in both cases. Finally, we conjecture that the results are of a general nature and are applicable beyond this minimal setting.
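A minimal version of this setup, two electrically coupled stochastic Chialvo maps with a parameter mismatch, can be sketched as follows (the parameter values are a commonly used set and are assumptions, not necessarily the paper's):

```python
import numpy as np

def coupled_chialvo(n_steps=5000, eps=0.05, mismatch=0.01, sigma=0.001, seed=1):
    """Two electrically coupled stochastic Chialvo maps:
    x' = x^2 exp(y - x) + k + coupling + noise,  y' = a*y - b*x + c.
    Parameters a=0.89, b=0.6, c=0.28, k=0.03 are a commonly used set
    (assumed here); the mismatch perturbs k of the second neuron."""
    a, b, c, k = 0.89, 0.6, 0.28, 0.03
    k2 = k * (1.0 + mismatch)
    rng = np.random.default_rng(seed)
    x = np.array([0.5, 0.6])
    y = np.array([0.5, 0.5])
    traj = np.empty((n_steps, 2))
    for n in range(n_steps):
        noise = sigma * rng.standard_normal(2)
        gap = eps * (x[::-1] - x)  # diffusive (electrical) coupling
        xn = x**2 * np.exp(y - x) + np.array([k, k2]) + gap + noise
        y = a * y - b * x + c
        x = xn
        traj[n] = x
    return traj

def sync_error(traj, discard=1000):
    """Mean absolute difference of the two membrane variables after a
    transient; small values indicate satisfactory synchronization."""
    return float(np.abs(traj[discard:, 0] - traj[discard:, 1]).mean())
```

Scanning `eps`, `mismatch`, and `sigma` while monitoring `sync_error` reproduces the kind of critical-value analysis described above.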
Large language models (LLMs) have shown impressive capabilities in generating program code, opening exciting opportunities for applying program synthesis to games. In this work, we explore the potential of LLMs to directly synthesize usable code for a wide range of gaming applications, focusing on two programming languages, Python and Java. We use an evolutionary hill-climbing algorithm, where the mutations and seeds of the initial programs are controlled by LLMs. For Python, the framework covers various game-related tasks, including five miniature versions of Atari games, ten levels of Baba is You, an environment inspired by Asteroids, and a maze generation task. For Java, the framework contains 12 games from the TAG tabletop games framework. Across 29 tasks, we evaluated 12 language models for Python and 8 for Java. Our findings suggest that the performance of LLMs depends more on the task than on model size. While larger models generate more executable programs, these do not always result in higher-quality solutions and are much more expensive. No model has a clear advantage, although on any specific task one model may be better. Trying many models on a problem and using the best result is therefore a reasonable strategy.
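The evolutionary hill-climbing loop can be sketched generically. In the setting above, the mutation operator would prompt an LLM to rewrite the incumbent program; here it is abstracted into an injected callable (`mutate` is a hypothetical stand-in, and the scoring function would be a game-specific fitness evaluator):

```python
import random

def hill_climb(initial_program, mutate, score, iterations=100, seed=0):
    """(1+1) evolutionary hill climbing: keep a single incumbent
    program, propose a mutated variant each step, and accept it if it
    scores at least as well as the incumbent."""
    rng = random.Random(seed)
    best, best_score = initial_program, score(initial_program)
    for _ in range(iterations):
        candidate = mutate(best, rng)
        s = score(candidate)
        if s >= best_score:
            best, best_score = candidate, s
    return best, best_score
```

With an LLM-backed `mutate`, the same loop applies unchanged; only the representation of a "program" and the fitness evaluation differ per task.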