Search: ResearchTracker

Coercion-resistance (CR) is a crucial security property in e-voting systems. It ensures that an attacker cannot compel a voter to vote in a specific way by using threats or rewards. The Loki e-voting protocol, proposed by Giustolisi et al. at IEEE S&P (2024), introduces a novel design that mitigates last-minute coercion through a re-voting mechanism. It also aims to address the usability issues of the seminal JCJ e-voting protocol, specifically: i) the requirement that voters can store and hide pre-agreed credentials, and ii) the ability of voters to convincingly lie while being coerced. In this work, we identify two vulnerabilities in Loki. The first is a brute-force attack that compromises the integrity of the evasion strategy. Specifically, this attack allows an adversary to cast a ballot on behalf of their victim in a way that the evasion strategy cannot defend against, rendering it ineffective. The second vulnerability is a forced abstention attack, which allows an adversary to detect when their victim has complied with their instruction not to vote. We generalise the integrity attack to reveal a fundamental dilemma: without pre-agreed secret credentials, it is not

The Model Agreed, But Didn't Learn: Diagnosing Surface Compliance in Large Language Models

arXiv, 2026-04-07. Authors: Xiaojie Gu, Ziying Huang, Weicong Hong

Large Language Models (LLMs) internalize vast world knowledge as parametric memory, yet inevitably inherit the staleness and errors of their source corpora. Consequently, ensuring the reliability and malleability of these internal representations is imperative for trustworthy real-world deployment. Knowledge editing offers a pivotal paradigm for surgically modifying memory without retraining. However, while recent editors demonstrate high success rates on standard benchmarks, it remains questionable whether current evaluation frameworks that rely on assessing output under specific prompting conditions can reliably authenticate genuine memory modification. In this work, we introduce a simple diagnostic framework that subjects models to discriminative self-assessment under in-context learning (ICL) settings that better reflect real-world application environments, specifically designed to scrutinize the subtle behavioral nuances induced by memory modifications. This probing reveals a pervasive phenomenon of Surface Compliance, where editors achieve high benchmark scores by merely mimicking target outputs without structurally overwriting internal beliefs. Moreover, we find that recursi

Search results: agreed

On the Necessity of Pre-agreed Secrets for Thwarting Last-minute Coercion: Vulnerabilities and Lessons From the Loki E-voting Protocol

The Model Agreed, But Didn't Learn: Diagnosing Surface Compliance in Large Language Models

Agreed and Disagreed Uncertainty

Majority-Agreed Key Distribution using Absolutely Maximally Entangled Stabilizer States

Computational astrophysics for the future: An open, modular approach with agreed standards would facilitate astrophysical discovery

Can AI Agents Agree?

Revisiting the syntax of imperatives in Yemeni Arabic: An Agree across phases approach

LLMs Know They're Wrong and Agree Anyway: The Shared Sycophancy-Lying Circuit

Demo: TOSense -- What Did You Just Agree to?

AgREE: Agentic Reasoning for Knowledge Graph Completion on Emerging Entities

Agree, Disagree, Explain: Decomposing Human Label Variation in NLI through the Lens of Explanations

Agree to Disagree? A Meta-Evaluation of LLM Misgendering

EXPLAIN, AGREE, LEARN: Scaling Learning for Neural Probabilistic Logic

Do LLMs Agree on the Creativity Evaluation of Alternative Uses?

A Reverse Engineering Education Needs Analysis Survey

Pressure-Constant Monte Carlo Simulation of Solid CO2 Phase I up to 10 GPa using Kihara Potential Model

Non-trivial $r$-wise agreeing families

Agree To Disagree

Agreeing to Stop: Reliable Latency-Adaptive Decision Making via Ensembles of Spiking Neural Networks

Do Subjectivity and Objectivity Always Agree? A Case Study with Stack Overflow Questions