Knowledge augmentation has significantly enhanced the performance of Large Language Models (LLMs) in knowledge-intensive tasks. However, existing methods typically operate on the simplistic premise that model performance equates with internal knowledge, overlooking the knowledge-confidence gaps that lead to overconfident errors or uncertain truths. To bridge this gap, we propose a novel meta-cognitive framework for reliable knowledge augmentation via differentiated intervention and alignment. Our approach leverages internal cognitive signals to partition the knowledge space into mastered, confused, and missing regions, guiding targeted knowledge expansion. Furthermore, we introduce a cognitive consistency mechanism to synchronize subjective certainty with objective accuracy, ensuring calibrated knowledge boundaries. Extensive experiments demonstrate that our framework consistently outperforms strong baselines, validating its rationality in not only enhancing knowledge capabilities but also fostering cognitive behaviors that better distinguish knowns from unknowns. All code is available at https://github.com/AI9Stars/Know-More-Know-Clearer.
In today's digital environment, the rapid propagation of fake news via social networks poses significant social challenges. Most existing detection methods either employ traditional classification models, which suffer from low interpretability and limited generalization capabilities, or craft specific prompts for large language models (LLMs) to produce explanations and results directly, failing to leverage LLMs' reasoning abilities fully. Inspired by the saying that "truth becomes clearer through debate," our study introduces a novel multi-agent system with LLMs named TruEDebate (TED) to enhance the interpretability and effectiveness of fake news detection. TED employs a rigorous debate process inspired by formal debate settings. Central to our approach are two innovative components: the DebateFlow Agents and the InsightFlow Agents. The DebateFlow Agents organize agents into two teams, where one supports and the other challenges the truth of the news. These agents engage in opening statements, cross-examination, rebuttal, and closing statements, simulating a rigorous debate process akin to human discourse analysis, allowing for a thorough evaluation of news content. Concurrently, t
Vision Transformers are widely adopted as the backbone of vision foundation models, but they are known to produce high-norm artifacts that degrade representation quality. When knowledge distillation transfers these features to students, high-norm artifacts dominate the objective, so students overfit to artifacts and underweight informative signals, diminishing the gains from larger models. Prior work attempted to remove artifacts but encountered an inherent trade-off between artifact suppression and preserving informative signals from teachers. To address this, we introduce Singular Nullspace-Guided Energy Reallocation (SiNGER), a novel distillation framework that suppresses artifacts while preserving informative signals. The key idea is principled teacher feature refinement: during refinement, we leverage the nullspace-guided perturbation to preserve information while suppressing artifacts. Then, the refined teacher's features are distilled to a student. We implement this perturbation efficiently with a LoRA-based adapter that requires minimal structural modification. Extensive experiments show that SiNGER consistently improves student models, achieving state-of-the-art perform
Night photography often struggles with challenges like low light and blurring, stemming from dark environments and prolonged exposures. Current methods either disregard priors and directly fit end-to-end networks, leading to inconsistent illumination, or rely on unreliable handcrafted priors to constrain the network, thereby introducing greater error into the final result. We believe in the strength of data-driven high-quality priors and strive to offer a reliable and consistent prior, circumventing the restrictions of manual priors. In this paper, we propose Clearer Night Image Restoration with Vector-Quantized Codebook (VQCNIR) to achieve remarkable and consistent restoration outcomes on real-world and synthetic benchmarks. To ensure the faithful restoration of details and illumination, we propose the incorporation of two essential modules: the Adaptive Illumination Enhancement Module (AIEM) and the Deformable Bi-directional Cross-Attention (DBCA) module. The AIEM leverages the inter-channel correlation of features to dynamically maintain illumination consistency between degraded features and high-quality codebook features. Meanwhile, the DBCA module effectively integrates tex
In this paper, we address the concept of "alignment" in large language models (LLMs) through the lens of post-structuralist socio-political theory, specifically examining its parallels to empty signifiers. To establish a shared vocabulary around how abstract concepts of alignment are operationalised in empirical datasets, we propose a framework that demarcates: 1) which dimensions of model behaviour are considered important, then 2) how meanings and definitions are ascribed to these dimensions, and by whom. We situate existing empirical literature and provide guidance on deciding which paradigm to follow. Through this framework, we aim to foster a culture of transparency and critical evaluation, aiding the community in navigating the complexities of aligning LLMs with human populations.
Drawings of highly connected (dense) graphs can be very difficult to read. Power Graph Analysis offers an alternative way to draw a graph in which sets of nodes with common neighbours are shown grouped into modules. An edge connected to a module then implies a connection to each member of the module. Thus, the entire graph may be represented with much less clutter and without loss of detail. A recent experimental study has shown that such lossless compression of dense graphs makes it easier to follow paths. However, computing optimal power graphs is difficult. In this paper, we show that computing the optimal power graph with only one module is NP-hard and therefore likely NP-hard in the general case. We give an ILP model for power graph computation and discuss why ILP and CP techniques are poorly suited to the problem. Instead, we are able to find optimal solutions much more quickly using a custom search method. We also show how to restrict this type of search to allow only limited back-tracking to provide a heuristic that has better speed and better results than previously known heuristics.
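The idea of grouping nodes with common neighbours into modules can be illustrated with a minimal Python sketch. It covers only the simplest case, in which nodes with exactly the same neighbour set are merged; it is not the optimal search or the heuristic described in the abstract. The function name `group_by_neighbourhood` and the example graph are hypothetical and chosen purely for illustration.

```python
from collections import defaultdict


def group_by_neighbourhood(edges):
    """Group nodes that share exactly the same neighbour set.

    Illustrative assumption: only identical neighbourhoods are merged.
    A real power graph decomposition also handles partial overlaps
    and nested modules, which this sketch does not attempt.
    """
    # Build an undirected adjacency map from the edge list.
    adjacency = defaultdict(set)
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)

    # Nodes with the same neighbour set fall into the same module.
    modules = defaultdict(list)
    for node, neighbours in adjacency.items():
        modules[frozenset(neighbours)].append(node)

    # Return (members, shared neighbours) pairs in a deterministic order.
    return sorted(
        (sorted(members), sorted(neighbours))
        for neighbours, members in modules.items()
    )


# Complete bipartite graph K_{2,3}: every node in {a, b} is joined
# to every node in {x, y, z}, giving six ordinary edges.
edges = [(u, v) for u in "ab" for v in "xyz"]
for members, neighbours in group_by_neighbourhood(edges):
    print(members, "all connect to", neighbours)
```

In this example the six edges collapse into two modules, {a, b} and {x, y, z}, so a single edge drawn between the modules conveys the same connectivity as the original drawing.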
Blind deconvolution is the problem of recovering a sharp image and a blur kernel from a noisy blurry image. Recently, there has been a significant effort on understanding the basic mechanisms to solve blind deconvolution. While this effort resulted in the deployment of effective algorithms, the theoretical findings generated contrasting views on why these approaches worked. On the one hand, one could observe experimentally that alternating energy minimization algorithms converge to the desired solution. On the other hand, it has been shown that such alternating minimization algorithms should fail to converge and one should instead use a so-called Variational Bayes approach. To clarify this conundrum, recent work showed that a good image and blur prior is instead what makes a blind deconvolution algorithm work. Unfortunately, this analysis did not apply to algorithms based on total variation regularization. In this manuscript, we provide both analysis and experiments to get a clearer picture of blind deconvolution. Our analysis reveals the very reason why an algorithm based on total variation works. We also introduce an implementation of this algorithm and show that, in spite of its
We present the SI and other unit systems, including cgs-em and cgs-es, in a framework whereby a system of fully independent and dimensionally orthogonal base units is modified by conventions designed to simplify the equations that are used within each system. We propose that the radian can be seen as an independent unit whose dimensional status is modified in the SI and other unit systems for this purpose. This framework clarifies how different unit systems are interrelated, and identifies the key pieces of information that are needed to define both a unit system and the equations that are to be used with it. Specifically, these are the size of the base units in the unsimplified system, together with sufficient equations to identify all the conventions adopted by the particular unit system. The appropriate extra information for the revised SI is presented. We do not propose that the treatment of angles as dimensionless within the SI is changed. It is also proposed that the Gaussian unit system is best seen as identical to cgs-es, but with the B and H symbols in equations used to represent relativistic versions of B and H, which should properly be treated as different quantities. Th
The search for chemically unevolved galaxies remains prevalent in the nearby Universe, mostly because these systems provide excellent proxies for exploring in detail the physics of high-z systems. The most promising candidates are extremely metal-poor galaxies (XMPs), i.e., galaxies with <1/10 solar metallicity. However, due to the bright emission line based search criteria traditionally used to find XMPs, we may not be sampling the full XMP population. In 2014 we reoriented this search using only morphological properties and uncovered a population of ~150 `blue diffuse dwarf (BDD) galaxies', and published a sub-sample of 12 BDD spectra. Here we present optical spectroscopic observations of a larger sample of 51 BDDs, along with their SDSS photometric properties. With our improved statistics, we use direct-method abundances to confirm that BDDs are chemically unevolved (7.43<12+log(O/H)<8.01), with ~20% of our sample classified as being XMP galaxies, and find they are actively forming stars at rates of 1-33x10^-2 M_sol/yr in HII regions randomly embedded in a blue, low-surface brightness continuum. Stellar masses are calculated from population synthesis models and estimate
One way of carving up the broad "AI ethics and society" research space that has emerged in recent years is to distinguish between "near-term" and "long-term" research. While such ways of breaking down the research space can be useful, we put forward several concerns about the near/long-term distinction gaining too much prominence in how research questions and priorities are framed. We highlight some ambiguities and inconsistencies in how the distinction is used, and argue that while there are differing priorities within this broad research community, these differences are not well-captured by the near/long-term distinction. We unpack the near/long-term distinction into four different dimensions, and propose some ways that researchers can communicate more clearly about their work and priorities using these dimensions. We suggest that moving towards a more nuanced conversation about research priorities can help establish new opportunities for collaboration, aid the development of more consistent and coherent research agendas, and enable identification of previously neglected research areas.
Large language models (LLMs) and agentic systems are increasingly proposed for financial trading, yet their reported performance remains difficult to compare because studies vary in data provenance, temporal split discipline, execution timing, turnover treatment, and transaction-cost modeling. This article presents a targeted topical review and reproducibility audit of execution realism in LLM-based trading research. A coded evidence matrix covering 30 trade-relevant primary studies is used to assess point-in-time controls, split transparency, held-out evaluation, cost and turnover treatment, execution semantics, universe definition, and artifact release. Across the audited sample, architecture reporting is generally clearer than the evaluation assumptions needed to judge whether a trading result is economically interpretable or reproducible. A 10-equity worked example is included only as a methodological scaffold to illustrate how explicit friction and timing choices can materially compress active-strategy results. The main conclusion is that the next useful step for LLM trading research is not only better agent design, but also clearer reporting standards for execution realism, r
Effective questionnaire design improves the validity of the results, but creating and adapting questionnaires across contexts is challenging due to resource constraints and limited expert access. Recently, the emergence of LLMs has led researchers to explore their potential in survey research. In this work, we focus on the suitability of LLMs in assisting the generation and adaptation of questionnaires. We introduce a novel pipeline that leverages LLMs to create new questionnaires, pretest them with a target audience to identify potential issues, and adapt existing standardized questionnaires for different contexts. We evaluated our pipeline for creation and adaptation through two studies on Prolific, involving 238 participants from the US and 118 participants from South Africa. Our findings show that participants found LLM-generated text clearer, LLM-pretested text more specific, and LLM-adapted questions slightly clearer and less biased than traditional ones. Our work opens new opportunities for LLM-driven questionnaire support in survey research.
In this article, we present a new approach to studying multivariate period rings that is more consistent with classical theory and provides a clearer description of their structure. We also prove that the category of $B$-admissible representations forms a Tannakian subcategory of the category of representations of $G_{K,Δ}$ by defining an analogue of $(F,G)$-regular rings, which is central to the classification of representations in multivariate $p$-adic Hodge theory.
Symmetries play a crucial role in shaping the structure and predictions of multi-Higgs-doublet models. In three-Higgs-doublet models considerable effort has been put into classifying possible symmetry groups and the conditions for their realisation, yet the completeness of existing classifications remains an open question. In this work, we revisit the problem of identifying realisable symmetries by re-examining conventional Higgs family and general CP transformations from an alternative perspective. Our analysis identifies certain limitations in previous approaches and introduces a clearer, more systematic framework for model builders. We expand our classification by investigating more generalised symmetry structures -- the recently identified GOOFy transformations, which act non-trivially on the Higgs doublets and their conjugates. Our analysis consolidates known results, uncovers previously overlooked structures, and expands the set of symmetries in three-Higgs-doublet models, offering both a clearer theoretical foundation and a practical reference for symmetry-based model building.
The interior polynomial was originally defined for hypergraphs and later shown to coincide with the Ehrhart polynomial of the root polytope of an associated bipartite graph. In previous work, we derived an alternating cycle recursion formula for the interior polynomial. Here, we introduce a new, more transparent recursion formula based on the structure of non-expanding sets. This formula offers a clearer combinatorial interpretation of the interior polynomial and its connection to polyhedral geometry.
A full solution is presented to the recently proposed problem of determining the probability that no $k$-gon can be built from $n$ independently and uniformly chosen sticks in $[0,1]$. This extends the known results for triangles and quadrilaterals to general $k$-gons and offers a clearer interpretation of the connection to products of $k$-bonacci numbers.
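The probability in question can be estimated numerically with a short Monte Carlo sketch in Python. It relies on the standard polygon inequality (a $k$-gon exists exactly when the longest side is shorter than the sum of the other $k-1$ sides) and on the observation that, for a given longest stick, the best companions are the $k-1$ next-longest sticks, so only consecutive windows of the sorted lengths need checking. The function names `has_kgon` and `estimate_no_kgon`, the trial count, and the seed are illustrative assumptions; the code does not reproduce the closed-form solution of the abstract.

```python
import random


def has_kgon(sticks, k):
    """Return True if some k of the sticks can form a non-degenerate k-gon."""
    ordered = sorted(sticks)
    # For each candidate longest stick, pair it with the k-1 sticks
    # immediately below it in sorted order (the largest possible sum).
    for end in range(k - 1, len(ordered)):
        longest = ordered[end]
        others = ordered[end - k + 1:end]
        if longest < sum(others):
            return True
    return False


def estimate_no_kgon(n, k, trials=100_000, seed=0):
    """Monte Carlo estimate of P(no k-gon) for n uniform sticks in [0, 1]."""
    rng = random.Random(seed)  # fixed seed so runs are repeatable
    misses = 0
    for _ in range(trials):
        sticks = [rng.random() for _ in range(n)]
        if not has_kgon(sticks, k):
            misses += 1
    return misses / trials


if __name__ == "__main__":
    # Three sticks and triangles: the exact value is 1/2, so the
    # estimate should land close to 0.5.
    print(estimate_no_kgon(n=3, k=3))
```

Such an estimate is useful mainly as a sanity check against exact formulas for small $n$ and $k$; its accuracy is limited by the number of trials.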
The last decade has witnessed a rapid advancement of generative AI technology that significantly scaled the accessibility of AI-generated non-consensual intimate images (AIG-NCII), a form of image-based sexual abuse that disproportionately harms and silences women and girls. There is a patchwork of commendable efforts across industry, policy, academia, and civil society to address AIG-NCII. However, these efforts lack a shared, consistent mental model that clearly situates the technologies they target within the context of a large, interconnected, and ever-evolving technological ecosystem. As a result, interventions remain siloed and are difficult to evaluate and compare, leading to a reactive cycle of whack-a-mole. In this paper, we contribute the first comprehensive AIG-NCII technological ecosystem that maps and taxonomizes 11 categories of technologies facilitating the creation, distribution, proliferation and discovery, infrastructural support, and monetization of AIG-NCII. First, we build and visualize the ecosystem through a synthesis of over a hundred primary sources from researchers, journalists, advocates, policymakers, and technologists. Then, we conduct two detailed walk
In a series of papers, Ole Peters and his collaborators claim that the 'conceptual basis of mainstream economic theory' is 'flawed' and that the approach they call 'ergodicity economics' gives 'reason to hope for a future economic science that is more parsimonious, conceptually clearer and less subjective' (Peters, 2019). This paper argues that 'ergodicity economics' is pseudoscience because it has not produced falsifiable implications and should be taken with skepticism.
This paper presents a space-time-wise orthogonal analysis of space-time crystals. This analysis provides a solution consisting of a pair of explicit parametric equations that result from a separate application of the Bloch-Floquet theorem in the (orthogonal) directions of space and time. Compared to previous approaches, this solution offers the benefits of greater simplicity, clearer emphasis on space-time duality and deeper physical insight.
We show that a method proposed recently, based on the characteristic polynomial of an effective Hamiltonian, had been developed several years earlier by other authors in a clearer and more general way. We outline both implementations of the approach and compare them by means of the calculation of the exceptional point closest to the origin for a toy model.