This study examines the performance of ChatGPT with an experiment in the legal domain. We compare the outcome with it a baseline using regular expressions (Regex), rather than focusing solely on the assessment against human performance. The study reveals that even if ChatGPT has access to the necessary knowledge and competencies, it is unable to assemble them, reason through, in a way that leads to an exhaustive result. This unveils a major limitation of ChatGPT. Intelligence encompasses the ability to break down complex issues and address them according to multiple required competencies, providing a unified and comprehensive solution. In the legal domain, one of the most crucial tasks is reading legal decisions and extracting key passages condensed from principles of law (PoLs), which are then incorporated into subsequent rulings by judges or defense documents by lawyers. In performing this task, artificial intelligence lacks an all-encompassing understanding and reasoning, which makes it inherently limited. Genuine intelligence, remains a uniquely human trait, at least in this particular field.
The high-order complexity of human behaviour is likely the root cause of extreme difficulty in financial market projections. We consider that behavioural simulation can unveil systemic dynamics to support analysis. Simulating diverse human groups must account for the behavioural heterogeneity, especially in finance. To address the fidelity of simulated agents, on the basis of agent-based modeling, we propose a new paradigm of behavioural simulation where each agent is supported and driven by a hierarchical knowledge architecture. This architecture, integrating language and professional models, imitates behavioural processes in specific scenarios. Evaluated on futures markets, our simulator achieves a 13.29% deviation in simulating crisis scenarios whose price increase rate reaches 285.34%. Under normal conditions, our simulator also exhibits lower mean square error in predicting futures price of specific commodities. This technique bridges non-quantitative information with diverse market behaviour, offering a promising platform to simulate investor behaviour and its impact on market dynamics.
We have developed a technique to restore scientific usage in compromised (publicly-available) images collected with the James Webb Space Telescope (JWST) of the Galactic globular cluster NGC 104 (47 Tucanae). In spite of the degradation and limited data, we were able to recover photometry and astrometry for the coolest stellar objects ever observed within a globular cluster, possibly unveiling the brightest part of the brown dwarf (BD) sequence. This is supported by: (i) proper motion membership, derived by the comparison with positions obtained from Hubble Space Telescope archival early epochs; (ii) the predicted location of the BD sequence; and (iii) the mass function for low-mass stars derived from models. Future JWST observations will provide the necessary deep and precise proper motions to confirm the nature of the here-identified BD candidates belonging to this globular cluster.
The return of normalcy to the population's lifestyle is a critical recovery milestone in the aftermath of disasters, and delayed lifestyle recovery could lead to significant well-being impacts. Lifestyle recovery captures the collective effects of population activities and the restoration of infrastructure and business services. This study uses a novel approach to leverage privacy-enhanced location intelligence data to characterize distinctive lifestyle patterns and to unveil recovery trajectories after a disaster in the context of 2017 Hurricane Harvey in Harris County, Texas. The analysis integrates multiple data sources to record the number of visits from home census block groups (CBGs) to different points of interest during the baseline period and disruptive period. First, primary clustering using k-means characterized four distinct essential and non-essential lifestyle patterns. Then, secondary clustering characterized the impact of the hurricane into three recovery trajectories based on the severity of maximum disruption and duration of recovery. The results reveal multiple recovery trajectories and durations within each lifestyle cluster, which imply differential recovery ra
The detected polarized radio emission from remnant of SN1987A opens the possibility to unveil the structure of the pre-supernova magnetic field in the circumstellar medium. Properties derived from direct measurements would be of importance for understanding the progenitor stars and their magnetic fields. As the first step to this goal, we adopted the hydrodynamic data from an elaborated three-dimensional (3-D) numerical model of SN1987A. We have developed an approximate method for `reconstruction' of 3-D magnetic field structure inside supernova remnant on the `hydrodynamic background'. This method uses the distribution of the magnetic field around the progenitor as the initial condition. With such a 3-D magneto-hydrodynamic model, we have synthesized the polarization maps for a number of SN1987A models and compared them to the observations. In this way, we have tested different initial configurations of the magnetic field as well as a structure of the synchrotron emission in SN987A. We have recovered the observed polarization pattern and we have found that the radial component of the ambient pre-supernova magnetic field should be dominant on the length-scale of the present-day rad
Degradation of the myelin sheath is a common pathology underlying demyelinating neurological diseases from Multiple Sclerosis to Leukodistrophies. Although large malformations of myelin ultrastructure in the advanced stages of Wallerian degradation is known, its subtle structural variations at early stages of demyelination remains poorly characterized. This is partly due to the lack of suitable and non-invasive experimental probes possessing sufficient resolution to detect the degradation. Here we report the feasibility of the application of an innovative non-invasive local structure experimental approach for imaging the changes of statistical structural fluctuations in the first stage of myelin degeneration. Scanning micro X-ray diffraction, using advances in synchrotron x-ray beam focusing, fast data collection, paired with spatial statistical analysis, has been used to unveil temporal changes in the myelin structure of dissected nerves following extraction of the Xenopus laevis sciatic nerve. The early myelin degeneration is a specific ordered compacted phase preceding the swollen myelin phase of Wallerian degradation. Our demonstration of the feasibility of the statistical anal
We describe a compression-based distance for genomic sequences. Instead of using the usual conjoint information content, as in the classical Normalized Compression Distance (NCD), it uses the conditional information content. To compute this Normalized Conditional Compression Distance (NCCD), we need a normal conditional compressor, that we built using a mixture of static and dynamic finite-context models. Using this approach, we measured chromosomal distances between Hominidae primates and also between Muroidea (rat and mouse), observing several insights of evolution that so far have not been reported in the literature.
Guanylate binding proteins (GBPs) are soluble dynamin-like proteins with structured domains that undergo a conformational transition for GTP-controlled oligomerization to exert their function as part of the innate immune system of mammalian cells - attacking intra-cellular parasites by disrupting their membranes. The structural basis and mechanism of this process is unknown. Therefore, we apply neutron spin echo, X-ray scattering, fluorescence, and EPR spectroscopy as techniques for integrative dynamic structural biology to human GBP1 (hGBP1). We mapped hGBP1's essential dynamics from nanoseconds to milliseconds by motional spectra of sub-domains. We find a GTP-independent flexibility of the C-terminal effector domain in the $μ$s-regime and structurally characterize conformers being essential that hGBP1 can open like a pocketknife for oligomerization. This unveils the intrinsic flexibility, a GTP-triggered association of the GTPase-domains and assembly-dependent GTP-hydrolysis as functional design principles of hGBP1 that control its reversible oligomerization in polar assemblies and the subsequent formation of condensates.
Accurate healthcare prediction is essential for improving patient outcomes. Existing work primarily leverages advanced frameworks like attention or graph networks to capture the intricate collaborative (CO) signals in electronic health records. However, prediction for rare diseases remains challenging due to limited co-occurrence and inadequately tailored approaches. To address this issue, this paper proposes UDC, a novel method that unveils discrete clues to bridge consistent textual knowledge and CO signals within a unified semantic space, thereby enriching the representation semantics of rare diseases. Specifically, we focus on addressing two key sub-problems: (1) acquiring distinguishable discrete encodings for precise disease representation and (2) achieving semantic alignment between textual knowledge and the CO signals at the code level. For the first sub-problem, we refine the standard vector quantized process to include condition awareness. Additionally, we develop an advanced contrastive approach in the decoding stage, leveraging synthetic and mixed-domain targets as hard negatives to enrich the perceptibility of the reconstructed representation for downstream tasks. For
The proliferation of open-sourced Large Language Models (LLMs) and diverse downstream tasks necessitates efficient model selection, given the impracticality of fine-tuning all candidates due to computational constraints. Despite the recent advances in LLM selection, a fundamental research question largely remains nascent: how can we model the dynamic behaviors of LLMs during fine-tuning, thereby enhancing our understanding of their generalization performance across diverse downstream tasks? In this work, we propose a novel theoretical framework that provides a proper lens to assess the generalization capabilities of LLMs, thereby enabling accurate and efficient LLM selection for downstream applications. In particular, we first derive a PAC-Bayesian Generalization Bound that unveils fine-tuning dynamics of LLMs and then introduce LENSLLM, a Neural Tangent Kernel (NTK)-based Rectified Scaling Model that enables accurate performance predictions across diverse tasks while maintaining computational efficiency. Extensive empirical results on 3 large-scale benchmarks demonstrate that our model achieves up to 91.1% accuracy and reduces up to 88.5% computational cost in LLM selection, outpe
Although Chain-of-Thought (CoT) has achieved remarkable success in enhancing the reasoning ability of large language models (LLMs), the mechanism of CoT remains a ``black box''. Even if the correct answers can frequently be obtained, existing CoTs struggle to make the reasoning understandable to human. In this paper, we unveil and causalize CoT from a causal perspective to ensure both correctness and understandability of all reasoning steps (to the best of our knowledge, the first such). We model causality of CoT via structural causal models (SCM) to unveil the reasoning mechanism of CoT. To measure the causality of CoT, we define the CoT Average Causal Effect (CACE) to test the causal relations between steps. For those steps without causality (wrong or unintelligible steps), we design a role-playing causal query algorithm to causalize these steps, resulting a causalized CoT with all steps correct and understandable. Experimental results on both open-source and closed-source LLMs demonstrate that the causal errors commonly in steps are effectively corrected and the reasoning ability of LLMs is significantly improved.
The Multiphase Astrophysics to Unveil the Virgo Environment (MAUVE) project is a multi-facility programme exploring how dense environments transform galaxies. Combining a VLT/MUSE P110 Large Programme and ALMA observations of 40 late-type Virgo Cluster galaxies, MAUVE resolves star formation, kinematics, and chemical enrichment within their molecular gas discs. A key goal is to track the evolution of cold gas that survives in the inner regions of satellites after entering the cluster, and how it evolves across different infall stages. With its high spatial resolution -- probing down to the physical scales of giant molecular cloud complexes -- and multiphase synergy, MAUVE aims to offer a time-resolved view of environmental quenching and set a new benchmark for cluster galaxy studies.
Large Language Models (LLMs) have achieved remarkable success across many applications, with Mixture of Experts (MoE) models demonstrating great potential. Compared to traditional dense models, MoEs achieve better performance with less computation. Speculative decoding (SD) is a widely used technique to accelerate LLM inference without accuracy loss, but it has been considered efficient only for dense models. In this work, we first demonstrate that, under medium batch sizes, MoE surprisingly benefits more from SD than dense models. Furthermore, as MoE becomes sparser -- the prevailing trend in MoE designs -- the batch size range where SD acceleration is expected to be effective becomes broader. To quantitatively understand tradeoffs involved in SD, we develop a reliable modeling based on theoretical analyses. While current SD research primarily focuses on improving acceptance rates of algorithms, changes in workload and model architecture can still lead to degraded SD acceleration even with high acceptance rates. To address this limitation, we introduce a new metric 'target efficiency' that characterizes these effects, thus helping researchers identify system bottlenecks and unders
Social media platforms have experienced a significant rise in toxic content, including abusive language and discriminatory remarks, presenting growing challenges for content moderation. Some users evade censorship by deliberately disguising toxic words through homophonic cloak, which necessitates the task of unveiling cloaked toxicity. Existing methods are mostly designed for English texts, while Chinese cloaked toxicity unveiling has not been solved yet. To tackle the issue, we propose C$^2$TU, a novel training-free and prompt-free method for Chinese cloaked toxic content unveiling. It first employs substring matching to identify candidate toxic words based on Chinese homo-graph and toxic lexicon. Then it filters those candidates that are non-toxic and corrects cloaks to be their corresponding toxicities. Specifically, we develop two model variants for filtering, which are based on BERT and LLMs, respectively. For LLMs, we address the auto-regressive limitation in computing word occurrence probability and utilize the full semantic contexts of a text sequence to reveal cloaked toxic words. Extensive experiments demonstrate that C$^2$TU can achieve superior performance on two Chines
This study unveils the elusive presence of criminal signatures in cyberspace, validating for the first time their existence through statistical evidence. By applying the A priori algorithm to the modus operandi of Advanced Persistent Threats, extracted from an extensive corpus of over 17,000 articles spanning 2007 to 2020, we highlight the enduring patterns leveraged by sophisticated cyber criminals. Our findings verify the existence of unique signatures associated with advanced cybercriminals, bridging a crucial gap in current understanding of human behavior in cyber-attacks. This pivotal research sets the foundation for an entirely new academic intersection in cybersecurity and computational criminology.
Predicting and constructing road geometric information (e.g., lane lines, road markers) is a crucial task for safe autonomous driving, while such static map elements can be repeatedly occluded by various dynamic objects on the road. Recent studies have shown significantly improved vectorized high-definition (HD) map construction performance, but there has been insufficient investigation of temporal information across adjacent input frames (i.e., clips), which may lead to inconsistent and suboptimal prediction results. To tackle this, we introduce a novel paradigm of clip-level vectorized HD map construction, MapUnveiler, which explicitly unveils the occluded map elements within a clip input by relating dense image representations with efficient clip tokens. Additionally, MapUnveiler associates inter-clip information through clip token propagation, effectively utilizing long-term temporal map information. MapUnveiler runs efficiently with the proposed clip-level pipeline by avoiding redundant computation with temporal stride while building a global map relationship. Our extensive experiments demonstrate that MapUnveiler achieves state-of-the-art performance on both the nuScenes and
Spectroscopic identification of distinct nonlinear photocurrents unveils quantum geometric properties of electron wavefunctions and the momentum-space topological structures. This is especially interesting, but still puzzling, for chiral topological semimetals with possibilities of hosting giant quantized circular photogalvanic effect. Here we report a comprehensive terahertz (THz) emission spectroscopic analysis of nonlinear photoconductivity of chiral multifold CoSi at 0.26 ~ 1 eV. We find a large linear shift conductivity (17 μA/V2), and confirm a giant injection conductivity (167 μA/V2) as a consequence of strongly interfered non-quantized contributions from the vicinity of multifold nodes with opposite chiralities. The bulk injection current excited by the pump field with a complex wavevector is shown to carry both longitudinal and transverse components. Symmetry analyses further unveil weak nonlocal photon drag effect in addition to the photogalvanic effect. This work not only highlights chiral transition metal monosilicides for mid-infrared photovoltaic applications via various nonlinear optical channels, but also consolidates the THz spectroscopy for quantitative photovolta
Navigating the intricacies of thermal management at the quantum scale is a challenge in the pursuit of advanced nanoscale technologies. To this extent, theoretical frameworks introducing minimal models mirroring the functionality of electronic current amplifiers and transistors, for instance, have been proposed. Different architectures of the subsystems composing a quantum thermal device can be considered, tacitly bringing drawbacks or advantages if properly engineered. This paper extends the prior research on thermotronics, studying a strongly coupled three-subsystem thermal device with a specific emphasis on a third excited level in the control subsystem. Our setup can be employed as a multipurpose device conditioned on the specific choice of internal parameters: heat switch, rectifier, stabilizer, and amplifier. The exploration of the detuned levels unveils a key role in the performance and working regime of the device. We observe a stable and strong amplification effect persisting over broad ranges of temperature. We conclude that considering a three-level system, as the one directly in contact with the control temperature, boosts output currents and the ability to operate our
In today's digital landscape, the proliferation of conspiracy theories within the disinformation ecosystem of online platforms represents a growing concern. This paper delves into the complexities of this phenomenon. We conducted a comprehensive analysis of two distinct X (formerly known as Twitter) datasets: one comprising users with conspiracy theorizing patterns and another made of users lacking such tendencies and thus serving as a control group. The distinguishing factors between these two groups are explored across three dimensions: emotions, idioms, and linguistic features. Our findings reveal marked differences in the lexicon and language adopted by conspiracy theorists with respect to other users. We developed a machine learning classifier capable of identifying users who propagate conspiracy theories based on a rich set of 871 features. The results demonstrate high accuracy, with an average F1 score of 0.88. Moreover, this paper unveils the most discriminating characteristics that define conspiracy theory propagators.
Large Language Models (LLMs) demonstrate an impressive capacity to recall a vast range of factual knowledge. However, understanding their underlying reasoning and internal mechanisms in exploiting this knowledge remains a key research area. This work unveils the factual information an LLM represents internally for sentence-level claim verification. We propose an end-to-end framework to decode factual knowledge embedded in token representations from a vector space to a set of ground predicates, showing its layer-wise evolution using a dynamic knowledge graph. Our framework employs activation patching, a vector-level technique that alters a token representation during inference, to extract encoded knowledge. Accordingly, we neither rely on training nor external models. Using factual and common-sense claims from two claim verification datasets, we showcase interpretability analyses at local and global levels. The local analysis highlights entity centrality in LLM reasoning, from claim-related information and multi-hop reasoning to representation errors causing erroneous evaluation. On the other hand, the global reveals trends in the underlying evolution, such as word-based knowledge e