Chronic Kidney Disease (CKD) affects millions of people worldwide, yet its early detection remains challenging, especially in outpatient settings where laboratory-based renal biomarkers are often unavailable. In this work, we investigate the predictive potential of routinely collected non-renal clinical variables for CKD classification, including sociodemographic factors, comorbid conditions, and urinalysis findings. We introduce the Nephrology-Oriented Representation leArning (NORA) approach, which combines supervised contrastive learning with a nonlinear Random Forest classifier. NORA first derives discriminative patient representations from tabular EHR data, which are then used for downstream CKD classification. We evaluated NORA on a clinic-based EHR dataset from Riverside Nephrology Physicians. Our results demonstrated that NORA improves class separability and overall classification performance, particularly enhancing the F1-score for early-stage CKD. Additionally, we assessed the generalizability of NORA on the UCI CKD dataset, demonstrating its effectiveness for CKD risk stratification across distinct patient cohorts.
In modern collider experiments, the quest to explore fundamental interactions between elementary particles has reached unparalleled levels of precision. Signatures from particle physics detectors are low-level objects (such as energy depositions or tracks) encoding the physics of collisions (the final state particles of hard scattering interactions). The complete simulation of them in a detector is a computational and storage-intensive task. To address this computational bottleneck in particle physics, alternative approaches have been developed, introducing additional assumptions and trade off accuracy for speed.The field has seen a surge in interest in surrogate modeling the detector simulation, fueled by the advancements in deep generative models. These models aim to generate responses that are statistically identical to the observed data. In this paper, we conduct a comprehensive and exhaustive taxonomic review of the existing literature on the simulation of detector signatures from both methodological and application-wise perspectives. Initially, we formulate the problem of detector signature simulation and discuss its different variations that can be unified. Next, we classify
In recent years, there have been significant breakthroughs in the field of natural language processing, particularly with the development of large language models (LLMs). These LLMs have showcased remarkable capabilities on various benchmarks. In the healthcare field, the exact role LLMs and other future AI models will play remains unclear. There is a potential for these models in the future to be used as part of adaptive physician training, medical co-pilot applications, and digital patient interaction scenarios. The ability of AI models to participate in medical training and patient care will depend in part on their mastery of the knowledge content of specific medical fields. This study investigated the medical knowledge capability of LLMs, specifically in the context of internal medicine subspecialty multiple-choice test-taking ability. We compared the performance of several open-source LLMs (Koala 7B, Falcon 7B, Stable-Vicuna 13B, and Orca Mini 13B), to GPT-4 and Claude 2 on multiple-choice questions in the field of Nephrology. Nephrology was chosen as an example of a particularly conceptually complex subspecialty field within internal medicine. The study was conducted to evalu
AI coding agents increasingly accept assigned software tasks, modify repositories under bounded authority, and return work packages for review. Prior work proposed the software delegation contract, covering the task, authority, returned work package, and acceptance context, as the unit of analysis for delegated coding work, but did not measure its effects. This paper reports a controlled pilot study of explicit delegation contracts for coding agents. We built a dependency-free TypeScript API task environment with seeded defects and documentation gaps, authored ten tasks across five families, and ran 64 agent executions across two model tiers under three conditions: a realistic issue-style prompt, an explicit delegation contract, and a contract with a required evidence bundle. Each run was scored with hidden acceptance tests, mutation checks, and scope analysis, then reviewed by three independent condition-blinded model-based reviewers using a fixed rubric, for 192 reviews. Explicit contracts did not improve objective task outcomes: all 64 runs passed hidden acceptance checks, with zero scope violations. They did improve reviewability. Evidence sufficiency improved in 22 of 30 paire
This paper investigates sentiment classification of Steam game reviews using an attention-based Bidirectional Long Short-Term Memory (BiLSTM) model. Using a dataset of 50,000 reviews sampled from a larger Steam review corpus, the authors compare a traditional machine learning baseline based on TF-IDF and PyCaret AutoML with a deep learning approach implemented in PyTorch. The proposed BiLSTM+Attention model is trained with class-weighted cross-entropy to address class imbalance and achieves 83% accuracy and 85% weighted F1-score on the test set, with 90% recall for negative reviews. The paper also presents attention visualizations to show interpretability by highlighting sentiment-bearing words. The study concludes that the BiLSTM+Attention model is effective for analyzing user sentiment in Steam reviews and useful for helping developers understand player feedback.
Possible topological nature of Kondo and mixed valence insulators has been a recent topic of interest in condensed matter physics. Attention has focused on SmB6, which has long been known to exhibit low temperature transport anomaly, whose origin is of independent interest. We argue that it is possible to resolve the topological nature of surface states by uniquely accessing the surface electronic structure of the low temperature anomalous transport regime through combining state-of-the-art laser- and synchrotron-based angle-resolved photoemission spectroscopy (ARPES) with or without spin resolution. A combination of low temperature and ultra-high resolution (laser) which is lacking in previous ARPES studies of this compound is the key to resolve the possible existence of topological surface state in SmB6. Here we outline an experimental algorithm to systematically explore the topological versus trivial or mixed (topological and trivial surface state admixture as in the first 3D TI Bi$_{1-x}$Sb$_x$) nature of the surface states in Kondo and mixed valence insulators. We conclude based on this methodology that the observed topology of the surface Fermi surface in our low temperature
Upon mechanical loading, granular materials yield and undergo plastic deformation. The nature of plastic deformation is essential for the development of the macroscopic constitutive models and the understanding of shear band formation. However, we still do not fully understand the microscopic nature of plastic deformation in disordered granular materials. Here we used synchrotron X-ray tomography technique to track the structural evolutions of three-dimensional granular materials under shear. We establish that highly distorted coplanar tetrahedra are the structural defects responsible for microscopic plasticity in disordered granular packings. The elementary plastic events occur through flip events which correspond to a neighbor switching process among these coplanar tetrahedra (or equivalently as the rotation motion of 4-ring disclinations). These events are discrete in space and possess specific orientations with the principal stress direction.
Ultralight dark matter refers to the lightest potential dark matter candidates. We will focus on the mass range that has been studied using astrophysical and cosmological observations, corresponding to a mass $10^{-24} \, \mathrm{eV} \lesssim m \lesssim 10^{-18} \, \mathrm{eV}$. We will discuss the motivations for this mass range. The most studied model in this range corresponds to a minimally coupled, single, classical, spin-0 field comprising all dark matter. However, the work exploring extensions of this model (for example, higher spin, self-coupled, multiple field, and mixed models) will be one of the focuses of this review. The phenomenology associated with ultralight dark matter is rich and includes linear effects on the primordial power spectrum, core structures forming at the center of halos, nonlinear effects resulting in heating of stellar distributions, and non-relativistic effects relating to pulsar signals and black hole superradiance, to name a few. This set of effects has been studied using an equally extensive set of numerical tools. We will summarize the most common ones and discuss their applications and limitations. Ultralight dark matter also has a wide variety
Natural products, as metabolites from microorganisms, animals, or plants, exhibit diverse biological activities, making them crucial for drug discovery. Nowadays, existing deep learning methods for natural products research primarily rely on supervised learning approaches designed for specific downstream tasks. However, such one-model-for-a-task paradigm often lacks generalizability and leaves significant room for performance improvement. Additionally, existing molecular characterization methods are not well-suited for the unique tasks associated with natural products. To address these limitations, we have pre-trained a foundation model for natural products based on their unique properties. Our approach employs a novel pretraining strategy that is especially tailored to natural products. By incorporating contrastive learning and masked graph learning objectives, we emphasize evolutional information from molecular scaffolds while capturing side-chain information. Our framework achieves state-of-the-art (SOTA) results in various downstream tasks related to natural product mining and drug discovery. We first compare taxonomy classification with synthesized molecule-focused baselines t
Planets form and obtain their compositions from the leftover material present in protoplanetary disks of dust and gas surrounding young stars. The chemical make-up of a disk influences every aspect of planetary composition including their overall chemical properties, volatile content, atmospheric composition, and potential for habitability. This Review discusses our knowledge of the chemical and isotopic composition of Solar System materials and how this information can be used to place constraints on the formation pathways of terrestrial planets. We conclude that planetesimal formation by the streaming instability followed by rapid accretion of drifting pebbles within the protoplanetary disk lifetime reproduces most of the chemical and isotopic observables in Solar System. This finding has important implications for planetary habitability beyond the Solar System because in pebble accretion, volatiles important for life are accreted during the main growth phase of rocky planets as opposed to the late-stage. Finally, we explore how bulk chemical inventories and masses of planetary bodies control the composition of their primordial atmospheres and their potential to develop habitable
The Great Divide in metaphysical debates about laws of nature is between Humeans, who think that laws merely describe the distribution of matter, and non-Humeans, who think that laws govern it. The metaphysics can place demands on the proper formulations of physical theories. It is sometimes assumed that the governing view requires a fundamental / intrinsic direction of time: to govern, laws must be dynamical, producing later states of the world from earlier ones, in accord with the fundamental direction of time in the universe. In this paper, we propose a minimal primitivism about laws of nature (MinP) according to which there is no such requirement. On our view, laws govern by constraining the physical possibilities. Our view captures the essence of the governing view without taking on extraneous commitments about the direction of time or dynamic production. Moreover, as a version of primitivism, our view requires no reduction / analysis of laws in terms of universals, powers, or dispositions. Our view accommodates several potential candidates for fundamental laws, including the principle of least action, the Past Hypothesis, the Einstein equation of general relativity, and even
In a recent publication, we demonstrated electrical spin injection and detection in n-type silicon at temperatures up to 500K using ferromagnetic metal / SiO2 tunnel barrier contacts in a three-terminal geometry (Nature Commun. 2:245 doi:10.1038/ncomms125 (2011)). In comparing our measured spin-voltage signal with the value predicted by theory, we followed the analysis of Tran et al, (Phys. Rev. Lett. 102, 036601 (2009)), and inadvertently propagated an error found therein. As they note in a recent erratum (arXiv:0810.4770v2), the correct expression for the spin resistance area product from the theory for a sample with a spin diffusion length LSD much less than the contact width or channel thickness (our experimental situation) is given by the product {gamma}^2 {rho} LSD, where {gamma} is the tunneling spin polarization, and {rho} is the resistivity of the semiconductor transport channel. With this correction, our measured spin voltages are much larger than those predicted by theory, rather than in good agreement as we stated. We emphasize that the basic conclusions of our paper are the same - the systematic decrease in electron spin lifetime with increasing electron density demons
This paper reviews the work done on black hole interior volume, entropy, and evaporation. An insight into the basics for understanding the interior volume is presented. A general analogy to investigate the interior volume of a black hole, the associated quantum mode's entropy, and the evolution relation between the interior and exterior entropy is explained. Using this analogy, we predicted the future of information stored in a BH, its radiation, and evaporation. The results are noted in tables (\ref{tab:1}) and (\ref{tab:2}). To apply this analogy in BH space-time, we investigated the interior volume, entropy, and evaluation relation for different types of BHs. Finally, we also investigated the nature of BH radiation and the probability of particle emission during the evaporation process.
This paper describes a rapid feasibility study of using GPT-4, a large language model (LLM), to (semi)automate data extraction in systematic reviews. Despite the recent surge of interest in LLMs there is still a lack of understanding of how to design LLM-based automation tools and how to robustly evaluate their performance. During the 2023 Evidence Synthesis Hackathon we conducted two feasibility studies. Firstly, to automatically extract study characteristics from human clinical, animal, and social science domain studies. We used two studies from each category for prompt-development; and ten for evaluation. Secondly, we used the LLM to predict Participants, Interventions, Controls and Outcomes (PICOs) labelled within 100 abstracts in the EBM-NLP dataset. Overall, results indicated an accuracy of around 80%, with some variability between domains (82% for human clinical, 80% for animal, and 72% for studies of human social sciences). Causal inference methods and study design were the data extraction items with the most errors. In the PICO study, participants and intervention/control showed high accuracy (>80%), outcomes were more challenging. Evaluation was done manually; scoring
Massless Dirac electrons in condensed matter have attracted considerable attention. Unlike conventional electrons, Dirac electrons are described in the form of two-component wave functions. In the surface state of topological insulators, these two components are associated with the spin degrees of freedom, hence governing the magnetic properties. Therefore, the observation of the two-component wave function provides a useful clue for exploring the novel spin phenomena. Here we show that the two-component nature is manifested in the Landau levels (LLs) whose degeneracy is lifted by a Coulomb potential. Using spectroscopic-imaging scanning tunneling microscopy, we visualize energy and spatial structures of LLs in a topological insulator Bi2Se3. The observed potential-induced LL splitting and internal structures of Landau orbits are distinct from those in a conventional electron system and are well reproduced by a two-component model Dirac Hamiltonian. Our model further predicts non-trivial energy-dependent spin-magnetization textures in a potential variation. This provides a way to manipulate spins in the topological surface state.
Increasing demands on medical imaging departments are taking a toll on the radiologist's ability to deliver timely and accurate reports. Recent technological advances in artificial intelligence have demonstrated great potential for automatic radiology report generation (ARRG), sparking an explosion of research. This survey paper conducts a methodological review of contemporary ARRG approaches by way of (i) assessing datasets based on characteristics, such as availability, size, and adoption rate, (ii) examining deep learning training methods, such as contrastive learning and reinforcement learning, (iii) exploring state-of-the-art model architectures, including variations of CNN and transformer models, (iv) outlining techniques integrating clinical knowledge through multimodal inputs and knowledge graphs, and (v) scrutinising current model evaluation techniques, including commonly applied NLP metrics and qualitative clinical reviews. Furthermore, the quantitative results of the reviewed models are analysed, where the top performing models are examined to seek further insights. Finally, potential new directions are highlighted, with the adoption of additional datasets from other rad
Berry curvature physics and quantum geometric effects have been instrumental in advancing topological condensed matter physics in recent decades. Although Landau level-based flat bands and conventional 3D solids have been pivotal in exploring rich topological phenomena, they are constrained by their limited ability to undergo dynamic tuning. In stark contrast, moiré systems have risen as a versatile platform for engineering bands and manipulating the distribution of Berry curvature in momentum space. These moiré systems not only harbor tunable topological bands, modifiable through a plethora of parameters, but also provide unprecedented access to large length scales and low energy scales. Furthermore, they offer unique opportunities stemming from the symmetry-breaking mechanisms and electron correlations associated with the underlying flat bands that are beyond the reach of conventional crystalline solids. A diverse array of tools, encompassing quantum electron transport in both linear and non-linear response regimes and optical excitation techniques, provide direct avenues for investigating Berry physics. This review navigates the evolving landscape of tunable moiré materials, hig
We briefly review the various contexts within which one might address the issue of ``why'' the dimensionless constants of Nature have the particular values that they are observed to have. Both the general historical trend, in physics, of replacing a-priori-given, absolute structures by dynamical entities, and anthropic considerations, suggest that coupling ``constants'' have a dynamical nature. This hints at the existence of observable violations of the Equivalence Principle at some level, and motivates the need for improved tests of the Equivalence Principle.
Understanding how humans conceptualize and categorize natural objects offers critical insights into perception and cognition. With the advent of Large Language Models (LLMs), a key question arises: can these models develop human-like object representations from linguistic and multimodal data? In this study, we combined behavioral and neuroimaging analyses to explore the relationship between object concept representations in LLMs and human cognition. We collected 4.7 million triplet judgments from LLMs and Multimodal LLMs (MLLMs) to derive low-dimensional embeddings that capture the similarity structure of 1,854 natural objects. The resulting 66-dimensional embeddings were stable, predictive, and exhibited semantic clustering similar to human mental representations. Remarkably, the dimensions underlying these embeddings were interpretable, suggesting that LLMs and MLLMs develop human-like conceptual representations of objects. Further analysis showed strong alignment between model embeddings and neural activity patterns in brain regions such as EBA, PPA, RSC, and FFA. This provides compelling evidence that the object representations in LLMs, while not identical to human ones, share
In this study, we investigate how supporting serendipitous discovery and analysis of online product reviews can encourage readers to explore reviews more comprehensively prior to making purchase decisions. We propose two interventions -- Exploration Metrics that can help readers understand and track their exploration patterns through visual indicators and a Bias Mitigation Model that intends to maximize knowledge discovery by suggesting sentiment and semantically diverse reviews. We designed, developed, and evaluated a text analytics system called Serendyze, where we integrated these interventions. We asked 100 crowd workers to use Serendyze to make purchase decisions based on product reviews. Our evaluation suggests that exploration metrics enabled readers to efficiently cover more reviews in a balanced way, and suggestions from the bias mitigation model influenced readers to make confident data-driven decisions. We discuss the role of user agency and trust in text-level analysis systems and their applicability in domains beyond review exploration.