The study demonstrates the capabilities of a vector-based approach for calculating stoichiometric coefficients in chemical equations, using black powder as an illustrative example. A method is proposed for selecting and constraining intermediate interactions between reactants, as well as for identifying final products. It is shown that even a small number of components can lead to a large number of final and intermediate products. Through concrete calculations, a correlation is established between the number of possible chemical equations and the number of reactants. A methodology is proposed for computing all possible chemical equations within a reaction system for arbitrary component ratios, enabling the derivation of all feasible chemical reactions. Additionally, a method is developed for calculating the chemical composition for a fixed set of reactants, allowing for the evaluation of the set of products resulting from all possible chemical interactions given a specified initial composition.
The longest increasing subsequence (LIS) of a random walk has been studied mainly for zero-mean, symmetric step increments. We numerically investigate the LIS of biased Gaussian random walks, with unit-variance increments and positive drift $μ_{p} = Φ^{-1}(p)$, where $p = P(ξ>0)$. In contrast with the symmetric case, we find that for every fixed $p>1/2$ the mean LIS length grows linearly, $\langle L_{n}(p)\rangle \sim a(p)n$, with $a(p)$ increasing from $0$ at $p=1/2$ to $1$ as $p \to 1$. The record count is also linear, with coefficient $λ(p)$ fixed by Spitzer's formula for the ascending ladder epoch, and the LIS becomes increasingly aligned with this record skeleton as $p$ grows. At the symmetric point $p=1/2$, the record skeleton collapses to the Sparre Andersen $\sqrt{n}$ scale, while the LIS returns to the finite-variance $\sqrt{n}\log{n}$ regime. Near this limit the record rate has the closed-form small-drift slope $λ(μ_{p}) \simeq \sqrt{2}\,μ_{p}$, whereas the excess $a(μ_{p})-λ(μ_{p})$ vanishes more slowly than linearly in the drift, although our data do not resolve a single power law. The empirical distribution of $L_{n}$ also changes across this point, from lognorma
This paper proves a conditional structural uniqueness theorem for induced weight on robust record sectors within an admissible Hilbert record layer. Its theorem target and additive carrier differ from those of the standard Born-rule routes: additivity is not placed on the full projector lattice, but on disjoint admissible continuation bundles through an extensive bundle valuation, from which the sector-level additive law is inherited under admissible refinement. Accordingly, the result is not a Gleason-type representation theorem in different language, but a distinct uniqueness theorem about induced sector weight inherited from bundle additivity on admissible continuation structure. Under two explicit structural conditions, internal equivalence of admissible binary refinement profiles and sufficient admissible refinement richness, the quadratic assignment is the only non-negative refinement-stable induced weight on robust record sectors. In the main theorem, refinement richness is secured by admissible binary saturation. A supplementary proposition shows that dense admissible saturation already suffices if continuity of the profile function is added. Under normalization, the result
Achieving chemical accuracy for molecular simulations remains a central challenge in computational chemistry. Here, we present an embedded correlated wavefunction transfer learning (ECW-TL) framework for accurately simulating molecular dynamics in the condensed phase. ECW-TL incorporates high-level electron exchange and correlation effects in ECW theory while preserving training and computational efficiency of machine learned interatomic potentials. We demonstrate the framework on Ca2+-CO32- ion pairing in aqueous solution, a key process underlying CO2 mineralization in seawater. As proof of principle, we first show that finetuning a DFT-revPBE-D3(BJ) baseline model with embedded-DFT-SCAN data reproduces the DFT-SCAN free-energy surface within 1 kcal/mol across all solvation states. Extending the framework to embedded MP2 and localized natural-orbital CCSD(T) further refines the free-energy profile, revealing the crucial role of exact electron exchange and correlation in determining ion-pair stability and structure. ECW-TL thus provides a general, data-efficient route for transferring CW accuracy to large-scale simulations of complex aqueous and interfacial chemical processes.
Developing accurate models for chemical reactors is often challenging due to the complexity of reaction kinetics and process dynamics. Traditional approaches require retraining models for each new system, limiting generalizability and efficiency. In this work, we take a step toward foundation models for chemical reactor modeling by introducing a neural network framework that generalizes across diverse reactor types and rapidly adapts to new chemical processes. Our approach leverages meta-learning to pretrain the model on a broad set of reactor dynamics, enabling efficient adaptation to unseen reactions with minimal data. To further enhance generalizability, we incorporate physics-informed fine-tuning, ensuring physically consistent adaptation to new reactor conditions. Our framework is evaluated across three integer-order fundamental reactor types - continuous stirred tank reactors, batch reactors, and plug flow reactors - demonstrating superior few-shot adaptation compared to conventional data-driven, physics-informed, and transfer learning approaches. By combining meta-learning with physics-informed adaptation, this work lays the foundation for a generalizable modeling framework,
A physically motivated equation that determines the number of electrons of a molecule is proposed based on chemical common sense. It shows that all molecules are entangled in the number of electrons and results in the fundamental assumption of molecular energy convexity that underpins molecular quantum mechanics. The proposed physical principle includes the molecular size consistency principle as a special case. Application of wavefunction theory to the principle shows that an individual molecule with a noninteger number of electrons is locally physical albeit locally unreal. The energy of a molecule is piecewise linear with respect to its continuous number of electrons. The continuity of the number of electrons allows the definition of an electronic chemical potential of a single molecule. A state function equivalent to the energy of a molecule can be defined using the chemical potential as a variable. The aforementioned physical principle can alternatively be expressed as a simple additivity with the new state function. The latter also shows that the quantum entanglement in the number of electrons can be viewed as all molecules sharing the same chemical potential.
Optical detection of magnetic resonance enables spin-based quantum sensing with high spatial resolution and sensitivity-even at room temperature-as exemplified by solid-state defects. Molecular systems provide a complementary, chemically tunable, platform for room-temperature optically detected magnetic resonance (ODMR)-based quantum sensing. A critical parameter governing sensing sensitivity is the optical contrast-i.e., the difference in emission between two spin states. In state-of-the-art solid-state defects such as the nitrogen-vacancy center in diamond, this contrast is approximately 30%. Here, capitalizing on chemical tunability, we show that room-temperature ODMR contrasts of 40% can be achieved in molecules. Using a nitrogen-substituted analogue of pentacene (6,13-diazapentacene), we enhance contrast compared to pentacene and, by determining the triplet kinetics through time-dependent pulsed ODMR, show how this arises from accelerated anisotropic intersystem crossing. Furthermore, we translate high-contrast room-temperature pulsed ODMR to self-assembled nanocrystals. Overall, our findings highlight the synthetic handles available to optically readable molecular spins and t
[Abridged] This review paper discussed which chemical effects may be at play in a planet-forming disk midplane, which effects are relevant under different conditions, and which tools are available for modelling chemical kinetics in a disk midplane. The review goes on to discuss some important efforts in the planet formation modelling community to treat chemical evolution, and, vice versa, efforts in the chemical modelling community to implement more physical effects related to planet formation into the chemical modelling. The aim of this review is both to outline some concepts related to planet formation chemistry, but also to encourage, not just collaboration between the planet formation modelling community and the astrochemical community, but also assistance and guidance from one community to the other. Guidance, regarding which effects, out of many, might be more relevant than others under certain planet formation conditions, and regarding why certain included effects lead to certain important modelling outcomes. As the research fields of exoplanet atmospheres and protoplanetary disks near new frontiers in observational insights with upcoming facilities, developing appropriate m
Diffusion models revolutionize image generation by leveraging natural language to guide the creation of multimedia content. Despite significant advancements in such generative models, challenges persist in depicting detailed human-object interactions, especially regarding pose and object placement accuracy. We introduce a training-free method named Reasoning and Correcting Diffusion (ReCorD) to address these challenges. Our model couples Latent Diffusion Models with Visual Language Models to refine the generation process, ensuring precise depictions of HOIs. We propose an interaction-aware reasoning module to improve the interpretation of the interaction, along with an interaction correcting module to refine the output image for more precise HOI generation delicately. Through a meticulous process of pose selection and object positioning, ReCorD achieves superior fidelity in generated images while efficiently reducing computational requirements. We conduct comprehensive experiments on three benchmarks to demonstrate the significant progress in solving text-to-image generation tasks, showcasing ReCorD's ability to render complex interactions accurately by outperforming existing metho
In this work, we study an integrated fault detection and classification framework called FARM for fast, accurate, and robust online chemical process monitoring. The FARM framework integrates the latest advancements in statistical process control (SPC) for monitoring nonparametric and heterogeneous data streams with novel data analysis approaches based on Riemannian geometry together in a hierarchical framework for online process monitoring. We conduct a systematic evaluation of the FARM monitoring framework using the Tennessee Eastman Process (TEP) dataset. Results show that FARM performs competitively against state-of-the-art process monitoring algorithms by achieving a good balance among fault detection rate (FDR), fault detection speed (FDS), and false alarm rate (FAR). Specifically, FARM achieved an average FDR of 96.97% while also outperforming benchmark methods in successfully detecting hard-to-detect faults that are previously known, including Faults 3, 9 and 15, with FDRs being 97.08%, 96.30% and 95.99%, respectively. In terms of FAR, our FARM framework allows practitioners to customize their choice of FAR, thereby offering great flexibility. Moreover, we report a significa
We develop a framework for on-the-fly machine learned force field molecular dynamics simulations based on the multipole featurization scheme that overcomes the bottleneck with the number of chemical elements. Considering bulk systems with up to 6 elements, we demonstrate that the number of density functional theory calls remains approximately independent of the number of chemical elements, in contrast to the increase in the smooth overlap of atomic positions scheme.
Record linkage (de-duplication or entity resolution) is the process of merging noisy databases to remove duplicate entities. While record linkage removes duplicate entities from such databases, the downstream task is any inferential, predictive, or post-linkage task on the linked data. One goal of the downstream task is obtaining a larger reference data set, allowing one to perform more accurate statistical analyses. In addition, there is inherent record linkage uncertainty passed to the downstream task. Motivated by the above, we propose a generalized Bayesian record linkage method and consider multiple regression analysis as the downstream task. Records are linked via a random partition model, which allows for a wide class to be considered. In addition, we jointly model the record linkage and downstream task, which allows one to account for the record linkage uncertainty exactly. Moreover, one is able to generate a feedback propagation mechanism of the information from the proposed Bayesian record linkage model into the downstream task. This feedback effect is essential to eliminate potential biases that can jeopardize resulting downstream task. We apply our methodology to multip
Chemical reaction prediction, involving forward synthesis and retrosynthesis prediction, is a fundamental problem in organic synthesis. A popular computational paradigm formulates synthesis prediction as a sequence-to-sequence translation problem, where the typical SMILES is adopted for molecule representations. However, the general-purpose SMILES neglects the characteristics of chemical reactions, where the molecular graph topology is largely unaltered from reactants to products, resulting in the suboptimal performance of SMILES if straightforwardly applied. In this article, we propose the root-aligned SMILES (R-SMILES), which specifies a tightly aligned one-to-one mapping between the product and the reactant SMILES for more efficient synthesis prediction. Due to the strict one-to-one mapping and reduced edit distance, the computational model is largely relieved from learning the complex syntax and dedicated to learning the chemical knowledge for reactions. We compare the proposed R-SMILES with various state-of-the-art baselines and show that it significantly outperforms them all, demonstrating the superiority of the proposed method.
Approximations based on moment-closure (MA) are commonly used to obtain estimates of the mean molecule numbers and of the variance of fluctuations in the number of molecules of chemical systems. The advantage of this approach is that it can be far less computationally expensive than exact stochastic simulations of the chemical master equation. Here we numerically study the conditions under which the MA equations yield results reflecting the true stochastic dynamics of the system. We show that for bistable and oscillatory chemical systems with deterministic initial conditions, the solution of the MA equations can be interpreted as a valid approximation to the true moments of the CME, only when the steady-state mean molecule numbers obtained from the chemical master equation fall within a certain finite range. The same validity criterion for monostable systems implies that the steady-state mean molecule numbers obtained from the chemical master equation must be above a certain threshold. For mean molecule numbers outside of this range of validity, the MA equations lead to either qualitatively wrong oscillatory dynamics or to unphysical predictions such as negative variances in the mo
Fermi operator expansion (FOE) methods are powerful alternatives to diagonalization type methods for solving Kohn-Sham density functional theory (KSDFT). One example is the pole expansion and selected inversion (PEXSI) method, which approximates the Fermi operator by rational matrix functions and reduces the computational complexity to at most quadratic scaling for solving KSDFT. Unlike diagonalization type methods, the chemical potential often cannot be directly read off from the result of a single step of evaluation of the Fermi operator. Hence multiple evaluations are needed to be sequentially performed to compute the chemical potential to ensure the correct number of electrons within a given tolerance. This hinders the performance of FOE methods in practice. In this paper we develop an efficient and robust strategy to determine the chemical potential in the context of the PEXSI method. The main idea of the new method is not to find the exact chemical potential at each self-consistent-field (SCF) iteration iteration, but to dynamically and rigorously update the upper and lower bounds for the true chemical potential, so that the chemical potential reaches its convergence along th
Large Language Models (LLMs) have substantially driven scientific progress in various domains, and many papers have demonstrated their ability to tackle complex problems with creative solutions. Our paper introduces a new foundation model, nach0, capable of solving various chemical and biological tasks: biomedical question answering, named entity recognition, molecular generation, molecular synthesis, attributes prediction, and others. nach0 is a multi-domain and multi-task encoder-decoder LLM pre-trained on unlabeled text from scientific literature, patents, and molecule strings to incorporate a range of chemical and linguistic knowledge. We employed instruction tuning, where specific task-related instructions are utilized to fine-tune nach0 for the final set of tasks. To train nach0 effectively, we leverage the NeMo framework, enabling efficient parallel optimization of both base and large model versions. Extensive experiments demonstrate that our model outperforms state-of-the-art baselines on single-domain and cross-domain tasks. Furthermore, it can generate high-quality outputs in molecular and textual formats, showcasing its effectiveness in multi-domain setups.
The blue bottle experiment was first introduced to the chemical education literature as a simple demonstration on kinetics. Its original formulation contains only glucose, NaOH and small amount of methylene blue. The solution turns blue when shaken and fades to colorless upon standing. This bluing/de-bluing cycle may be repeated and may be compared to blood colors in animal's respiratory cycle. Inspired by the blue bottle experiment, the colorful chemical bottle experiment kit was commercially developed in 2006. The kit is a versatile pedagogical tool, not only for physical chemistry but also for analytical, biological and organic chemistry. It also helps teaching concepts in scientific method and laboratory safety. This manuscript contains four parts, brief review on literature relating to the blue bottle experiment, description of the colorful chemical bottle experiment kit, pedagogical discussion of the experiments and preliminary evaluation from students.
Macroscopic entropy production $σ^{(tot)}$ in the general nonlinear isothermal chemical reaction system with mass action kinetics is decomposed into a free energy dissipation and a house-keeping heat: $σ^{(tot)}=σ^{(fd)}+σ^{(hk)}$; $σ^{(fd)}=-\rd A/\rd t$, where $A$ is a generalized free energy function. This yields a novel nonequilibrium free energy balance equation $\rd A/\rd t=-σ^{(tot)}+σ^{(hk)}$, which is on a par with celebrated entropy balance equation $\rd S/\rd t=σ^{(tot)}+η^{(ex)}$ where $η^{(ex)}$ is the rate of entropy exchange with the environment.For kinetic systems with complex balance, $σ^{(fd)}$ and $σ^{(hk)}$ are the macroscopic limits of stochastic free energy dissipation and house-keeping heat, which are both nonnegative, in the Delbrück-Gillespie description of the stochastic chemical kinetics.Therefore, we show that a full kinetic and thermodynamic theory of chemical reaction systems that transcends mesoscopic and macroscopic levels emerges.
Classical theories of chemical kinetics assume independent reactions in dilute solutions, whose rates are determined by mean concentrations. In condensed matter, strong interactions alter chemical activities and create inhomogeneities that can dramatically affect the reaction rate. The extreme case is that of a reaction coupled to a phase transformation, whose kinetics must depend on the order parameter -- and its gradients, at phase boundaries. This Account presents a general theory of chemical kinetics based on nonequilibrium thermodynamics. The reaction rate is a nonlinear function of the thermodynamic driving force (free energy of reaction) expressed in terms of variational chemical potentials. The Cahn-Hilliard and Allen-Cahn equations are unified and extended via a master equation for non-equilibrium chemical thermodynamics. For electrochemistry, both Marcus and Butler-Volmer kinetics are generalized for concentrated solutions and ionic solids. The theory is applied to intercalation dynamics in the phase separating Li-ion battery material Li$_x$FePO$_4$.
Researchers discovered that hydrogen radicals generated by intense UV light can break down stubborn PFAS “forever chemicals” without added chemicals。 The breakthrough reveals a key mechanism that could lead to greener and more effective technologies for permanently destroying these pollutants