Track Circuits (TC) are the main signalling devices used to detect the presence of a train on a rail track. It has been used since the 19th century and nowadays there are many types depending on the technology. As a general classification, Track Circuits can be divided into 2 main groups, DC (Direct Current) and AC (Alternating Current) circuits. This work is focused on a particular AC track circuit, called "Smart Train Detection System" (STDS), designed with both high and low-frequency bands. This approach uses STDS current data applied to an SVM (support vector machine) classifier as a type of failure identifier. The main purpose of this work consists on determine automatically which is the component of the track that is failing to improve the maintenance action. Model was trained to classify 15 different failures that belong to 3 more general categories. The method was tested with field data from 10 different track circuits and validated by the STDS track circuit expert and maintainers. All use cases were correctly classified by the method.
Traditional Shot Boundary Detection (SBD) inherently struggles with complex transitions by formulating the task around isolated cut points, frequently yielding corrupted video shots. We address this fundamental limitation by formalizing the Shot Transition Detection (STD) task. Rather than searching for ambiguous points, STD explicitly detects the continuous temporal segments of transitions. To tackle this, we propose TransVLM, a Vision-Language Model (VLM) framework for STD. Unlike regular VLMs that predominantly rely on spatial semantics and struggle with fine-grained inter-shot dynamics, our method explicitly injects optical flow as a critical motion prior at the input stage. Through a simple yet effective feature-fusion strategy, TransVLM directly processes concatenated color and motion representations, significantly enhancing its temporal awareness without incurring any additional visual token overhead on the language backbone. To overcome the severe class imbalance in public data, we design a scalable data engine to synthesize diverse transition videos for robust training, alongside a comprehensive benchmark for STD. Extensive experiments demonstrate that TransVLM achieves su
Using consteval from C++23, we implement efficient, new versions of std::map and std::unordered_map for use when the keys are known at compile time. We demonstrate superior performance of our unordered_map on three demonstration use-cases: Lookup of elemental mass from atomic symbol, lookup of amino acid from codon, and modification of stock prices from S&P 500 ticker symbols all produced runtimes <40%, <35%, <73% of the respective runtimes of the std implementations. Our library runimes were <80%, <45%, <97% of the lookup time of Frozen, an alternative perfect hashing implementation in C++ for problems also using constexpr keys. To our knowledge, this makes our library the overall fastest drop-in (i.e., with a similar API) alternative to std::unordered_map. On one arbitrarily chosen demo, we demonstrate runtimes <35% of PTHash and <89% gperf, state-of-the-art but not drop-in hashing libraries via external tools.
Learning discrete speech representations that preserve similarity across variable-length utterances is central to query-by-example spoken term detection (QbE-STD). While wav2tok introduced CTC-based sequence alignment to enforce token consistency, its tightly coupled clustering and alignment training recipe limits scalability. We propose wav2tok 2.0, a scalable alignment-aware speech tokenizer built on the BEST-STD backbone. wav2tok 2.0 employs staged training, first learning discriminative, speaker-invariant representations via contrastive learning and vector quantization, and then enforcing pairwise token consistency using a CTC alignment loss and a novel DTW-aligned framewise prediction objective with adaptive weighting. Experiments show that wav2tok 2.0 consistently outperforms BEST-STD and general-purpose tokenizers on QbE-STD while remaining efficient and scalable.
Multi-dimensional entangled photon states represent an important resource in quantum communication networks. Specifically, hyperentangled states presenting simultaneous entanglement in several degrees of freedom (DoF), stand out for their noise resilience and information capacity. In this work, we demonstrate the generation of hyperentangled photon pairs in the time and frequency-bin domain by spontaneous four-wave mixing from the coherent driving of two integrated Silicon microresonators. We demonstrate entanglement in each DoF by proving the violation of the Clauser Horne Shimony Holt (CHSH) inequality by more than 27 standard deviations (STDs) in each reduced space. Genuine hyperentanglement is then assessed from the negativity of an hyperentanglement witness, which is verified by more than 60 STDs. These results mark, to the best of our knowledge, the first demonstration of time-frequency bin hyperentanglement in an integrated silicon photonic device.
Sexually transmitted diseases (STDs) are a group of pathogens infecting new hosts through sexual interactions. Due to its social and economic burden, multiple models have been proposed to study the spreading of pathogens. In parallel, in the ever-evolving landscape of digital social interactions, the pervasive utilization of dating apps has become a prominent facet of modern society. Despite the surge in popularity and the profound impact on relationship formation, a crucial gap in the literature persists regarding the potential ramifications of dating apps usage on the dynamics of STDs. In this paper, we address this gap by presenting a novel mathematical framework - an extended Susceptible-Infected-Susceptible (SIS) epidemiological model to elucidate the intricate interplay between dating apps engagement and the propagation of STDs. Namely, as dating apps are designed to make users revisit them and have mainly casual sexual interactions with other users, they increase the number of causal partners, which increases the overall spread of STDS. Using extensive simulation, based on real-world data, explore the effect of dating apps adoption and control on the STD spread. We show that
We present an exact analytical equation for the Shapiro time delay (STD) due to a spherical non-rotating body. As a result, accurate values of the STD in comparison with first and second-order expressions for Schwarzschild spacetime (1Sch and 2Sch) and first-order post-Newtonian formalism (1PN) are achieved. Accordingly, the lowest STD discrepancies between our exact equation and these approximations lie within the picosecond and sub-picosecond level for light beams affected by the Sun's gravity. Our results might be useful for time delay measurements in the solar system or extragalactic binary pulsar systems, where a high accuracy level is required.
We show that any symplectic filling of the standard contact submanifold $(\mathbb{S}^{2n-1},ξ_{\mathrm{std}})$ of $(\mathbb{S}^{2n+1},ξ_{\mathrm{std}})$ in $(\mathbb{D}^{n+1},ω_{\mathrm{std}})$ is smoothly unknotted if $n\ge 2$. We also give a self-contained proof of the Siefring intersection formula between punctured holomorphic curves and holomorphic hypersurfaces used in the proof using the $L$-simple setup of Bao-Honda.
Fast and accurate spoken content retrieval is vital for applications such as voice search. Query-by-Example Spoken Term Detection (STD) involves retrieving matching segments from an audio database given a spoken query. Token-based STD systems, which use discrete speech representations, enable efficient search but struggle with robustness to noise and reverberation, and with inefficient token utilization. We address these challenges by proposing a noise and reverberation-augmented training strategy to improve tokenizer robustness. In addition, we introduce optimal transport-based regularization to ensure balanced token usage and enhance token efficiency. To further speed up retrieval, we adopt a TF-IDF-based search mechanism. Empirical evaluations demonstrate that the proposed method outperforms STD baselines across various distortion levels while maintaining high search efficiency.
The Giry monad on the category of measurable spaces restricts to the full subcategory of standard Borel spaces, $\mathbf{Std}$, which we show is amenable to analysis. $\mathbf{Std}$ contains the space $\mathbb{R}_{\infty}$ which is the one-point compactification of the real numbers. By viewing probability measures $P \in \mathcal{G}(A)$ as functionals operating on measurable functions $A \rightarrow \mathbb{R}_{\infty}$, and taking the restriction of those functionals to operate on affine measurable functions we show that $A \cong Hom_{\mathbb{R}_{\infty}^{\mathbb{R}_{\infty}}}(\mathbb{R}_{\infty}^A|,\mathbb{R}_{\infty})$ for all object $A$ lying in the subcategory $\mathbf{Std}_{Cvx}$ of $\mathbf{Std}$. The objects of $\mathbf{Std}_{Cvx}$ are standard spaces with a convex space structure which satisfies the generic ``fullness property''. The morphisms of the category $\mathbf{Std}_{Cvx}$ are affine measurable functions. The isomorphism is equivalent to the statement that the full subcategory of $\mathbf{Std}_{Cvx}$ consisting of the single object $\mathbb{R}_{\infty}$ is codense in $\mathbf{Std}_{Cvx}$ which allows us to easily construct the $\mathcal{G}$-algebras of objects in $\
std::string view is a reference-like data structure in the C++ Standard Template Library (STL) that enables fast and cheap processing of read-only strings. Due to its wide applicability and performance enhancing power, std::string view has been very popular since its introduction in the C++17 standard. However, its careless use can lead to serious memory management bugs. As the lifetime of a std::string view is not tied to the lifetime of the referenced string in any way, it is the user's responsibility to ensure that the view is only used while the viewed string is live and its buffer is not reallocated. This paper describes a static analysis tool that finds programming errors caused by the incorrect use of std::string view. Our work included modeling std::string view operations in the analysis, defining steps to detect lifetime errors, constructing user-friendly diagnostic messages, and performing an evaluation of the checker.
Spoken term detection (STD) is often hindered by reliance on frame-level features and the computationally intensive DTW-based template matching, limiting its practicality. To address these challenges, we propose a novel approach that encodes speech into discrete, speaker-agnostic semantic tokens. This facilitates fast retrieval using text-based search algorithms and effectively handles out-of-vocabulary terms. Our approach focuses on generating consistent token sequences across varying utterances of the same term. We also propose a bidirectional state space modeling within the Mamba encoder, trained in a self-supervised learning framework, to learn contextual frame-level features that are further encoded into discrete tokens. Our analysis shows that our speech tokens exhibit greater speaker invariance than those from existing tokenizers, making them more suitable for STD tasks. Empirical evaluation on LibriSpeech and TIMIT databases indicates that our method outperforms existing STD baselines while being more efficient.
Spatial-temporal forecasting and imputation are important for real-world intelligent systems. Most existing methods are tailored for individual forecasting or imputation tasks but are not designed for both. Additionally, they are less effective for zero-shot and few-shot learning. While pre-trained language model (PLM) have exhibited strong pattern recognition and reasoning abilities across various tasks, including few-shot and zero-shot learning, their applications in spatial-temporal data understanding has been constrained by insufficient modeling of complex correlations such as the temporal correlations, spatial connectivity, non-pairwise and high-order spatial-temporal correlations within data. In this paper, we propose STD-PLM for understanding both spatial and temporal properties of \underline{S}patial-\underline{T}emporal \underline{D}ata with \underline{PLM}, which is capable of implementing both spatial-temporal forecasting and imputation tasks. STD-PLM understands spatial-temporal correlations via explicitly designed spatial and temporal tokenizers. Topology-aware node embeddings are designed for PLM to comprehend and exploit the topology structure of data in inductive ma
Std $Q$-target is a conservative, actor-critic, ensemble, $Q$-learning-based algorithm, which is based on a single key $Q$-formula: $Q$-networks standard deviation, which is an "uncertainty penalty", and, serves as a minimalistic solution to the problem of overestimation bias. We implement SQT on top of TD3/TD7 code and test it against the state-of-the-art (SOTA) actor-critic algorithms, DDPG, TD3 and TD7 on seven popular MuJoCo and Bullet tasks. Our results demonstrate SQT's $Q$-target formula superiority over TD3's $Q$-target formula as a conservative solution to overestimation bias in RL, while SQT shows a clear performance advantage on a wide margin over DDPG, TD3, and TD7 on all tasks.
Spintronic diodes (STDs) are emerging as a technology for the realization of high-performance microwave detectors. The key advantages of such devices are their high sensitivity, capability to work at low input power, and compactness. In this work, we show a possible use of STDs for neuromorphic computing expanding the realm of their functionalities to implement analog multiplication, which is a key operation in convolutional neural networks (CNN). In particular, we introduce the concept of degree of rectification (DOR) in injection-locked STDs. Micromagnetic simulations are used to design and identify the working range of the STDs for the implementation of the DOR. Previous experimental data confirm the applicability of the proposed solution, which is tested in image processing and in a CNN that recognizes handwritten digits.
The integration of the global Photovoltaic (PV) market with real time data-loggers has enabled large scale PV data analytical pipelines for power forecasting and long-term reliability assessment of PV fleets. Nevertheless, the performance of PV data analysis heavily depends on the quality of PV timeseries data. This paper proposes a novel Spatio-Temporal Denoising Graph Autoencoder (STD-GAE) framework to impute missing PV Power Data. STD-GAE exploits temporal correlation, spatial coherence, and value dependencies from domain knowledge to recover missing data. Experimental results show that STD-GAE can achieve a gain of 43.14% in imputation accuracy and remains less sensitive to missing rate, different seasons, and missing scenarios, compared with state-of-the-art data imputation methods such as MIDA and LRTC-TNN.
Spatiotemporal forecasting techniques are significant for various domains such as transportation, energy, and weather. Accurate prediction of spatiotemporal series remains challenging due to the complex spatiotemporal heterogeneity. In particular, current end-to-end models are limited by input length and thus often fall into spatiotemporal mirage, i.e., similar input time series followed by dissimilar future values and vice versa. To address these problems, we propose a novel self-supervised pre-training framework Spatial-Temporal-Decoupled Masked Pre-training (STD-MAE) that employs two decoupled masked autoencoders to reconstruct spatiotemporal series along the spatial and temporal dimensions. Rich-context representations learned through such reconstruction could be seamlessly integrated by downstream predictors with arbitrary architectures to augment their performances. A series of quantitative and qualitative evaluations on six widely used benchmarks (PEMS03, PEMS04, PEMS07, PEMS08, METR-LA, and PEMS-BAY) are conducted to validate the state-of-the-art performance of STD-MAE. Codes are available at https://github.com/Jimmy-7664/STD-MAE.
The purpose of the present study is to search one-dimensional Cellular Automata (CA) rules which will solve the density classification task (DCT) perfectly. The mathematical analysis of number conserving functions over binary strings of length n gives an indication of its corresponding number conserving cellular automata rules (either uniform or non-uniform). The state transition diagrams (STDs) of number conserving CA rules have been analyzed where it has been found that these STDs can generate different DCT solutions. While studying the properties of STDs, an interesting classification of binary strings could be made where equal weight strings form a class and the cardinality of each class is same as the binomial coefficient nCk; n is the length and k is the weight of the binary string. Apart from STDs, other deterministic methods have been proposed to obtain the exact solution of DCT. All these exact solutions of DCT using different deterministic methods can be viewed as an improvement over the soft computing techniques used earlier to obtain approximate solutions.
This paper presents a phenomenological study on the angle between the Standard and the Winner-Take-All (WTA) jet axes ($ΔR_{\rm axis}^{\rm WTA-Std}$) in high-energy nuclear collisions. The $p$+$p$ baseline is provided by the Pythia8 event generator. The in-medium jet propagation is simulated by the linear Boltzmann transport (LBT) model, which considers both the elastic and inelastic jet-medium interactions. Our theoretical results calculated by the LBT model show that the $ΔR_{\rm axis}^{\rm WTA-Std}$ distribution in Pb+Pb at $\sqrt{s}=5.02$ TeV is narrower than that in $p$+$p$, which agrees well with the recent ALICE measurements. The narrowing of $ΔR_{\rm axis}^{\rm WTA-Std}$ seems to violate the $p_T$-broadening nature of the jet quenching effect, usually explained by the influence of "selection bias". However, the physical details still need to be fully understood. Utilizing a matching-jet method to track the jet evolution in the QGP to remove the selection bias in the Monte Carlo simulations, we observe that the $ΔR_{\rm axis}^{\rm WTA-Std}$ distribution becomes broader due to the jet-medium interactions. At the same time, by rescaling the quark/gluon-jet fractions in Pb+Pb c
We prove that the examples by Smith and McMullen-Taubes provide infinitely many counterexamples to one direction of Donaldson's 4-6 question and the closely related Stabilising Conjecture. These are the first known counterexamples. In the other direction, we show that the Gromov-Witten invariants of two simply-connected closed symplectic $4$-manifolds, whose products with $(S^2,ω_{\text{std}})$ are deformation equivalent, agree. In particular, when $b_2^+ \geq 2$, these $4$-manifolds have the same Seiberg-Witten invariants. Furthermore, one can replace $(S^2,ω_{\text{std}})$ by $(S^2,ω_{\text{std}})^k$ for any $k \geq 1$ in both results.