The conditions under which binary black hole (BBH) mergers embedded in active galactic nucleus (AGN) disks produce detectable optical counterparts remain poorly constrained observationally. We report multi-epoch optical imaging and spectroscopic follow-up of S240413p, an O4 BBH candidate with 98\% classification confidence, obtained with the T80-South telescope through the S-PLUS Transient Extension Program (STEP). Our observations cover the 99\% credible region across epochs that span $\sim$300 days post-merger. We prioritize AGN-hosted environments and identify two transient candidates, STEP2024gab/ZTF18acvgziq and STEP2024phe/ZTF19aaflhnr. SOAR/Goodman spectroscopy and archival DESI spectra yield host supermassive black hole masses of $\log M_\mathrm{SMBH}/\mathrm{M}_\odot = 7.15 \pm 0.05$ and $8.02 \pm 0.04$. We compute predicted flare delay distributions for each host using a thermal radiation-driven outflow emission model and the spectroscopically derived host properties. Migration traps produced by thermal torques occur at $R_\text{BH}/R_g \approx 10^{4.2}$ and $10^{3.4}$ for the two hosts, with predicted flare delays spanning tens to several hundred days; our late epoch at
This work addresses the well-known Maximum Independent Set problem in the context of hypergraphs. While this problem has been extensively studied on graphs, we focus on its strong extension to hypergraphs, where edges may connect any number of vertices. A set of vertices in a hypergraph is strongly independent if there is at most one vertex per edge in the set. One application for this problem is to find perfect minimal hash functions. We propose nine new data reduction rules specifically designed for this problem. Our reduction routine can serve as a preprocessing step for any solver. We analyze the impact on the size of the reduced instances and the performance of several subsequent solvers when combined with this preprocessing. Our results demonstrate a significant reduction in instance size and improvements in running time for subsequent solvers. The preprocessing routine reduces instances, on average, to 22% of their original size in 6.76 seconds. When combining our reduction preprocessing with the best-performing exact solver, we observe an average speedup of 3.84x over not using the reduction rules. In some cases, we can achieve speedups of up to 53x. Additionally, one more
Real-time ranking of optical transient candidates during gravitational-wave (GW) and multimessenger follow-up is challenging when only sparse early-time, multi-band photometry is available.We present \texttt{KilonovaSCORER}, an open-source framework for scoring and ranking in this regime. It quantifies the consistency of each candidate with a physically motivated kilonova model grid in absolute magnitude space using two complementary per-observation metrics, $P_{\mathrm{tail},\mathrm{KNe}}$ and $P_{\mathrm{near},\mathrm{KNe}}$. These are aggregated into a cumulative ranking score via inverse-variance weighting in logit space, naturally accounting for heterogeneous observational uncertainties across bands and epochs. A sequential Approximate Bayesian Computation (ABC) diagnostic tracks photometric consistency across epochs, penalizing candidates whose temporal evolution is incompatible with kilonova expectations. We validate the framework on AT\,2017gfo and SN\,2025ulz, and test it against supernova simulations under a realistic Rubin/LSST Target-of-Opportunity strategy. The framework recovers kilonova candidates with high confidence while ruling out supernova contaminants within fi
We present RoIt-XMASA, a multilingual dataset that extends the Cross-lingual Multi-domain Amazon Sentiment Analysis to Italian and Romanian, comprising 36,000 labeled reviews across three domains (books, movies, and music) and 202,141 unlabeled samples. To address cross-lingual and cross-domain challenges, we propose a multi-target adversarial training framework that employs loss reversal with meta-learned coefficients to dynamically balance sentiment discrimination with domain and language invariance. XLM-R achieves an F1-score of 66.23% with our approach, outperforming the baseline by 4.64%. Few-shot evaluation shows that Llama-3.1-8B achieves 58.43% F1-score, revealing a meaningful trade-off between the efficiency of prompting-based approaches and the higher performance of task-specific fine-tuning.
Gravitational wave sources with electromagnetic counterparts have highlighted the need for predictive, interpretable models linking the parameters of compact binary systems to post-merger remnants and mass outflows. In this work, we explore AI-driven symbolic regression (SR) frameworks to derive updated analytical relations for disk ejecta mass in binary neutron star mergers, trained on state-of-the-art numerical relativity simulations. Our method reveals a set of compact equations that outperform existing fitting formulae across multiple statistical metrics while remaining physically interpretable. Notably, SR also enables alternative predictor sets (e.g., $\{M_1,M_2,\tildeΛ\}$) that match or exceed the accuracy of models relying solely on compactness of the lightest neutron star ($C_1$), enabling new parameter constraints from electromagnetic observations. Unlike traditional black-box machine learning models, these closed-form expressions generalize robustly to regions of the parameter space not represented in the training data, offering a physics-informed tool for multimessenger observations and constraints on the neutron star equation of state.
Vision-language models (VLMs) have the potential to become co-pilots for pathologists. However, most VLMs either focus on small regions of interest within whole-slide images, provide only static slide-level outputs, or rely on data that is not publicly available, limiting reproducibility. Furthermore, training data containing WSIs paired with detailed clinical reports is scarce, restricting progress toward transparent and generalisable VLMs. We address these limitations with three main contributions. First, we introduce Polysome, a standardised tool for synthetic instruction generation. Second, we apply Polysome to the public HISTAI dataset, generating HISTAI-Instruct, a large whole-slide instruction tuning dataset spanning 24,259 slides and over 1.1 million instruction-response pairs. Finally, we use HISTAI-Instruct to train ANTONI-α, a VLM capable of visual-question answering (VQA). We show that ANTONI-α outperforms MedGemma on WSI-level VQA tasks of tissue identification, neoplasm detection, and differential diagnosis. We also compare the performance of multiple incarnations of ANTONI-α trained with different amounts of data. All methods, data, and code are publicly available.
Graph modification problems are computational tasks where the goal is to change an input graph $G$ using operations from a fixed set, in order to make the resulting graph satisfy a target property, which usually entails membership to a desired graph class $\mathcal{C}$. Some well-known examples of operations include vertex-deletion, edge-deletion, edge-addition and edge-contraction. In this paper we address an operation known as subgraph complement. Given a graph $G$ and a subset $S$ of its vertices, the subgraph complement $G \oplus S$ is the graph resulting of complementing the edge set of the subgraph induced by $S$ in $G$. We say that a graph $H$ is a subgraph complement of $G$ if there is an $S$ such that $H$ is isomorphic to $G \oplus S$. For a graph class $\mathcal{C}$, subgraph complementation to $\mathcal{C}$ is the problem of deciding, for a given graph $G$, whether $G$ has a subgraph complement in $\mathcal{C}$. This problem has been studied and its complexity has been settled for many classes $\mathcal{C}$ such as $\mathcal{H}$-free graphs, for various families $\mathcal{H}$, and for classes of bounded degeneracy. In this work, we focus on classes graphs of minimum/maxi
The majority of gravitational wave events detected by the LIGO, Virgo, and KAGRA Collaboration originate from binary black hole (BBH) mergers, for which no confirmed electromagnetic counterparts have been identified to date. However, if such mergers occur within the disk of an active galactic nucleus (AGN), they may generate observable optical flares induced by relativistic jet activity and shock-heated gas. We present results from a long-term optical follow-up of the gravitational wave event S231206cc, conducted with the T80-South telescope as part of the S-PLUS Transient Extension Program (STEP). Our search prioritized AGN-hosted environments by crossmatching the gravitational wave localization with known AGN catalogs. No candidate met the criteria for a viable optical counterpart. We explored three BBH merger scenarios predicting optical emission in AGN disks: (i) ram pressure stripping, (ii) long-term emission from an emerging jet cocoon, and (iii) jet breakout followed by shock cooling. Using our observational cadence and depth, we constrained the BBH parameter space, including the remnant's location within the AGN disk, kick velocity, and supermassive black hole (SMBH) mass.
Antonie (Anton) Pannekoek (1873-1960) is remembered as one of the initiators of the field of stellar atmospheres. A second part of his research concerned Galactic astronomy. He was convinced that the sidereal system was built up of clouds of stars in a smooth, low-density stratum. In addition there were dark clouds together with streaks with little or no extinction in between. Pannekoek looked at bright star clouds and estimated their distance from their contribution to star counts. He found values of tens of kpc, which would mean their distribution was similar in extent to that of Shapleys globular cluster system. Later he had to reduce his distance by a factor over two, and later still retract the method. He developed a rigorous method of estimating distances of dark clouds from modeling star counts off and on the cloud, preceding Wolf's quick and dirty method. He should have received more credit for this. He started isophotal maps of the northern and southern Milky Way, first from visual observations, later from photographic surface photometry using out-of-focus exposures. I compare Pannekoeks maps with detailed photographic surface photometry of the south by the group in Bochum
Large language models (LLMs) have demonstrated remarkable progress in leveraging diverse knowledge sources. This study investigates how nine widely used LLMs allocate knowledge between local context and global parameters when answering open-ended questions in knowledge-consistent scenarios. We introduce a novel dataset, WikiAtomic, and systematically vary context sizes to analyze how LLMs prioritize and utilize the provided information and their parametric knowledge in knowledge-consistent scenarios. Additionally, we also study their tendency to hallucinate under varying context sizes. Our findings reveal consistent patterns across models, including a consistent reliance on both contextual (around 70%) and parametric (around 30%) knowledge, and a decrease in hallucinations with increasing context. These insights highlight the importance of more effective context organization and developing models that use input more deterministically for robust performance.
We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how the models are optimized for inference, and the evaluation results. We highlight our focus on Responsible AI and how the principles are applied throughout the model development.
For a class $\mathcal{G}$ of graphs, the objective of \textsc{Subgraph Complementation to} $\mathcal{G}$ is to find whether there exists a subset $S$ of vertices of the input graph $G$ such that modifying $G$ by complementing the subgraph induced by $S$ results in a graph in $\mathcal{G}$. We obtain a polynomial-time algorithm for the problem when $\mathcal{G}$ is the class of graphs with minimum degree at least $k$, for a constant $k$, answering an open problem by Fomin et al. (Algorithmica, 2020). When $\mathcal{G}$ is the class of graphs without any induced copies of the star graph on $t+1$ vertices (for any constant $t\geq 3$) and diamond, we obtain a polynomial-time algorithm for the problem. This is in contrast with a result by Antony et al. (Algorithmica, 2022) that the problem is NP-complete and cannot be solved in subexponential-time (assuming the Exponential Time Hypothesis) when $\mathcal{G}$ is the class of graphs without any induced copies of the star graph on $t+1$ vertices, for every constant $t\geq 5$.
Kilonovae represent a category of astrophysical transients, identifiable as the electromagnetic observable counterparts associated with the coalescence events of binary systems comprising neutron stars and neutron star-black hole pairs. They act as probes for heavy-element nucleosynthesis in astrophysical environments. These studies rely on inference of the physical parameters (e.g., ejecta mass, velocity, composition) that describe kilonovae based on electromagnetic observations. This is a complex inverse problem typically addressed with sampling-based methods such as Markov-chain Monte Carlo (MCMC) or nested sampling algorithms. However, repeated inferences can be computationally expensive due to the sequential nature of these methods. This poses a significant challenge to ensuring the reliability and statistical validity of the posterior approximations and, thus, the inferred kilonova parameters themselves. We present a novel approach: Simulation-Based Inference (SBI) using simulations produced by KilonovaNet. Our method employs an ensemble of Amortized Neural Posterior Estimation (ANPE) with an embedding network to directly predict posterior distributions from simulated spectra
With the rapid adoption of AI in the form of large language models (LLMs), the potential value of carefully engineered prompts has become significant. However, to realize this potential, prompts should be tradable on an open market. Since prompts are, at present, generally economically non-excludable, by virtue of their nature as text, no general competitive market has yet been established. This note discusses two protocols intended to provide protection of prompts, elevating their status as intellectual property, thus confirming the intellectual property rights of prompt engineers, and potentially supporting the flourishing of an open market for LLM prompts.
In this study, we developed the first baseline readability model for the Cebuano language. Cebuano is the second most-used native language in the Philippines with about 27.5 million speakers. As the baseline, we extracted traditional or surface-based features, syllable patterns based from Cebuano's documented orthography, and neural embeddings from the multilingual BERT model. Results show that the use of the first two handcrafted linguistic features obtained the best performance trained on an optimized Random Forest model with approximately 87% across all metrics. The feature sets and algorithm used also is similar to previous results in readability assessment for the Filipino language showing potential of crosslingual application. To encourage more work for readability assessment in Philippine languages such as Cebuano, we open-sourced both code and data.
Continual learning and few-shot learning are important frontiers in progress toward broader Machine Learning (ML) capabilities. Recently, there has been intense interest in combining both. One of the first examples to do so was the Continual few-shot Learning (CFSL) framework of Antoniou et al. arXiv:2004.11967. In this study, we extend CFSL in two ways that capture a broader range of challenges, important for intelligent agent behaviour in real-world conditions. First, we increased the number of classes by an order of magnitude, making the results more comparable to standard continual learning experiments. Second, we introduced an 'instance test' which requires recognition of specific instances of classes -- a capability of animal cognition that is usually neglected in ML. For an initial exploration of ML model performance under these conditions, we selected representative baseline models from the original CFSL work and added a model variant with replay. As expected, learning more classes is more difficult than the original CFSL experiments, and interestingly, the way in which image instances and classes are presented affects classification performance. Surprisingly, accuracy in t
This is the second of two papers dedicated to the computation of the reduced C*-algebra of a connected, linear, real reductive group up to C*-algebraic Morita equivalence, and the verification of the Connes-Kasparov conjecture for these groups. These results were originally announced by Antony Wassermann in 1987. In Part I we presented the Morita equivalence and the Connes-Kasparov morphism. In this part we shall compute the morphism using David Vogan's description of the tempered dual.
For a set of integers $A$, we consider $R(A)=\{a/b: a, b\in A, b eq 0\}$. It is an open problem to study the denseness of $R(A)$ in the $p$-adic numbers when $A$ is the set of nonzero values attained by an integral form. This problem has been answered for quadratic forms. Very recently, Antony and Barman have studied this problem for the diagonal binary cubic forms $ax^3+by^3$, where $a$ and $b$ are integers. In this article, we study this problem for diagonal forms. We extend the results of Antony and Barman to the diagonal binary forms $ax^n+by^n$ for all $n\geq 3$. We also study $p$-adic denseness of quotients of nonzero values attained by diagonal forms of degree $n\geq 3$, where $\gcd(n,p(p-1))=1$.
This is the first of two papers dedicated to the computation of the reduced C*-algebra of a connected, linear, real reductive group up to Morita equivalence, and the verification of the Connes-Kasparov conjecture for these groups. These results were originally announced by Antony Wassermann in 1987. In Part I we shall give details of the C*-algebraic Morita equivalence, and then compute the Connes-Kasparov morphism subject to some results in tempered representation theory that we shall prove in Part II using tools from David Vogan's classification of the tempered dual.
The application of microscopy in biomedical research has come a long way since Antonie van Leeuwenhoek discovered unicellular organisms. Countless innovations have positioned light microscopy as a cornerstone of modern biology and a method of choice for connecting omics datasets to their biological and clinical correlates. Still, regardless of how convincing published imaging data looks, it does not always convey meaningful information about the conditions in which it was acquired, processed, and analyzed. Adequate record-keeping, reporting, and quality control are therefore essential to ensure experimental rigor and data fidelity, allow experiments to be reproducibly repeated, and promote the proper evaluation, interpretation, comparison, and re-use. To this end, microscopy images should be accompanied by complete descriptions detailing experimental procedures, biological samples, microscope hardware specifications, image acquisition parameters, and image analysis procedures, as well as metrics accounting for instrument performance and calibration. However, universal, community-accepted Microscopy Metadata standards and reporting specifications that would result in Findable Access