The Atacama Large Millimeter/submillimeter Array and the James Webb Space Telescope are transforming our understanding of galaxy formation and evolution in the early Universe. By combining their capabilities, these observatories provide unprecedented insights into the gas, dust, and stars of high-redshift galaxies at spatially resolved scales, unveiling the complexities of their interstellar medium, kinematics, morphology, active galactic nuclei, and star formation activity. This review summarizes recent breakthroughs in the study of galaxies during the first billion years of cosmic history, highlighting key discoveries, open questions, and current limitations. We discuss how observations, theoretical models, and simulations are shaping our understanding of early galaxy evolution and identify promising directions for future research. While significant progress can be achieved through optimized use of existing facilities and collaborative efforts, further advances will require enhanced angular resolution and sensitivity, motivating upgrades to current instruments and the development of next-generation observatories.
Limit order books can transition rapidly from stable to stressed conditions, yet standard early-warning signals such as order flow imbalance and short-term volatility are inherently reactive. We formalise this limitation via a three-regime causal data-generating process (stable $\to$ latent build-up $\to$ stress) in which a latent deterioration phase creates a prediction window prior to observable stress. Under mild assumptions on temporal drift and regime persistence, we establish identifiability of the latent build-up regime and derive guarantees for strictly positive expected lead-time and non-trivial probability of early detection. We propose a trigger-based detector combining MAX aggregation of complementary signal channels, a rising-edge condition, and adaptive thresholding. Across 200 simulations, the method achieves mean lead-time $+18.6 \pm 3.2$ timesteps with perfect precision and moderate coverage, outperforming classical change-point and microstructure baselines. A preliminary application to one week of BTC/USDT order book data shows consistent positive lead-times while baselines remain reactive. Results degrade in low signal-to-noise and short build-up regimes, consist
State-of-the-art automated machine learning systems for tabular data often employ cross-validation; ensuring that measured performances generalize to unseen data, or that subsequent ensembling does not overfit. However, using k-fold cross-validation instead of holdout validation drastically increases the computational cost of validating a single configuration. While ensuring better generalization and, by extension, better performance, the additional cost is often prohibitive for effective model selection within a time budget. We aim to make model selection with cross-validation more effective. Therefore, we study early stopping the process of cross-validation during model selection. We investigate the impact of early stopping on random search for two algorithms, MLP and random forest, across 36 classification datasets. We further analyze the impact of the number of folds by considering 3-, 5-, and 10-folds. In addition, we investigate the impact of early stopping with Bayesian optimization instead of random search and also repeated cross-validation. Our exploratory study shows that even a simple-to-understand and easy-to-implement method consistently allows model selection to conve
Supporting decision-making has long been a central vision in the field of spatio-temporal intelligence. While prior work has improved the timeliness and accuracy of spatio-temporal forecasting, converting these forecasts into actionable strategies remains a key challenge. A main limitation is the decoupling of the prediction and the downstream decision phases, which can significantly degrade the downstream efficiency. For example, in emergency response, the priority is successful resource allocation and intervention, not just incident prediction. To this end, it is essential to propose an Adaptive Spatio-Temporal Early Decision model (ASTER) that reforms the forecasting paradigm from event anticipation to actionable decision support. This framework ensures that information is directly used for decision-making, thereby maximizing overall effectiveness. Specifically, ASTER introduces a new Resource-aware Spatio-Temporal interaction module (RaST) that adaptively captures long- and short-term dependencies under dynamic resource conditions, producing context-aware spatiotemporal representations. To directly generate actionable decisions, we further design a Preference-oriented decision
AI-native software development is often evaluated at the level of individual models, prompts, or generated artifacts. This framing is insufficient for production environments where software must be continuously produced, verified, deployed, maintained, and adapted across many operational contexts and long time horizons. We present a meta-engineering harness: a software-production architecture that transforms operational and product feature requirements into explicit contracts, routes work through role-specialized AI agents, performs independent and adversarial verification, and continuously improves itself through structured failure classification and outer-loop calibration. The harness is designed for settings in which software delivery is not a one-time project but an ongoing operating function. In our motivating application, CTO-as-a-service for small service firms, the system manages websites, booking flows, payment systems, backoffice workflow automations, and AI-agent interfaces as continuously evolving technical infrastructure rather than one-off deliverables. We describe the layered architecture, including two-pass contract compilation, persistent markdown memory with speci
The Nancy Grace Roman Space Telescope (Roman), NASA's next flagship observatory, has significant mission time to be spent on surveys for general astrophysics in addition to its three core community surveys. We considered what types of observations outside the core surveys would most benefit from early definition, given 700 hours of mission time in the first two years of Roman's operation. We recommend that a survey of the Galactic plane be defined early, based on the broad range of stakeholders for such a survey, the added scientific value of a first pass to obtain a baseline for proper motions complementary to Gaia's, and the significant potential synergies with ground-based surveys, notably the Legacy Survey of Space and Time (LSST) on Rubin. We also found strong motivation to follow a community definition process for ultra-deep observations with Roman.
Early detection of Alzheimer's Disease (AD) and its prodromal state, Mild Cognitive Impairment (MCI), is crucial for providing suitable treatment and preventing the disease from progressing. It can also aid researchers and clinicians to identify early biomarkers and minister new treatments that have been a subject of extensive research. The application of deep learning techniques on structural Magnetic Resonance Imaging (MRI) has shown promising results in diagnosing the disease. In this research, we intend to introduce a novel approach of using an ensemble of the self-attention-based Bottleneck Transformers with a sharpness aware minimizer for early detection of Alzheimer's Disease. The proposed approach has been tested on the widely accepted ADNI dataset and evaluated using accuracy, precision, recall, F1 score, and ROC-AUC score as the performance metrics.
In line with the Astro2020 Decadal Report State of the Profession findings and the NASA core value of Inclusion, the NASA Science Mission Directorate (SMD) Bridge Program was created to provide financial and programmatic support to efforts that work to increase the representation and inclusion of students from under-represented minorities in the STEM fields. To ensure an effective program, particularly for those who are often left out of these conversations, the NASA SMD Bridge Program Workshop was developed as a way to gather feedback from a diverse group of people about their unique needs and interests. The Early Career Perspectives Working Group was tasked with examining the current state of bridge programs, academia in general, and its effect on students and early career professionals. The working group, comprised of 10 early career and student members, analyzed the discussions and responses from workshop breakout sessions and two surveys, as well as their own experiences, to develop specific recommendations and metrics for implementing a successful and supportive bridge program. In this white paper, we will discuss the key themes that arose through our work, and highlight sele
The tomato is one of the most important fruits on earth. It plays an important and useful role in the agricultural production of any country. This research propose a novel smart technique for early detection of late blight diseases in tomatoes. This work improve the dataset with an increase in images from the field (the Plant Village dataset) and proposed a hybrid algorithm composed of support vector machines (SVM) and histogram-oriented gradients (HOG) for real-time detection of late blight tomato disease. To propose a HOG-based SVM model for early detection of late blight tomato leaf disease. To check the performance of the proposed model in terms of MSE, accuracy, precision, and recall as compared to Decision Tree and KNN. The integration of advanced technology in agriculture has the potential to revolutionize the industry, making it more efficient, sustainable, and profitable. This research work on the early detection of tomato diseases contributes to the growing importance of smart farming, the need for climate-smart agriculture, the rising need to more efficiently utilize natural resources, and the demand for higher crop yields. The proposed hybrid algorithm of SVM and HOG ha
Large language models have recently attracted significant attention due to their impressive performance on a variety of tasks. ChatGPT developed by OpenAI is one such implementation of a large, pre-trained language model that has gained immense popularity among early adopters, where certain users go to the extent of characterizing it as a disruptive technology in many domains. Understanding such early adopters' sentiments is important because it can provide insights into the potential success or failure of the technology, as well as its strengths and weaknesses. In this paper, we conduct a mixed-method study using 10,732 tweets from early ChatGPT users. We first use topic modelling to identify the main topics and then perform an in-depth qualitative sentiment analysis of each topic. Our results show that the majority of the early adopters have expressed overwhelmingly positive sentiments related to topics such as Disruptions to software development, Entertainment and exercising creativity. Only a limited percentage of users expressed concerns about issues such as the potential for misuse of ChatGPT, especially regarding topics such as Impact on educational aspects. We discuss these
The Big Bang singularity in standard model cosmology suggests a program of study in 'early universe' quantum gravity phenomenology. Inflation is usually thought to undermine this program's prospects by means of a dynamical diluting argument, but such a view has recently been disputed within inflationary cosmology, in the form of a 'trans-Planckian censorship' conjecture. Meanwhile, trans-Planckian censorship has been used outside of inflationary cosmology to motivate alternative early universe scenarios that are tightly linked to ongoing theorizing in quantum gravity. Against the resulting trend toward early universe quantum gravity phenomenology within and without inflation, Ijjas and Steindhardt suggest a further alternative: a 'generalized cosmic censorship' principle. I contrast the generalized cosmic censorship principle with the logic of its namesake, the cosmic censorship conjectures. I also remark on foundational concerns in the effective field theory approach to cosmology beyond the standard model, which would be based on that principle.
We analyze $Chandra$ observations of the hot atmospheres of 40 early spiral and elliptical galaxies. Using new temperature, density, cooling time, and mass profiles, we explore relationships between their hot atmospheres and cold molecular gas. Molecular gas mass correlates with atmospheric gas mass and density over four decades from central galaxies in clusters to normal giant ellipticals and early spirals. The mass and density relations follow power laws: $M_{\rm mol} \propto M_{\rm X}^{1.4\pm0.1}$ and $M_{\rm mol} \propto n_{\rm e}^{1.8\pm0.3}$, respectively, at 10 kpc. The ratio of molecular gas to atmospheric gas within a 10 kpc radius lies between $3\%$ and $10\%$ for early-type galaxies and between $3\%$ and $50\%$ for central galaxies in clusters. Early-type galaxies have detectable levels of molecular gas when their atmospheric cooling times falls below $\sim \rm Gyr$ at a radius of 10 kpc. A similar trend is found in central cluster galaxies. We find no relationship between the ratio of the cooling time to free fall time, $t_{\rm c}/t_{\rm ff}$, and the presence or absence of molecular clouds in early-type galaxies. The data are consistent with much of the molecular gas i
Gravitational wave science is a dynamical, fast-expanding research field founded on results, tools and methodologies drawn from different research areas and communities. Early career scientists entering this field must learn and combine knowledge and techniques from a range of disciplines. The Workshop on Gravitational-Wave Astrophysics for Early Career Scientists (GWAECS), held virtually in May 2021, planted the seeds of an interdisciplinary, well-connected and all-inclusive community of early career scientists working on gravitational waves, able to exchange relevant information and ideas, build a healthy professional and international environment, share and learn valuable skills, and ensure that ongoing research efforts are perpetuated and expanded in order to attain the main scientific goals envisioned by the whole community. GWAECS was the first event unifying early career scientists belonging to different communities, historically associated with different large-scale gravitational wave experiments. It provided a broad perspective on the future of gravitational waves, offered training on soft and transferable skills and allowed ample time for informal discussions between earl
With the rise of the Internet, there is a growing need to build intelligent systems that are capable of efficiently dealing with early risk detection (ERD) problems on social media, such as early depression detection, early rumor detection or identification of sexual predators. These systems, nowadays mostly based on machine learning techniques, must be able to deal with data streams since users provide their data over time. In addition, these systems must be able to decide when the processed data is sufficient to actually classify users. Moreover, since ERD tasks involve risky decisions by which people's lives could be affected, such systems must also be able to justify their decisions. However, most standard and state-of-the-art supervised machine learning models are not well suited to deal with this scenario. This is due to the fact that they either act as black boxes or do not support incremental classification/learning. In this paper we introduce SS3, a novel supervised learning model for text classification that naturally supports these aspects. SS3 was designed to be used as a general framework to deal with ERD problems. We evaluated our model on the CLEF's eRisk2017 pilot t
Rapid impact assessment in the immediate aftermath of a natural disaster is essential to provide adequate information to international organisations, local authorities, and first responders. Social media can support emergency response with evidence-based content posted by citizens and organisations during ongoing events. In the paper, we propose TriggerCit: an early flood alerting tool with a multilanguage approach focused on timeliness and geolocation. The paper focuses on assessing the reliability of the approach as a triggering system, comparing it with alternative sources for alerts, and evaluating the quality and amount of complementary information gathered. Geolocated visual evidence extracted from Twitter by TriggerCit was analysed in two case studies on floods in Thailand and Nepal in 2021.
We discuss the properties of the HI in low-luminosity early-type galaxies. The morphology of the HI is more regular than that of the HI in many more-luminous early-type galaxies. The HI is always distributed in a disk and is more centrally concentrated. The central HI surface densities are higher than in luminous early-type galaxies and are high enough for star formation to occur.
A recently introduced classifier, called SS3, has shown to be well suited to deal with early risk detection (ERD) problems on text streams. It obtained state-of-the-art performance on early depression and anorexia detection on Reddit in the CLEF's eRisk open tasks. SS3 was created to deal with ERD problems naturally since: it supports incremental training and classification over text streams, and it can visually explain its rationale. However, SS3 processes the input using a bag-of-word model lacking the ability to recognize important word sequences. This aspect could negatively affect the classification performance and also reduces the descriptiveness of visual explanations. In the standard document classification field, it is very common to use word n-grams to try to overcome some of these limitations. Unfortunately, when working with text streams, using n-grams is not trivial since the system must learn and recognize which n-grams are important "on the fly". This paper introduces t-SS3, an extension of SS3 that allows it to recognize useful patterns over text streams dynamically. We evaluated our model in the eRisk 2017 and 2018 tasks on early depression and anorexia detection.
Reducing atmospheres have recently emerged as a promising scenario to warm the surface of early Mars enough to drive the formation of valley networks and other ancient aqueous features that have been detected so far on the surface of Mars. Here we present a series of experiments and calculations to better constrain CO2+CH4 and CO2+H2 collision-induced absorptions (CIAs) as well as their effect on the prediction of early Mars surface temperature. First, we carried out a new set of experimental measurements (using the AILES line of the SOLEIL synchrotron) of both CO2+CH4 and CO2+H2 CIAs. These measurements confirm the previous results of Turbet et al. 2019, Icarus vol. 321, while significantly reducing the experimental uncertainties. Secondly, we fitted a semi-empirical model to these CIAs measurements, allowing us to compute the CO2+CH4 and CO2+H2 CIAs across a broad spectral domain (0-1500cm-1) and for a wide range of temperatures (100-600K). Last, we performed 1-D numerical radiative-convective climate calculations (using the LMD Generic Model) to compute the surface temperature expected on the surface of early Mars for several CO2, CH4 and H2 atmospheric contents, taking into acc
Studying quantum field theories through geometric principles has revealed deep connections between physics and mathematics, including the discovery by Cachazo, Early, Guevara and Mizera (CEGM) of a generalization of biadjoint scalar amplitudes. However, extending this to generalizations of other quantum field theories remains a central challenge. Recently it has been discovered that the nonlinear sigma model (NLSM) emerges after a certain zero-preserving deformation from $\text{tr}(φ^3)$. In this work, we find a much richer story of zero-preserving deformations in the CEGM context, yielding generalized NLSM amplitudes. We prove an explicit formula for the residual embedding of an $n$-point NLSM amplitude in a mixed $n+2$ point generalized NLSM amplitude, which provides a strong consistency check on our generalization. We show that the dimension of the space of pure kinematic deformations is $\gcd(k,n)-1$, we introduce a deformation-compatible modification of the Global Schwinger Parameterization, and we include a new proof, using methods from matroidal blade arrangements, of the linear independence for the set of planar kinematic invariants for CEGM amplitudes. Our framework is com
Scaffolds are the one-dimensional skeleta of high-dimensional flag simplicial complexes of nonpositive curvature. They generalize the phylogenetic trees of Trop G(2,n) to arbitrary $k$, drawing together SL(k)-web bases, affine buildings, the combinatorics of the positive tropical Grassmannian and low-dimensional topology. We prove that scaffolds model points in all tropical Grassmannians via a $k$-point distance function. In this paper, we study in detail CAT(0) planar graphs, which are positive scaffolds for the tropical Grassmannian of three-planes. CAT(0) planar graphs are directed versions of the diskoids of Fontaine-Kamnitzer-Kuperberg, planar dual to SL(3)-webs. Our main result is the construction of a unique representation of any given integer positive tropical Plucker vector by a normal CAT(0) planar graph. We show that any normal CAT(0) planar graph embeds into the tropical linear space as a Lam-Postnikov membrane, and embeds into the Keel-Tevelev membrane within the affine building. We show that Early's planar basis expansion can be computed directly from the strand combinatorics of the dual web, and connect this expansion to Petersen-Pylyavskyy-Speyer's noncrossing table