共找到 20 条结果
Machine learning is rapidly making its pathway across all of the natural sciences, including physical sciences. The rate at which ML is impacting non-scientific disciplines is incomparable to that in the physical sciences. This is partly due to the uninterpretable nature of deep neural networks. Symbolic machine learning stands as an equal and complementary partner to numerical machine learning in speeding up scientific discovery in physics. This perspective discusses the main differences between the ML and scientific approaches. It stresses the need to develop and apply symbolic machine learning to physics problems equally, in parallel to numerical machine learning, because of the dual nature of physics research.
In recent decades, the relevance of polarimetry in planetary sciences and astronomy has increased rapidly. Polarization is a fundamental property of light and can be modified by any scattering event. As such, polarization yields additional information that cannot be obtained by only assessing light's scalar properties. For instance, the polarization state of starlight scattered by planetary surfaces can provide useful insights on the composition, size, morphology, and porosity of regolith particles and might even indicate the presence of life. Beside being useful for characterization, polarimetry can also greatly enhance the detection of exoplanets. Here, polarization can be harnessed to enhance the contrast between the bright light of a star, which can be considered to be fully unpolarized, and the very dim but polarized light reflected by an exoplanet. In this paper, we discuss and review the current developments and advances in optical polarimetry and polarimetric instrumentation in Switzerland within the framework of the National Centre of Competence in Research PlanetS. We focus on their implications for the vast range of science cases that polarimetry can address within the r
Symbolic regression (SR) has emerged as a powerful method for uncovering interpretable mathematical relationships from data, offering a novel route to both scientific discovery and efficient empirical modelling. This article introduces the Special Issue on Symbolic Regression for the Physical Sciences, motivated by the Royal Society discussion meeting held in April 2025. The contributions collected here span applications from automated equation discovery and emergent-phenomena modelling to the construction of compact emulators for computationally expensive simulations. The introductory review outlines the conceptual foundations of SR, contrasts it with conventional regression approaches, and surveys its main use cases in the physical sciences, including the derivation of effective theories, empirical functional forms and surrogate models. We summarise methodological considerations such as search-space design, operator selection, complexity control, feature selection, and integration with modern AI approaches. We also highlight ongoing challenges, including scalability, robustness to noise, overfitting and computational complexity. Finally we emphasise emerging directions, particula
Labeling or classifying time series is a persistent challenge in the physical sciences, where expert annotations are scarce, costly, and often inconsistent. Yet robust labeling is essential to enable machine learning models for understanding, prediction, and forecasting. We present the \textit{Clustering and Indexation Pipeline with Human Evaluation for Recognition} (CIPHER), a framework designed to accelerate large-scale labeling of complex time series in physics. CIPHER integrates \textit{indexable Symbolic Aggregate approXimation} (iSAX) for interpretable compression and indexing, density-based clustering (HDBSCAN) to group recurring phenomena, and a human-in-the-loop step for efficient expert validation. Representative samples are labeled by domain scientists, and these annotations are propagated across clusters to yield systematic, scalable classifications. We evaluate CIPHER on the task of classifying solar wind phenomena in OMNI data, a central challenge in space weather research, showing that the framework recovers meaningful phenomena such as coronal mass ejections and stream interaction regions. Beyond this case study, CIPHER highlights a general strategy for combining sy
This study presents the development of the PsyCogMetrics AI Lab (psycogmetrics.ai), an integrated, cloud-based platform that operationalizes psychometric and cognitive-science methodologies for Large Language Model (LLM) evaluation. Framed as a three-cycle Action Design Science study, the Relevance Cycle identifies key limitations in current evaluation methods and unfulfilled stakeholder needs. The Rigor Cycle draws on kernel theories such as Popperian falsifiability, Classical Test Theory, and Cognitive Load Theory to derive deductive design objectives. The Design Cycle operationalizes these objectives through nested Build-Intervene-Evaluate loops. The study contributes a novel IT artifact, a validated design for LLM evaluation, benefiting research at the intersection of AI, psychology, cognitive science, and the social and behavioral sciences.
Do different fields of knowledge require different research strategies? A numerical model exploring different virtual knowledge landscapes, revealed two diverging optimal search strategies. Trend following is maximized when the popularity of new discoveries determine the number of individuals researching it. This strategy works best when many researchers explore few large areas of knowledge. In contrast, individuals or small groups of researchers are better in discovering small bits of information in dispersed knowledge landscapes. Bibliometric data of scientific publications showed a continuous bipolar distribution of these strategies, ranging from natural sciences, with highly cited publications in journals containing a large number of articles, to the social sciences, with rarely cited publications in many journals containing a small number of articles. The natural sciences seem to adapt their research strategies to landscapes with large concentrated knowledge clusters, whereas social sciences seem to have adapted to search in landscapes with many small isolated knowledge clusters. Similar bipolar distributions were obtained when comparing levels of insularity estimated by indic
This study examines the role of top-tier conference publications in Hungarian computer science research. We show that the national scientometric practice, which is currently journal-oriented, diverges from international norms, creating incentive distortions in researcher evaluation. By linking multiple databases (iCore, DBLP, MTMT, MTA-ATT), we mapped Hungarian-affiliated CORE A* and A conference papers, their temporal and thematic distribution, and author trajectories. Our results indicate that, in theoretical fields, publishing at international conferences became common earlier than in applied fields. At the same time, in applied fields, successful researchers are more likely to continue their careers in foreign institutions or in industry positions. Overall, a substantial share of the already established, internationally most successful researchers are now affiliated with institutions abroad. We recommend recognizing CORE A* papers as equivalent to D1 and CORE A papers as equivalent to Q1 journals in national evaluation systems.
Word embeddings are an essential instrument in many NLP tasks. Most available resources are trained on general language from Web corpora or Wikipedia dumps. However, word embeddings for domain-specific language are rare, in particular for the social science domain. Therefore, in this work, we describe the creation and evaluation of word embedding models based on 37,604 open-access social science research papers. In the evaluation, we compare domain-specific and general language models for (i) language coverage, (ii) diversity, and (iii) semantic relationships. We found that the created domain-specific model, even with a relatively small vocabulary size, covers a large part of social science concepts, their neighborhoods are diverse in comparison to more general models. Across all relation types, we found a more extensive coverage of semantic relationships.
We define deriving semantic class targets as a novel multi-modal task. By doing so, we aim to improve classification schemes in the physical sciences which can be severely abstracted and obfuscating. We address this task for upcoming radio astronomy surveys and present the derived semantic radio galaxy morphology class targets.
Nonlinear science has evolved significantly over the 35 years since the launch of the journal Chaos. This Focus Issue, dedicated to the 80th Birthday of its founding editor-in-chief, David K. Campbell, brings together a selection of contributions on influential topics, many of which were advanced by Campbell's own research program and leadership role. The topics include new phenomena and method development in the realms of network dynamics, machine learning, quantum and material systems, chaos and fractals, localized states, and living systems, with a good balance of literature review, original contributions, and perspectives for future research.
The technology industry offers exciting and diverse career opportunities, ranging from traditional software development to emerging fields such as artificial intelligence, cybersecurity, and data science. Career fairs play a crucial role in helping Computer Science (CS) students understand the various career pathways available to them in the industry. However, limited research exists on how CS students experience and benefit from these events. Through a survey of 86 students, we investigate their motivations for attending, preparation strategies, and learning outcomes, including exposure to new career paths and technologies. We envision our findings providing valuable insights for career services professionals, educators, and industry leaders in improving the career development processes of CS students.
Dwarf spheroidal galaxies (dSphs) are excellent targets for indirect dark matter (DM) searches using gamma-ray telescopes because they are thought to have high DM content and a low astrophysical background. The sensitivity of these searches is improved by combining the observations of dSphs made by different gamma-ray telescopes. We present the results of a combined search by the most sensitive currently operating gamma-ray telescopes, namely: the satellite-borne Fermi-LAT telescope; the ground-based imaging atmospheric Cherenkov telescope arrays H.E.S.S., MAGIC, and VERITAS; and the HAWC water Cherenkov detector. Individual datasets were analyzed using a common statistical approach. Results were subsequently combined via a global joint likelihood analysis. We obtain constraints on the velocity-weighted cross section $\langle σ\mathit{v} \rangle$ for DM self-annihilation as a function of the DM particle mass. This five-instrument combination allows the derivation of up to 2-3 times more constraining upper limits on $\langle σ\mathit{v} \rangle$ than the individual results over a wide mass range spanning from 5 GeV to 100 TeV. Depending on the DM content modeling, the 95% confidence
This text provides with an introduction to the modern approach of artificiality and simulation in social sciences. It presents the relationship between complexity and artificiality, before introducing the field of artificial societies which greatly benefited from the computer power fast increase, gifting social sciences with formalization and experimentation tools previously owned by "hard" sciences alone. It shows that as "a new way of doing social sciences", artificial societies should undoubtedly contribute to a renewed approach in the study of sociality and should play a significant part in the elaboration of original theories of social phenomena.
Environment Agencies from Europe and the US are setting up a network of Linked Environment Data and are looking to crosslink it with Linked Data contributions from the life sciences.
This White Paper summarises potential key science topics to be achieved with Thai National Radio Telescope (TNRT). The commissioning phase has started in mid 2022. The key science topics consist of "Pulsars and Fast Radio Bursts (FRBs)", "Star Forming Regions (SFRs)", "Galaxy and Active Galactic Nuclei (AGNs)", "Evolved Stars", "Radio Emission of Chemically Peculiar (CP) Stars", and "Geodesy", covering a wide range of observing frequencies in L/C/X/Ku/K/Q/W-bands (1-115 GHz). As a single-dish instrument, TNRT is a perfect tool to explore time domain astronomy with its agile observing systems and flexible operation. Due to its ideal geographical location, TNRT will significantly enhance Very Long Baseline Interferometry (VLBI) arrays, such as East Asian VLBI Network (EAVN), Australia Long Baseline Array (LBA), European VLBI Network (EVN), in particular via providing a unique coverage of the sky resulting in a better complete "uv" coverage, improving synthesized-beam and imaging quality with reducing side-lobes. This document highlights key science topics achievable with TNRT in single-dish mode and in collaboration with VLBI arrays.
Social Network Analysis is a way of studying agents embedded in contexts. In about 1998, physicists discovered social networks as representations of complex systems. Small-world and scale-free networks are the paradigmatic models of this Network Science. Relying on various models and mechanisms of socio-cultural processes, an identity model is developed and calibrated in a case study of Social Network Science. This research domain results from the union of Social Network Analysis and Network Science. A unique dataset of 25,760 scholarly articles from one century of research (1916-2012) is created. Clustering this set of publications, five subdomains are detected and analyzed in terms of authorship, citation, and word usage structures and dynamics. The scaling hypothesis of percolation theory is formulated for socio-cultural systems, namely that power-law size distributions like Lotka's, Bradford's, and Zipf's Law mean that the described identity resides at the phase transition between the stability and change of meaning. In this case, it can be diagnosed using bivariate scaling laws and Abbott's heuristic of fractal distinctions. Identities are not dichotomies but dualities of soci
Research in the Life Sciences depends on the integration of large, distributed and heterogeneous data sources and web services. The discovery of which of these resources are the most appropriate to solve a given task is a complex research question, since there is a large amount of plausible candidates and there is little, mostly unstructured, metadata to be able to decide among them.We contribute a semi-automatic approach,based on semantic techniques, to assist researchers in the discovery of the most appropriate web services to full a set of given requirements.
Small bodies, the unaccreted leftovers of planetary formation, are often mistaken for the leftovers of planetary science in the sense that they are everything else after the planets and their satellites (or sometimes just their regular satellites) are accounted for. This mistaken view elides the great diversity of compositions, histories, and present-day conditions and processes found in the small bodies, and the interdisciplinary nature of their study. Understanding small bodies is critical to planetary science as a field, and we urge planetary scientists and our decision makers to continue to support science-based mission selections and to recognize that while small bodies have been grouped together for convenience, the diversity of these objects in terms of composition, mass, differentiation, evolution, activity, dynamical state, physical structure, thermal environment, thermal history, and formation vastly exceeds the observed variability in the major planets and their satellites. Treating them as a monolithic group with interchangeable members does a grave injustice to the range of fundamental questions they address. We advocate for a deep and ongoing program of missions, tele
GREX-PLUS (Galaxy Reionization EXplorer and PLanetary Universe Spectrometer) is a mission candidate for a JAXA's strategic L-class mission to be launched in the 2030s. Its primary sciences are two-fold: galaxy formation and evolution and planetary system formation and evolution. The GREX-PLUS spacecraft will carry a 1.2 m primary mirror aperture telescope cooled down to 50 K. The two science instruments will be onboard: a wide-field camera in the 2-8 $μ$m wavelength band and a high resolution spectrometer with a wavelength resolution of 30,000 in the 10-18 $μ$m band. The GREX-PLUS wide-field camera aims to detect the first generation of galaxies at redshift $z>15$. The GREX-PLUS high resolution spectrometer aims to identify the location of the water ``snow line'' in proto-planetary disks. Both instruments will provide unique data sets for a broad range of scientific topics including galaxy mass assembly, origin of supermassive blackholes, infrared background radiation, molecular spectroscopy in the interstellar medium, transit spectroscopy for exoplanet atmosphere, planetary atmosphere in the Solar system, and so on.
We present the results of processing the effects of the powerful Gamma Ray Burst GRB221009A captured by the charged particle detectors (electrostatic analyzers and solid-state detectors) onboard spacecraft at different points in the heliosphere on October 9, 2022. To follow the GRB221009A propagation through the heliosphere we used the electron and proton flux measurements from solar missions Solar Orbiter and STEREO-A; Earth magnetosphere and the solar wind missions THEMIS and Wind; meteorological satellites POES15, POES19, MetOp3; and MAVEN - a NASA mission orbiting Mars. GRB221009A had a structure of four bursts: less intense Pulse 1 - the triggering impulse - was detected by gamma-ray observatories at 131659 UT (near the Earth); the most intense Pulses 2 and 3 were detected on board all the spacecraft from the list, and Pulse 4 detected in more than 500 s after Pulse 1. Due to their different scientific objectives, the spacecraft, which data was used in this study, were separated by more than 1 AU (Solar Orbiter and MAVEN). This enabled tracking GRB221009A as it was propagating across the heliosphere. STEREO-A was the first to register Pulse 2 and 3 of the GRB, almost 100 secon