共找到 20 条结果
The discovery of new exoplanets makes us wonder where each new exoplanet stands along its way to develop life as we know it on Earth. Our Evo-SETI Theory is a mathematical way to face this problem. We describe cladistics and evolution by virtue of a few statistical equations based on lognormal probability density functions (pdf) in the time. We call b-lognormal a lognormal pdf starting at instant b (birth). Then, the lifetime of any living being becomes a suitable b-lognormal in the time. Next, our "Peak-Locus Theorem" translates cladistics: each species created by evolution is a b-lognormal whose peak lies on the exponentially growing number of living species. This exponential is the mean value of a stochastic process called "Geometric Brownian Motion" (GBM). Past mass extinctions were all-lows of this GBM. In addition, the Shannon Entropy (with a reversed sign) of each b-lognormal is the measure of how evolved that species is, and we call it EvoEntropy. The "molecular clock" is re-interpreted as the EvoEntropy straight line in the time whenever the mean value is exactly the GBM exponential. We were also able to extend the Peak-Locus Theorem to any mean value other than the expone
The Hubble tuning fork diagram, based on morphology and established in the 1930s, has always been the preferred scheme for classification of galaxies. However, the current large amount of data up to higher and higher redshifts asks for more sophisticated statistical approaches like multivariate analyses. Clustering analyses are still very confidential, and do not take into account the unavoidable characteristics in our Universe: evolution. Assuming branching evolution of galaxies as a 'transmission with modification', we have shown that the concepts and tools of phylogenetic systematics (cladistics) can be heuristically transposed to the case of galaxies. This approach that we call "astrocladistics", has now successfully been applied on several samples of galaxies and globular clusters. Maximum parsimony and distance-based approaches are the most popular methods to produce phylogenetic trees and, like most other studies, we had to discretize our variables. However, since astrophysical data are intrinsically continuous, we are contributing to the growing need for applying phylogenetic methods to continuous characters.
This series of papers is intended to evaluate astrocladistics in reconstructing phylogenies of galaxies. The objective of this second paper is to formalize the concept of galaxy formation and to identify the processes of diversification. We show that galaxy diversity can be expected to organize itself in a hierarchy. In order to better understand the role of mergers, we have selected a sample of 43 galaxies from the GALICS database built from simulations with a hybrid model for galaxy formation studies. These simulated galaxies, described by 119 characters and considered as representing still undefined classes, have experienced different numbers of merger events during evolution. Our cladistic analysis yields a robust tree that proves the existence of a hierarchy. Mergers, like interactions (not taken into account in the GALICS simulations), are probably a strong driver for galaxy diversification. Our result shows that mergers participate in a branching type of evolution, but do not seem to play the role of an evolutionary clock.
The Hubble tuning fork diagram, based on morphology and established in the 1930s, has always been the preferred scheme for classification of galaxies. However, the current large amount of multiwavelength data, most often spectra, for objects up to very high distances, asks for more sophisticated statistical approaches. Interpreting formation and evolution of galaxies as a ?transmission with modification' process, we have shown that the concepts and tools of phylogenetic systematics can be heuristically transposed to the case of galaxies. This approach, which we call ?astrocladistics', has successfully been applied on several samples. Many difficulties still remain, some of them being specific to the nature of both galaxies and their diversification processes, some others being classical in cladistics, like the pertinence of the descriptors in conveying any useful evolutionary information.
A self-contained description of algebraic structures, obtained by combinations of various limit procedures applied to vertex and face sl(2) elliptic quantum affine algebras, is given. New double Yangians structures of dynamical type are in particular defined. Connections between these structures are established. A number of them take the form of twist-like actions. These are conjectured to be evaluations of universal twists.
Writing systems are cultural replicators whose evolution has never been studied quantitatively at global scale. We compile the Global Script Database (GSD): 300 writing and notation systems, 50 binary structural characters, and 259 phylogenetic edges spanning 5,400 years. Applying four methods -- phenetics, cladistics, Bayesian inference, and neural network clustering -- we find that scripts exhibit a detectable molecular clock. The best-fitting model (Mk+Gamma strict clock) yields a substitution rate of q = 0.226 substitutions/character/millennium (95% CI: 0.034-1.22; Delta BIC = -4.1 versus relaxed clock; Delta BIC = -1,364.7 versus Mk without rate variation). Political interventions break this clock: deviation from expected divergence times correlates with intervention intensity (Spearman rho = 0.556, p < 10^{-4}), and per-character rate analysis reveals that intervention selectively rewrites deep structural features rather than merely accelerating change (rate profile correlation rho = 0.320). We identify 30 major script replacement events and rank their destructive impact. A ceiling effect suppresses independent invention wherever writing already exists (Fisher's exact OR =
In this paper we propose a new mathematical model capable of merging Darwinian Evolution, Human History and SETI into a single mathematical scheme: 1) Darwinian Evolution over the last 3.5 billion years is defined as one particular realization of a certain stochastic process called Geometric Brownian Motion (GBM). This GBM yields the fluctuations in time of the number of species living on Earth. Its mean value curve is an increasing exponential curve, i.e. the exponential growth of Evolution. 2) In 2008 this author provided the statistical generalization of the Drake equation yielding the number N of communicating ET civilizations in the Galaxy. N was shown to follow the lognormal probability distribution. 3) We call "b-lognormals" those lognormals starting at any positive time b ("birth") larger than zero. Then the exponential growth curve becomes the geometric locus of the peaks of a one-parameter family of b-lognormals: this is our way to re-define Cladistics. 4) b-lognormals may be also be interpreted as the lifespan of any living being (a cell, or an animal, a plant, a human, or even the historic lifetime of any civilization). Applying this new mathematical apparatus to Human
Computing and Internet access are substantially growing markets in Southern Africa, which brings with it increasing demands for local content and tools in indigenous African languages. Since most of those languages are low-resourced, efforts have gone into the notion of bootstrapping tools for one African language from another. This paper provides an overview of these efforts for Niger-Congo B (`Bantu') languages. Bootstrapping grammars for geographically distant languages has been shown to still have positive outcomes for morphology and rules or grammar-based natural language generation. Bootstrapping with data-driven approaches to NLP tasks is difficult to use meaningfully regardless geographic proximity, which is largely due to lexical diversity due to both orthography and vocabulary. Cladistic approaches in comparative linguistics may inform bootstrapping strategies and similarity measures might serve as proxy for bootstrapping potential as well, with both fertile ground for further research.
It is possible to borrow from a topic of biology called phylogenetic systematics, concepts and tools for a logical and objective classification of galaxies. It is based on observable properties of objects - characters - either qualitative (like morphology) or quantitative (like luminosity, mass or spectrum). Distance analysis can readily be performed using a method called phenetics and based on characters. But the most promising approach is cladistics. It makes use of characters that can exist in at least two states, one being ancestral and the other one derived. Objects are gathered depending on the derived states they share. We illustrate a first application of this method to astrophysics, that we name astrocladistics, with dwarf galaxies from the Local Group.
Phylogenetic approaches to classification have been heavily developed in biology by bioinformaticians. But these techniques have applications in other fields, in particular in linguistics. Their main characteristics is to search for relationships between the objects or species in study, instead of grouping them by similarity. They are thus rather well suited for any kind of evolutionary objects. For nearly fifteen years, astrocladistics has explored the use of Maximum Parsimony (or cladistics) for astronomical objects like galaxies or globular clusters. In this lesson we will learn how it works. 1 Why phylogenetic tools in astrophysics? 1.1 History of classification The need for classifying living organisms is very ancient, and the first classification system can be dated back to the Greeks. The goal was very practical since it was intended to distinguish between eatable and toxic aliments, or kind and dangerous animals. Simple resemblance was used and has been used for centuries. Basically, until the XVIIIth century, every naturalist chose his own criterion to build a classification. At the end, hundreds of classifications were available, most often incompatible to each other. The
In a series of recent papers and in a book, this author put forward a mathematical model capable of embracing the search for extra-terrestrial intelligence (SETI), Darwinian Evolution and Human History into a single, unified statistical picture, concisely called Evo-SETI. The relevant mathematical tools are: (1) Geometric Brownian motion (GBM), the stochastic process representing evolution as the stochastic increase of the number of species living on Earth over the last 3.5 billion years. This GBM is well known in the mathematics of finances (Black-Sholes models). (2) The probability distributions known as b-lognormals, i.e. lognormals starting at a certain positive instant b>0 rather than at the origin. In the framework of Darwinian Evolution, the resulting mathematical construction was shown to be what evolutionary biologists call Cladistics. (3) The (Shannon) entropy of such b-lognormals is then seen to represent the 'degree of progress' reached by each living organism or by each big set of living organisms, like historic human civilizations. (4) All these results also match with SETI in that the statistical Drake equation (generalization of the ordinary Drake equation to enc
Jupiter and Saturn each have complex systems of satellites and rings. These satellites can be classified into dynamical groups, implying similar formation scenarios. Recently, a larger number of additional irregular satellites have been discovered around both gas giants that have yet to be classified. The aim of this paper is to examine the relationships between the satellites and rings of the gas giants, using an analytical technique called cladistics. Cladistics is traditionally used to examine relationships between living organisms, the `tree of life'. In this work, we perform the first cladistical study of objects in a planetary science context. Our method uses the orbital, physical and compositional characteristics of satellites to classify the objects in the Jovian and Saturnian systems. We find that the major relationships between the satellites in the two systems, such as families, as presented in previous studies, are broadly preserved. In addition, based on our analysis of the Jovian system, we identify a new retrograde irregular family, the Iocaste family, and suggest that the Phoebe family of the Saturnian system can be further divided into two subfamilies. We also prop
As soon as their extragalactic origins were established, the hope to make Gamma - Ray Bursts (GRBs) standardizeable candles to probe the very high - z universe has opened the search for scaling relations between redshift independent observable quantities and distance dependent ones. Although some remarkable success has been achieved, the empirical correlations thus found are still affected by a significant intrinsic scatter which downgrades the precision in the inferred GRBs Hubble diagram. We investigate here whether this scatter may come from fitting together objects belonging to intrinsically different classes. To this end, we rely on a cladistics analysis to partition GRBs in homogenous families according to their rest frame properties. Although the poor statistics prevent us from drawing a definitive answer, we find that both the intrinsic scatter and the coefficients of the $E_{peak}$\,-\,$E_{iso}$ and $E_{peak}$\,-\,$L$ correlations significantly change depending on which subsample is fitted. It turns out that the fit to the full sample leads to a scaling relation which approximately follows the diagonal of the region delimited by the fits to each homogenous class. We theref
Context. Galaxy evolution and the effect of environment are most often studied using scaling relations or some regression analyses around some given property. These approaches however do not take into account the complexity of the physics of the galaxies and their diversification. Aims. We here investigate the effect of cluster environment on the evolution of galaxies through multivariate unsupervised classification and phylogenetic analyses applied to two relatively large samples from the WINGS survey, one of cluster members and one of field galaxies (2624 and 1476 objects respectively). Methods. These samples are the largest ones ever analysed with a phylogenetic approach in astrophysics. To be able to use the Maximum Parsimony (cladistics) method, we first performed a pre-clustering in 300 clusters with a hierarchical clustering technique, before applying it to these pre-clusters. All these computations used seven parameters: B-V, log(Re), nV , $μ$e , H$β$ , D4000 , log(M *). Results. We have obtained a tree for the combined samples and do not find different evolutionary paths for cluster and field galaxies. However, the cluster galaxies seem to have accelerated evolution in the
Context. The chemical tagging technique is a promising approach to reconstruct the history of the Galaxy by only using stellar chemical abundances. Different studies have undertaken this analysis and they raised several challenges. Aims. Using a sample of open clusters stars, we wish to address two issues: minimize chemical abundance differences which origin is linked to the evolutionary stage of the stars and not their original composition; evaluate a phylogenetic approach to group stars based on their chemical composition. Methods. We derived differential chemical abundances for 207 stars (belonging to 34 open clusters) using the Sun as reference star (classical approach) and a dwarf plus a giant star from the open cluster M67 as reference (new approach). These abundances were then used to perform two phylogenetic analyses, cladistics (Maximum Parsimony) and Neighbour-Joining, together with a partitioning unsupervised classification analysis with k-means. The resulting groupings were finally confronted to the true open cluster memberships of the stars. Results. We successfully reconstruct most of the original open clusters when carefully selecting a subset of the abundances deriv
Multivariate clustering in astrophysics is a recent development justified by the bigger and bigger surveys of the sky. The phylogenetic approach is probably the most unexpected technique that has appeared for the unsupervised classification of galaxies, stellar populations or globular clusters. On one side, this is a somewhat natural way of classifying astrophysical entities which are all evolving objects. On the other side, several conceptual and practical difficulties arize, such as the hierarchical representation of the astrophysical diversity, the continuous nature of the parameters, and the adequation of the result to the usual practice for the physical interpretation. Most of these have now been solved through the studies of limited samples of stellar clusters and galaxies. Up to now, only the Maximum Parsimony (cladistics) has been used since it is the simplest and most general phylogenetic technique. Probabilistic and network approaches are obvious extensions that should be explored in the future.
Phylogenetic approaches are finding more and more applications outside the field of biology. Astrophysics is no exception since an overwhelming amount of multivariate data has appeared in the last twenty years or so. In particular, the diversification of galaxies throughout the evolution of the Universe quite naturally invokes phylogenetic approaches. We have demonstrated that Maximum Parsimony brings useful astrophysical results, and we now proceed toward the analyses of large datasets for galaxies. In this talk I present how we solve the major difficulties for this goal: the choice of the parameters, their discretization, and the analysis of a high number of objects with an unsupervised NP-hard classification technique like cladistics. 1. Introduction How do the galaxy form, and when? How did the galaxy evolve and transform themselves to create the diversity we observe? What are the progenitors to present-day galaxies? To answer these big questions, observations throughout the Universe and the physical modelisation are obvious tools. But between these, there is a key process, without which it would be impossible to extract some digestible information from the complexity of these
Galaxy diversification proceeds by transforming events like accretion, interaction or mergers. These explain the formation and evolution of galaxies that can now be described with many observables. Multivariate analyses are the obvious tools to tackle the datasets and understand the differences between different kinds of objects. However, depending on the method used, redundancies, incompatibilities or subjective choices of the parameters can void the usefulness of such analyses. The behaviour of the available parameters should be analysed before an objective reduction of dimensionality and subsequent clustering analyses can be undertaken, especially in an evolutionary context. We study a sample of 424 early-type galaxies described by 25 parameters, ten of which are Lick indices, to identify the most structuring parameters and determine an evolutionary classification of these objects. Four independent statistical methods are used to investigate the discriminant properties of the observables and the partitioning of the 424 galaxies: Principal Component Analysis, K-means cluster analysis, Minimum Contradiction Analysis and Cladistics. (abridged)
In the past ten years this author published some 15 highly mathematical papers about his new Evo-SETI (Evolution and SETI) Theory. He proved that key features of Evo-SETI are: 1) The Statistical Drake Equation is the extension of the classical Drake equation into Statistics. Probability distributions of the number of ET civilizations in the Galaxy are given, and so is the probable distribution of the distance of ETs from us. 2) Darwinian Evolution is re-defined as a Geometric Brownian Motion (GBM) in the number of living species on Earth over the last 3.5 billion years. Its mean value grows exponentially in time and Mass Extinctions of the past are accounted for as unpredictable low GBM values. 3) The exponential growth of the number of species during Evolution is the geometric locus of the peaks of a one-parameter family of lognormal distributions constrained between the time axis and the exponential mean value. This accounts for cladistics. 4) The lifespan of a living being, let it be a cell, an animal, a human, a historic human society, or even an ET society, is mathematically described as a finite b-lognormal. This author then described mathematically the historical development
The connection between multifrequency quasar observational and physical parameters related to accretion processes is still open to debate. In the last 20 year, Eigenvector 1-based approaches developed since the early papers by Boroson and Green (1992) and Sulentic et al. (2000b) have been proven to be a remarkably powerful tool to investigate this issue, and have led to the definition of a quasar "main sequence". In this paper we perform a cladistic analysis on two samples of 215 and 85 low-z quasars (z 0.7) which were studied in several previous works and which offer a satisfactory coverage of the Eigenvector 1-derived main sequence. The data encompass accurate measurements of observational parameters which represent key aspects associated with the structural diversity of quasars. Cladistics is able to group sources radiating at higher Eddington ratios, as well as to separate radio-quiet (RQ) and radio-loud (RL) quasars. The analysis suggests a black hole mass threshold for powerful radio emission and also properly distinguishes core-dominated and lobe-dominated quasars, in accordance with the basic tenet of RL unification schemes. Considering that black hole mass provides a sort