The basic and effective reproduction numbers are widely used metrics for characterizing the dynamics of infectious disease epidemics. However, the interpretation of these numbers is based on the assumption of homogeneous mixing and may not hold in real-world populations where the contact patterns deviate from that assumption. In this paper, we present a network-based framework to compare reproduction numbers in populations with and without spatial structure, while other parameters of the disease remain fixed. Using this framework, we show that in homogeneously mixed populations, in the absence of external interventions, the effective reproduction number decreases exponentially as the susceptible population declines. In contrast, in spatially structured populations, the basic reproduction number is smaller, and the effective reproduction number initially decreases faster but eventually converges to unity. We show that the reproduction number is determined by the level of competition between infectious nodes, which is governed by the network structure. Our results suggest that without knowledge of the network structure, reproduction numbers may not be informative for parameterizing t
Demographic forecasting remains a fundamental challenge for policy planning in rapidly evolving nations such as India, where fertility transitions, policy interventions, and age structured dynamics interact in complex ways. In this study, we present a hybrid modelling framework that integrates policy-aware fertility functions into a Physics-Informed Neural Network (PINN) enhanced with Long Short-Term Memory (LSTM) networks to capture physical constraints and temporal dependencies in population dynamics. The model is applied to India's age structured population from 2024 to 2054 under three fertility-policy scenarios: continuation of current fertility decline, stricter population control, and relaxed fertility promotion. The governing transport-reaction partial differential equation is formulated with India-specific demographic indicators, including age-specific fertility and mortality rates. PINNs embed the core population equation and policy-driven fertility changes, while LSTM layers improve long-term forecasting across decades. Results show that fertility policies substantially shape future age distribution, dependency ratios, and workforce size. Stricter controls intensify agei
This paper introduces a new factor contributing to the decline in marriage and fertility: the growth of leisure technology. Over recent decades, high-income countries have experienced two notable shifts in household and family dynamics. First, there has been a significant decline in marriage rates and fertility. Second, time has increasingly been allocated to leisure activities. This paper presents a unified model of marriage and fertility, incorporating intra-household bargaining dynamics. The model, calibrated using data from Japan between 2019 and 2023, is employed to assess the impact of leisure technology growth on marriage and fertility during 2005-2009. The findings highlight that leisure technology growth makes single life relatively more appealing compared to marriage and parenthood. The model explains 21.1% of the decline in marriage and 73.1% of the decrease in fertility.
The accelerating shift toward low and ultra-low fertility has intensified the debate over whether countries now undergoing rapid decline are approaching stabilization or entering a more persistent low-fertility regime. Existing projection systems answer that question differently because they embed different assumptions about recovery and about the role of external drivers. To provide an empirical benchmark in this debate, we introduce NeuralTFR, an endogenous global forecasting framework based on a recurrent neural network. Drawing on a harmonized panel of historical fertility series from 196 countries and territories, the model pools cross-country information to learn demographic momentum and generate empirical prediction intervals via multi-quantile regression. Evaluated on a held-out period (2009--2023), NeuralTFR achieves lower point-forecast errors than a Naive Drift baseline and BayesTFR, the United Nations' Bayesian Hierarchical Model, while maintaining competitive uncertainty calibration. In forward projections to 2040, NeuralTFR points to broader exposure to low and very low fertility than BayesTFR, suggesting weaker support for near-term stabilization while still falling
Estimating time-varying reproduction numbers from epidemic incidence data is a central task in infectious disease surveillance, yet it poses an inherently ill-posed inverse problem. Existing approaches often rely on strong structural assumptions derived from epidemiological models, which can limit their ability to adapt to non-stationary transmission dynamics induced by interventions or behavioral changes, leading to delayed detection of regime shifts and degraded estimation accuracy. In this work, we propose a Conditional Inverse Reproduction Learning framework (CIRL) that addresses the inverse problem by learning a {conditional mapping} from historical incidence patterns and explicit time information to latent reproduction numbers. Rather than imposing strongly enforced parametric constraints, CIRL softly integrates epidemiological structure with flexible likelihood-based statistical modeling, using the renewal equation as a forward operator to enforce dynamical consistency. The resulting framework combines epidemiologically grounded constraints with data-driven temporal representations, producing reproduction number estimates that are robust to observation noise while remaining
A novel fertility model based on Thom's nonlinear differential equations of morphogenesis is presented, utilizing a three-dimensional catastrophe surface to capture the interaction between latent non-catastrophic fertility factors and catastrophic shocks. The model incorporates key socioeconomic and environmental variables and is applicable at macro-, meso-, and micro-demographic levels, addressing global fertility declines, regional population disparities, and micro-level phenomena such as teenage pregnancies. This approach enables a comprehensive analysis of reproductive health at aggregate, sub-national, and age-group-specific levels. An agent-based model for teenage pregnancy is described to illustrate how latent factors -- such as education, contraceptive use, and parental guidance -- interact with catastrophic shocks like socioeconomic deprivation, violence, and substance abuse. The bifurcation set analysis shows how minor shifts in socioeconomic conditions can lead to significant changes in fertility rates, revealing critical points in fertility transitions. By integrating Thom's morphogenesis equations with traditional fertility theory, this paper proposes a groundbreaking
This paper investigates the conditions under which the Easterlin hypothesis holds within a neoclassical overlapping generations model with endogenous capital accumulation, wages, interest rates, and fertility. We develop a tractable analytical framework that maps economic transitions into utility space via a continuously differentiable first-order difference equation for cohort lifetime utilities. This reformulation allows for a transparent normative evaluation of non-steady-state paths without requiring explicit solutions to the underlying nonlinear system. Within this framework, we show that when fertility cycles emerge and children are normal goods, the utility of small cohorts strictly exceeds that of large cohorts. Crucially, this cohort-welfare asymmetry is driven by fertility preferences and is independent of the economy's position relative to the golden rule.
We develop a linear one-sex dynamical model of human population reproduction through marriage. In our model, a woman may marry and divorce multiple times; however, only women who are currently married are assumed to bear children. The iterative marriage process is formulated as a three-state compartmental model, which is described by a system of McKendrick equations with a marital birth rate function that depends on the duration of marriage and the age at marriage. To examine the impact of changing nuptiality on fertility, we derive new formulas for the reproduction indices. In particular, the total fertility rate (TFR) is expressed as the product of the total marriage number and the average total marital fertility. Using Japanese vital statistics, we show that our model provides a reasonable estimate of the current TFR and its future trajectory.
The fertility trend in developing countries has experienced a significant decline in the last few decades; at the same time, the role of women in the workplace has improved. To have a better insight of the causality of the rate of women participation in the labor market on the total fertility rate in developing world, this paper divides the dataset of 115 developing countries in the period of 1991-2018 into four continents group (Africa, North/South America, Asia/Pacific, Europe) and then applies a data-driven panel data econometric procedure to mitigate omitted bias. The results suggest that the fertility behaviors of women in the North/South America continents are influenced by their career choice; meanwhile in society of other regions, other factors might be more important to women when thinking of having children. In conclusion, policymakers can reference to the paper and formulate policies to have more incentives in making reproductive decisions and further research in the field needs to consider family policies and patrilocality of developing countries as important data.
Age-specific fertility rates (ASFRs) provide the most extensive record of reproductive change, but their aggregate nature obscures the individual-level behavioral mechanisms that drive fertility trends. To bridge this micro-macro divide, we introduce a likelihood-free Bayesian framework that couples a demographically interpretable, individual-level simulation model of the reproductive process with Sequential Neural Posterior Estimation (SNPE). We show that this framework successfully recovers core behavioral parameters governing contemporary fertility, including preferences for family size, reproductive timing, and contraceptive failure, using only ASFRs. The framework's effectiveness is validated on cohorts from four countries with diverse fertility regimes. Most compellingly, the model, estimated solely on aggregate data, successfully predicts out-of-sample distributions of individual-level outcomes, including age at first sex, desired family size, and birth intervals. Because our framework yields complete synthetic life histories, it significantly reduces the data requirements for building microsimulation models and enables behaviorally explicit demographic forecasts.
We characterize the outcomes of a canonical deterministic model for the intergenerational transmission of capital that features differential fertility. A fertility function determines the relationship between parental capital and the number of children, and a transmission function determines the relationship between the capital of a parent and that of their children. Together these functions generate an evolving cross-sectional distribution of capital. We establish easy-to-verify conditions on the fertility and transmission functions that guarantee (a) that the dynamical system has a steady state distribution that is either atomless (exhibiting inequality) or degenerate (not exhibiting inequality), and (b) that the system converges to such states from essentially any initial distribution. Our characterization provides new insights into the link between differential fertility and long-run cross-sectional inequality, and it gives rise to novel comparative statics relating the two. We apply our results to several parametric examples and to a model of economic growth that features endogenous differential fertility.
Tokenization is a crucial but under-evaluated step in large language models (LLMs). The standard metric, fertility (the average number of tokens per word), captures compression efficiency but obscures how vocabularies are allocated across languages and domains. We analyze six widely used tokenizers across seven languages and two domains, finding stable fertility for English, high fertility for Chinese, and little domain sensitivity. To address fertility's blind spots, we propose the Single Token Retention Rate (STRR), which measures the proportion of words preserved as single tokens. STRR reveals systematic prioritization of English, strong support for Chinese, and fragmentation in Hindi, offering an interpretable view of cross-lingual fairness. Our results show that STRR complements fertility and provides practical guidance for designing more equitable multilingual tokenizers.
There has long been an apparent consensus in the literature on intra-household allocation and fertility that greater paternal involvement in childcare relaxes maternal time constraints, enabling mothers to increase their labor supply or leisure. Recent evidence, particularly from South Korea, challenges this view: increases in fathers' childcare time have coincided with a further increase in mothers' time dedicated to child-rearing. This paper develops an Overlapping Generations (OLG) growth model to address such a puzzle. The central mechanism and our main innovation hinge on the functional form of the childcare technology. When maternal and paternal time are substitutes, the conventional result holds. However, when they are complements, greater paternal involvement necessarily raises maternal childcare time, depressing fertility and redirecting household resources toward child quality. We further argue that the elasticity of substitution should not be interpreted as a pure preference parameter, as it also reflects the social and institutional norms, the skills each parent brings to child-rearing and their intergenerational transmission. The model is extended to study the effectiv
Accurate fertility estimates at fine spatial resolution are essential for localized public health planning, particularly in low- and middle-income countries (LMICs). While national-level indicators such as age-specific fertility rates (ASFR) and total fertility rate (TFR) are often reported through official statistics, they lack the spatial granularity needed to guide targeted interventions. To address this, we develop a framework for subnational fertility estimation using small-area estimation (SAE) techniques applied to birth history data from household surveys, in particular Demographic and Health Surveys (DHS). Disaggregation by geographic area, time period, and maternal age group leads to significant data sparsity, limiting the reliability of direct estimates at fine scales. To overcome this, we propose a suite of methods, including direct estimators, area-level and unit-level Bayesian hierarchical models, to produce accurate estimates across varying spatial resolutions. The model-based approaches incorporate spatiotemporal smoothing and integrate covariates such as maternal education, contraceptive use and urbanicity. Using data from the 2021 Madagascar DHS, we generate distr
Reproducing game bugs, particularly crash bugs in continuously evolving games like Minecraft, is a notoriously manual, time-consuming, and challenging process to automate; insights from a key decision maker from Minecraft we interviewed confirm this, highlighting that a substantial portion of crash reports necessitate manual scenario reconstruction. Despite the success of LLM-driven bug reproduction in other software domains, games, with their complex interactive environments, remain largely unaddressed. This paper introduces BugCraft, a novel end-to-end framework designed to automate the reproduction of crash bugs in Minecraft directly from user-submitted bug reports, addressing the critical gap in automated game bug reproduction. BugCraft employs a two-stage approach: first, a Step Synthesizer leverages LLMs and Minecraft Wiki knowledge to transform bug reports into high-quality, structured steps to reproduce (S2R). Second, an Action Model, powered by a vision-based LLM agent and a custom macro API, executes these S2R steps within Minecraft to trigger the reported crash. To facilitate evaluation, we introduce BugCraft-Bench, a curated dataset of Minecraft crash bug reports. On Bu
Low total fertility rates throughout the world have lead to concerns about economic growth, military security, international political power, environment impacts, and quality of life. Overall total fertility rates of today's societies are complex emergent functions of culture, biology, and economic policies that are notoriously difficult to forecast. In order to study the dynamic, stochastic nature of total fertility rates, population and wealth trajectories as functions of infertility and birth cost are generated from a minimal, endogenous, agent-based model of a simple foraging economy. A harvesting model from mathematical ecology is added to reflect death by "natural causes". With these added limits of finite lifespans, decreasing total fertility rates are shown to lead to population levels consistently below the actual carry capacity of the landscape. These below carry-capacity population levels generate higher total and per capita wealth. The stochastic population trajectories generated demonstrate instabilities that significantly increase the likelihood of extinction within reasonable time frames. Society may possibly be encouraged by this increasing wealth (and perhaps reduc
The sterile insect technique controls mosquito-borne diseases such as malaria, dengue, and yellow fever through either eradication or depressing the associated vector population. We formulate a three-dimensional delayed mosquito population suppression model with a saturated release rate to explore the interactive dynamics between wild, sterile, and non-sterile mosquitoes, focusing on the delay and residual fertility in the interactive dynamics among insects. We investigate the stability of the positive equilibrium and derive the Hopf bifurcation conditions. We establish the stability conditions for the positive equilibrium and examine how the time delay ($τ$) and residual fertility affect the non-sterile insects' dynamics. Below the critical values of the delay, the system remains stable, while beyond that, the Hopf bifurcation is guaranteed under certain circumstances. However, analysis shows a clear band of non-sterile insect population values as residual fertility varies within a very narrow range. This suggests that within this interval, the system exhibits sensitive dependence on the fertility parameter, likely due to underlying nonlinear dynamics. Numerical simulations are pr
We show that translation quality can be predicted with surprising accuracy \textit{without ever running the translation system itself}. Using only a handful of features, token fertility ratios, token counts, and basic linguistic metadata (language family, script, and region), we can forecast ChrF scores for GPT-4o translations across 203 languages in the FLORES-200 benchmark. Gradient boosting models achieve favorable performance ($R^{2}=0.66$ for XX$\rightarrow$English and $R^{2}=0.72$ for English$\rightarrow$XX). Feature importance analyses reveal that typological factors dominate predictions into English, while fertility plays a larger role for translations into diverse target languages. These findings suggest that translation quality is shaped by both token-level fertility and broader linguistic typology, offering new insights for multilingual evaluation and quality estimation.
The social sciences have produced an impressive body of research on determinants of fertility outcomes, or whether and when people have children. However, the strength of these determinants and underlying theories are rarely evaluated on their predictive ability on new data. This prevents us from systematically comparing studies, hindering the evaluation and accumulation of knowledge. In this paper, we present two datasets which can be used to study the predictability of fertility outcomes in the Netherlands. One dataset is based on the LISS panel, a longitudinal survey which includes thousands of variables on a wide range of topics, including individual preferences and values. The other is based on the Dutch register data which lacks attitudinal data but includes detailed information about the life courses of millions of Dutch residents. We provide information about the datasets and the samples, and describe the fertility outcome of interest. We also introduce the fertility prediction data challenge PreFer which is based on these datasets and will start in Spring 2024. We outline the ways in which measuring the predictability of fertility outcomes using these datasets and combinin
Fertility differentials by urban-rural residence and nativity of women in Australia significantly impact population composition at sub-national levels. We aim to provide consistent fertility forecasts for Australian women characterized by age, region, and birthplace. Age-specific fertility rates at the national and sub-national levels obtained from census data between 1981-2011 are jointly modeled and forecast by the grouped functional time series method. Forecasts for women of each region and birthplace are reconciled following the chosen hierarchies to ensure that results at various disaggregation levels consistently sum up to the respective national total. Coupling the region of residence disaggregation structure with the trace minimization reconciliation method produces the most accurate point and interval forecasts. In addition, age-specific fertility rates disaggregated by the birthplace of women show significant heterogeneity that supports the application of the grouped forecasting method.