共找到 20 条结果
The Los Angeles wildfires of January 2025 caused more than 250 billion dollars in damage and lasted for nearly an entire month before containment. Following our previous work, the Digital Twin Building, we modify and leverage the multi-agent large language model framework as well as the cloud-mapping integration to study the air quality during the Los Angeles wildfires. Recent advances in large language models have allowed for out-of-the-box automated large-scale data analysis. We use a multi-agent large language system comprised of an Instructor agent and Worker agents. Upon receiving the users' instructions, the Instructor agent retrieves the data from the cloud platform and produces instruction prompts to the Worker agents. The Worker agents then analyze the data and provide summaries. The summaries are finally input back into the Instructor agent, which then provides the final data analysis. We test this system's capability for data-based policy recommendation by assessing our Instructor-Worker LLM system's health recommendations based on air quality during the Los Angeles wildfires.
Climate-driven wildfires are intensifying, particularly in urban regions such as Southern California. Yet, traditional fire risk communication tools often fail to gain public trust due to inaccessible design, non-transparent outputs, and limited contextual relevance. These challenges are especially critical in high-risk communities, where trust depends on how clearly and locally information is presented. Neighborhoods such as Pacific Palisades, Pasadena, and Altadena in Los Angeles exemplify these conditions. This study introduces a community-led approach for integrating AI into wildfire risk assessment using the Participatory AI Literacy and Explainability Integration (PALEI) framework. PALEI emphasizes early literacy building, value alignment, and participatory evaluation before deploying predictive models, prioritizing clarity, accessibility, and mutual learning between developers and residents. Early engagement findings show strong acceptance of visual, context-specific risk communication, positive fairness perceptions, and clear adoption interest, alongside privacy and data security concerns that influence trust. Participants emphasized localized imagery, accessible explanatio
Surface ozone pollution remains a persistent challenge in many metropolitan regions worldwide, as the nonlinear dependence of ozone formation on nitrogen oxides and volatile organic compounds (VOCs) complicates the design of effective emission control strategies. While chemical transport models provide mechanistic insights, they rely on detailed emission inventories and are computationally expensive. This study develops a machine learning--based surrogate framework inspired by the Empirical Kinetic Modeling Approach (EKMA). Using hourly air quality observations from Los Angeles during 2024--2025, a random forest model is trained to predict surface ozone concentrations based on precursor measurements and spatiotemporal features, including site location and cyclic time encodings. The model achieves strong predictive performance, with permutation importance highlighting the dominant roles of diurnal temporal features and nitrogen dioxide, along with additional contributions from carbon monoxide. Building on the trained surrogate, EKMA-style sensitivity experiments are conducted by perturbing precursor concentrations while holding other covariates fixed. The results indicate that ozone
Wildfires have become increasingly frequent, irregular, and severe in recent years. Understanding how affected populations perceive and respond during wildfire crises is critical for timely and empathetic disaster response. Social media platforms offer a crowd-sourced channel to capture evolving public discourse, providing hyperlocal information and insight into public sentiment. This study analyzes Reddit discourse during the 2025 Los Angeles wildfires, spanning from the onset of the disaster to full containment. We collect 385 posts and 114,879 comments related to the Palisades and Eaton fires. We adopt topic modeling methods to identify the latent topics, enhanced by large language models (LLMs) and human-in-the-loop (HITL) refinement. Furthermore, we develop a hierarchical framework to categorize latent topics, consisting of two main categories, Situational Awareness (SA) and Crisis Narratives (CN). The volume of SA category closely aligns with real-world fire progressions, peaking within the first 2-5 days as the fires reach the maximum extent. The most frequent co-occurring category set of public health and safety, loss and damage, and emergency resources expands on a wide ra
We show reflectivity cross-sections for the San Gabriel, Chino, and San Bernardino basins north of Los Angeles, California determined from autocorrelations of ambient noise and teleseismic earthquake waves. These basins are thought to channel the seismic energy from earthquakes on the San Andreas Fault to Los Angeles and a more accurate model of their depth is important for hazard mitigation. We use the causal side of the autocorrelation function to determine the zero-offset reflection response. To minimize the smoothing effect of the source time function, we remove the common mode from the autocorrelation in order to reveal the zero-offset reflection response. We apply this to 10 temporary nodal lines consisting of a total of 758 geophones with an intraline spacing of 250-300 m. We also show that the autocorrelation function from teleseismic events can provide illumination of subsurface that is consistent with ambient noise. Both autocorrelation results compare favorably to receiver functions.
This study investigates the impact of time since fire on bird community composition in Southern California chaparral ecosystems. We surveyed avian richness and abundance across 14 sites representing a 0 to 25 year post-fire chronosequence in Los Angeles County. Sites burned within the last five years supported fewer species, primarily dominated by generalists, while mid- to late-successional sites exhibited greater richness and a higher proportion of specialists. These patterns corresponded with increases in vegetation structural complexity over time. However, no consistent relationships were found between bird communities and abiotic variables, such as weather, temperature, and elevation, likely due to the single-visit sampling design. Our results align with successional theory and underscore the ecological importance of fire return intervals that allow full chaparral recovery. Restoration and management should prioritize long-term structural development, invasive grass control, and post-fire heterogeneity to support diverse and resilient avian communities.
This study develops a Bayesian hierarchical model to explore the effects of air pollution on respiratory and cardiovascular mortality in Los Angeles County. The model takes into account various pollutants such as PM2.5, PM10, CO, SO2, NO2 and O3, as well as a related meteorological factor: temperature. The objective is to identify the significant factors affecting selected health outcomes without including all variables in each model specification. This flexibility enables the model to capture key drivers of health risk without redundancy. To account for potential measurement error in pollution data due to imperfect monitoring or averaging, certain observed pollutant levels are treated as noise proxies for true exposure. By specifying priors for regression coefficients and measurement error parameters and estimating posterior distributions via Markov Chain Monte Carlo (MCMC) sampling, it leads to more precise and reliable estimates of the health risks associated with air pollution exposure in Los Angeles County by incorporating both the count nature of the health data and the uncertainties in pollution measurements.
Nutrition estimation is an important component of promoting healthy eating and mitigating diet-related health risks. Despite advances in tasks such as food classification and ingredient recognition, progress in nutrition estimation is limited due to the lack of datasets with nutritional annotations. To address this issue, we introduce FastFood, a dataset with 84,446 images across 908 fast food categories, featuring ingredient and nutritional annotations. In addition, we propose a new model-agnostic Visual-Ingredient Feature Fusion (VIF$^2$) method to enhance nutrition estimation by integrating visual and ingredient features. Ingredient robustness is improved through synonym replacement and resampling strategies during training. The ingredient-aware visual feature fusion module combines ingredient features and visual representation to achieve accurate nutritional prediction. During testing, ingredient predictions are refined using large multimodal models by data augmentation and majority voting. Our experiments on both FastFood and Nutrition5k datasets validate the effectiveness of our proposed method built in different backbones (e.g., Resnet, InceptionV3 and ViT), which demonstrat
I study Hodge decomposition (HodgeRank) for urban traffic flow on two graph representations: dense origin--destination (OD) graphs and road-segment networks. Reproducing the method of Aoki et al., we observe that on dense OD graphs the curl and harmonic components are negligible and the potential closely tracks node divergence, limiting the added value of Hodge potentials. In contrast, on a real road network (UTD19, downtown Los Angeles; 15-minute resolution), potentials differ substantially from divergence and exhibit clear morning/evening reversals consistent with commute patterns. We quantify smoothness and discriminability via local/global variances derived from the graph spectrum, and propose flow-aware embeddings that combine topology, bidirectional volume, and net-flow asymmetry for clustering. Code and preprocessing steps are provided to facilitate reproducibility.
The Burning Index (BI) produced daily by the United States government's National Fire Danger Rating System is commonly used in forecasting the hazard of wildfire activity in the United States. However, recent evaluations have shown the BI to be less effective at predicting wildfires in Los Angeles County, compared to simple point process models incorporating similar meteorological information. Here, we explore the forecasting power of a suite of more complex point process models that use seasonal wildfire trends, daily and lagged weather variables, and historical spatial burn patterns as covariates, and that interpolate the records from different weather stations. Results are compared with models using only the BI. The performance of each model is compared by Akaike Information Criterion (AIC), as well as by the power in predicting wildfires in the historical data set and residual analysis. We find that multiplicative models that directly use weather variables offer substantial improvement in fit compared to models using only the BI, and, in particular, models where a distinct spatial bandwidth parameter is estimated for each weather station appear to offer substantially improved f
Freight truck-related crashes pose significant challenges, leading to substantial economic losses, injuries, and fatalities, with pronounced spatial disparities across different regions. This study adopts a transport geography perspective to examine spatial justice concerns by employing deep counterfactual inference models to analyze how socioeconomic disparities, road infrastructure, and environmental conditions influence the geographical distribution and severity of freight truck crashes. By integrating road network datasets, socioeconomic attributes, and crash records from the Los Angeles metropolitan area, this research provides a nuanced spatial analysis of how different communities are disproportionately impacted. The results reveal significant spatial disparities in crash severity across areas with varying population densities, income levels, and minority populations, highlighting the pivotal role of infrastructural and environmental improvements in mitigating these disparities. The findings offer insights into targeted, location-specific policy interventions, suggesting enhancements in road infrastructure, lighting, and traffic control systems, particularly in low-income an
A reasonable and balanced diet is essential for maintaining good health. With the advancements in deep learning, automated nutrition estimation method based on food images offers a promising solution for monitoring daily nutritional intake and promoting dietary health. While monocular image-based nutrition estimation is convenient, efficient, and economical, the challenge of limited accuracy remains a significant concern. To tackle this issue, we proposed DPF-Nutrition, an end-to-end nutrition estimation method using monocular images. In DPF-Nutrition, we introduced a depth prediction module to generate depth maps, thereby improving the accuracy of food portion estimation. Additionally, we designed an RGB-D fusion module that combined monocular images with the predicted depth information, resulting in better performance for nutrition estimation. To the best of our knowledge, this was the pioneering effort that integrated depth prediction and RGB-D fusion techniques in food nutrition estimation. Comprehensive experiments performed on Nutrition5k evaluated the effectiveness and efficiency of DPF-Nutrition.
While agriculture is recognised as vital for improving nutrition, the evidence linking women's participation to sustained nutritional gains remains inconclusive. This review synthesizes studies published between 2000 and 2024 to reflect current agricultural practices and nutritional challenges. We examine how agricultural practices and time use affect nutritional outcomes among rural women through pathways such as income generation food preparation and intra-household labour allocation. A structured methodology with clear inclusion and exclusion criteria was used to assess gender-sensitive and nutrition-sensitive interventions. Using narrative synthesis the review categorizes findings around key themes and contextual factors including socio-economic status seasonality and labour intensity. The results show that while increased involvement in agriculture can boost household dietary diversity and income it also raises time burdens that affect food preparation childcare and self-care. Positive outcomes occur when interventions enhance women's decision-making power income access and use of time-saving technologies whereas negative outcomes emerge when excessive workloads compromise ene
Large Multimodal Models (LMMs) are increasingly applied to meal images for nutrition analysis. However, existing work primarily evaluates proprietary models, such as GPT-4. This leaves the broad range of LLMs underexplored. Additionally, the influence of integrating contextual metadata and its interaction with various reasoning modifiers remains largely uncharted. This work investigates how interpreting contextual metadata derived from GPS coordinates (converted to location/venue type), timestamps (transformed into meal/day type), and the food items present can enhance LMM performance in estimating key nutritional values. These values include calories, macronutrients (protein, carbohydrates, fat), and portion sizes. We also introduce \textbf{ACETADA}, a new food-image dataset slated for public release. This open dataset provides nutrition information verified by the dietitian and serves as the foundation for our analysis. Our evaluation across eight LMMs (four open-weight and four closed-weight) first establishes the benefit of contextual metadata integration over straightforward prompting with images alone. We then demonstrate how this incorporation of contextual information enhan
Developing and improving computational approaches to covering news can increase journalistic output and improve the way stories are covered. In this work we approach the problem of covering crime stories in Los Angeles. We present a machine-in-the-loop system that covers individual crimes by (1) learning the prototypical coverage archetypes from classical news articles on crime to learn their structure and (2) using output from the Los Angeles Police department to generate "lede paragraphs", first structural unit of crime-articles. We introduce a probabilistic graphical model for learning article structure and a rule-based system for generating ledes. We hope our work can lead to systems that use these components together to form the skeletons of news articles covering crime. This work was done for a class project in Jonathan May's Advanced Natural Language Processing Course, Fall, 2019.
Vehicular air pollution has created an ongoing air quality and public health crisis. Despite growing knowledge of racial injustice in exposure levels, less is known about the relationship between the production of and exposure to such pollution. This study assesses pollution burden by testing whether local populations' vehicular air pollution exposure is proportional to how much they drive. Through a Los Angeles, California case study we examine how this relates to race, ethnicity, and socioeconomic status -- and how these relationships vary across the region. We find that, all else equal, tracts whose residents drive less are exposed to more air pollution, as are tracts with a less-White population. Commuters from majority-White tracts disproportionately drive through non-White tracts, compared to the inverse. Decades of racially-motivated freeway infrastructure planning and residential segregation shape today's disparities in who produces vehicular air pollution and who is exposed to it, but opportunities exist for urban planning and transport policy to mitigate this injustice.
The objective of our research is to present the change in crime rates in Los Angeles post-Covid19. Using data analysis with Geo-Mapping, bubbles, Marimekko, and a time series charts, we can illustrate which areas have the largest crime rate, and how it has changed. Through regression modeling, we can interpret which locations may also have a correlation to crime versus income, race, type of crime, and gender. The story will help to uncover whether the areas associated with crime are due to demographic or income variance. In showing the details of crimes in Los Angeles along with the factors at play we hope to see a compelling relationship between crime rates and recent events from 2020 to the present, along with changes in crime type trends during these periods. We use Excel to clean the data for SAP SAC to model effectively, as well as resources from other studies a comparison.
Background: On March 19, 2020, the government of California ordered all 40 million Californians to stay at home in the coming weeks as the result of the escalation of the coronavirus disease 2019 (COVID-19) pandemic. As lockdowns were implemented, the significant changes caused by these restrictions brought a dramatic improvement in air quality in metropolitan cities such as Los Angeles (LA Basin).Methods: We use real-time data from The South Coast Air Quality Management District (South Coast AQMD), and the California Department of Transportation to evaluate the drivers of the pollution sources. We also mapped monthly spatial variations and constructed hourly heatmaps of those pollutants in 2020 to understand the impacts of the lockdown on different locations and times of day in the LA Basin. Results: Compared to the same dates in 2019, traffic flow on highways in the Los Angeles Basin dropped by 20.86 % when the stay at home order was initiated and it continued to decrease along with dramatic declines in NO2, CO, and PM2.5. The correlation (Pierson r) between truck flow change and changes of NO2, CO, and PM2.5 is statistically significant. Conclusion: The declines in truck flow ar
Polygenic risk scores (PRS) summarize the combined effect of common risk variants and are associated with breast cancer risk in patients without identifiable monogenic risk factors. One of the most well-validated PRSs in breast cancer to date is PRS313, which was developed from a Northern European biobank but has shown attenuated performance in non-European ancestries. We further investigate the generalizability of the PRS313 for American women of European (EA), African (AFR), Asian (EAA), and Latinx (HL) ancestry within one institution with a singular EHR system, genotyping platform, and quality control process. We found that the PRS313 achieved overlapping Areas under the ROC Curve (AUCs) in females of Lantix (AUC, 0.68; 95 CI, 0.65-0.71) and European ancestry (AUC, 0.70; 95 CI, 0.69-0.71) but lower AUCs for the AFR and EAA populations (AFR: AUC, 0.61; 95 CI, 0.56-0.65; EAA: AUC, 0.64; 95 CI, 0.60-0.680). While PRS313 is associated with Hormone Positive (HR+) disease in European Americans (OR, 1.42; 95 CI, 1.16-1.64), for Latinx females, it may be instead associated with Human Epidermal Growth Factor Receptor 2 (HER2+) disease (OR, 2.52; 95 CI, 1.35-4.70) although due to small nu
Accurate channel model and channel estimation are essential to empower extremely large-scale MIMO (XL-MIMO) in 6G networks with ultra-high spectral efficiency. With the sharp increase in the antenna array aperture of the XL-MIMO scenario, the electromagnetic propagation field will change from far-field to near-field. Unfortunately, due to the near-field effect, most of the existing XL-MIMO channel models fail to describe mixed line-of-sight (LoS) and non-line-of-sight (NLoS) path components simultaneously. In this paper, a mixed LoS/NLoS near-field XL-MIMO channel model is proposed to match the practical near-field XL-MIMO scenario, where the LoS path component is modeled by the geometric free space propagation assumption while NLoS path components are modeled by the near-field array response vectors. Then, to define the range of near-field for XL-MIMO, the MIMO Rayleigh distance (MIMO-RD) and MIMO advanced RD (MIMO-ARD) is derived. Next, a two stage channel estimation algorithm is proposed, where the LoS path component and NLoS path components are estimated separately. Moreover, the Cramer-Rao lower bound (CRLB) of the proposed algorithm is derived in this paper. Numerical simulat