Found 20 results
We present Paris 2.0, the first video generation model pre-trained through decentralized computation. Its training recipe builds upon Paris 1.0 (arXiv:2510.03434), the first ever open-weight Decentralized Diffusion Model (DDM), which showed that image generation can be trained without a monolithic GPU cluster. However, temporally coherent video generation had remained an open problem under decentralized training, and Paris 2.0 closes it. In low-resolution text-to-video training, against a monolithic model trained on the same data under a matched total compute budget, Paris 2.0 cuts Frechet Video Distance (FVD) from 561.04 to 279.01, a ~2.0x improvement, and lifts CLIP text-video similarity and aesthetic score.
We present Paris, the first publicly released diffusion model pre-trained entirely through decentralized computation. Paris demonstrates that high-quality text-to-image generation can be achieved without centrally coordinated infrastructure. Paris is open for research and commercial use. Paris required implementing our Distributed Diffusion Training framework from scratch. The model consists of 8 expert diffusion models (129M-605M parameters each) trained in complete isolation with no gradient, parameter, or intermediate activation synchronization. Rather than requiring synchronized gradient updates across thousands of GPUs, we partition data into semantically coherent clusters where each expert independently optimizes its subset while collectively approximating the full distribution. A lightweight transformer router dynamically selects appropriate experts at inference, achieving generation quality comparable to centrally coordinated baselines. Eliminating synchronization enables training on heterogeneous hardware without specialized interconnects. Empirical validation confirms that Paris's decentralized training maintains generation quality while removing the dedicated GPU cluster
The challenge of \textbf{imbalanced regression} arises when standard Empirical Risk Minimization (ERM) biases models toward high-frequency regions of the data distribution, causing severe degradation on rare but high-impact ``tail'' events. Existing strategies such as loss re-weighting or synthetic over-sampling often introduce noise, distort the underlying distribution, or add substantial algorithmic complexity. We introduce \textbf{PARIS} (Pruning Algorithm via the Representer theorem for Imbalanced Scenarios), a principled framework that mitigates imbalance by \emph{optimizing the training set itself}. PARIS leverages the representer theorem for neural networks to compute a \textbf{closed-form representer deletion residual}, which quantifies the exact change in validation loss caused by removing a single training point \emph{without retraining}. Combined with an efficient Cholesky rank-one downdating scheme, PARIS performs fast, iterative pruning that eliminates uninformative or performance-degrading samples. We use a real-world space weather example, where PARIS reduces the training set by up to 75\% while preserving or improving overall RMSE, outperforming re-weighting, synthet
To answer the questions of whether global warming is accelerating and when the 1.5°C Paris Agreement target will be exceeded, the global mean surface temperature from 1880 to 2025 is first examined using a purely graphical approach and later, in a more conventional way, using various time-domain and frequency-domain methods. In an effort to reduce variability, exogenous variables such as El Niño and solar variations are taken into account. Although it ultimately remains unclear to what extent these variables are actually helpful, we feel confident in summarizing the empirical results of this study to suggest that global warming is indeed accelerating and that a breach of the 1.5°C Paris Agreement target is imminent. But when it comes to statistical significance, caution should still be exercised. While the acceleration hypothesis can be confirmed with a fair degree of certainty under reasonably plausible assumptions (albeit with the help of a bit of data snooping, which is unavoidable when building on the results of earlier studies that used virtually the same data), there is currently not enough evidence to prove that the 1.5°C target has already been exceeded. However, if 2026 an
In this paper, gradient boosting is used to forecast the Q(.95) values of air temperature and the Steadman Heat Index. Paris, France during the late spring and summer months is the major focus. Predictors and responses are drawn from the Paris-Montsouris weather station for the years 2018 through 2024. Q(.95) values are used because of interest in summer heat that is statistically rare and extreme. The data are curated as a multiple time series for each year. Predictors include seven routinely collected indicators of weather conditions. They each are lagged by 14 days such that temperature and heat index forecasts are provided two weeks in advance. Forecasting uncertainty is addressed with conformal prediction regions. Forecasting accuracy is promising. Cairo, Egypt is a second location, using data from the weather station at the Cairo International Airport over the same years and months. Cairo is a more challenging setting for temperature forecasting because its desert climate can create abrupt and erratic temperature changes. Yet, there is some progress forecasting record-setting hot days.
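The two mechanics named in the abstract above, forecasting a high quantile and lagging predictors by 14 days, can be sketched as follows. This is a minimal illustration and not the paper's code: the function names `pinball_loss` and `lag_predictors` and the `(predictor, response)` row layout are assumptions made for the example. The pinball (quantile) loss is the standard objective that a quantile forecaster minimises; at q = 0.95 it penalises under-prediction 19 times more heavily than over-prediction.

```python
def pinball_loss(y_true, y_pred, q):
    """Average pinball (quantile) loss at level q, with 0 < q < 1."""
    if not 0.0 < q < 1.0:
        raise ValueError("q must lie strictly between 0 and 1")
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for y, f in zip(y_true, y_pred):
        diff = y - f
        # Under-prediction (diff >= 0) costs q per unit,
        # over-prediction costs (1 - q) per unit.
        total += q * diff if diff >= 0 else (q - 1.0) * diff
    return total / len(y_true)


def lag_predictors(rows, lag):
    """Pair the predictor observed on day t with the response on day t + lag.

    rows is a list of (predictor, response) tuples in time order
    (a hypothetical layout chosen for this sketch).
    """
    if lag < 0:
        raise ValueError("lag must be non-negative")
    return [(rows[t][0], rows[t + lag][1]) for t in range(len(rows) - lag)]
```

With `lag=14`, each training pair couples today's weather indicators with the temperature two weeks later, which is what makes the fitted model a two-week-ahead forecaster.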
As a form of "small A", quantile machine learning is used to forecast diurnal and nocturnal $Q(.90)$ air temperatures for Paris, France from late spring through the summer months of 2021. The data are provided by the Paris-Montsouris weather station. Rather than trying to directly anticipate the onset and cessation of reported heat waves, Q(.90) values are estimated. The 90th percentile is chosen so that exceedances represent relatively rare and extreme conditions. Predictors include eight routinely available indicators of weather conditions, lagged by 14 days. Using holdout data, the temperature forecasts are produced two weeks in advance. Adaptive conformal prediction regions are computed that, under exchangeability, provide provably valid finite-sample coverage of forecasting uncertainty. For both diurnal and nocturnal temperatures, forecasting accuracy in the holdout data is promising, and sound measures of uncertainty are coupled with a novel decision-making framework. Benefits for policy and practice follow.
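To make the idea of finite-sample coverage under exchangeability concrete, here is a minimal sketch of plain split conformal prediction. It is a simpler stand-in for the adaptive conformal regions the abstract describes, not the author's procedure. The names `conformal_radius` and `conformal_interval`, and the choice of absolute residuals as the nonconformity score, are assumptions made for this example.

```python
import math


def conformal_radius(calibration_residuals, alpha):
    """Half-width of a split conformal interval with coverage >= 1 - alpha.

    calibration_residuals are (observed - forecast) values on held-out
    calibration data that was not used to fit the forecaster.
    """
    n = len(calibration_residuals)
    if n == 0:
        raise ValueError("need at least one calibration residual")
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    scores = sorted(abs(r) for r in calibration_residuals)
    # Finite-sample correction: take the ceil((n + 1)(1 - alpha))-th
    # smallest score rather than the plain empirical quantile.
    k = math.ceil((n + 1) * (1.0 - alpha))
    if k > n:
        return math.inf  # too few calibration points for this alpha
    return scores[k - 1]


def conformal_interval(point_forecast, calibration_residuals, alpha):
    """Symmetric prediction interval around a point forecast."""
    radius = conformal_radius(calibration_residuals, alpha)
    return (point_forecast - radius, point_forecast + radius)
```

The `(n + 1)` correction is what yields the finite-sample guarantee: if the calibration scores and the new score are exchangeable, the new score falls at or below the selected order statistic with probability at least 1 - alpha.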
Non-linear state-space models, also known as general hidden Markov models, are ubiquitous in statistical machine learning, being the most classical generative models for serial data and sequences in general. The particle-based, rapid incremental smoother PaRIS is a sequential Monte Carlo (SMC) technique allowing for efficient online approximation of expectations of additive functionals under the smoothing distribution in these models. Such expectations appear naturally in several learning contexts, such as maximum likelihood estimation (MLE) and Markov score climbing (MSC). PaRIS has linear computational complexity, limited memory requirements and comes with non-asymptotic bounds, convergence results and stability guarantees. Still, being based on self-normalised importance sampling, the PaRIS estimator is biased. Our first contribution is to design a novel additive smoothing algorithm, the Parisian particle Gibbs (PPG) sampler, which can be viewed as a PaRIS algorithm driven by conditional SMC moves, resulting in bias-reduced estimates of the targeted quantities. We substantiate the PPG algorithm with theoretical results, including new bounds on bias and variance as well as deviation inequa
The Solar Maximum Mission (SMM) of NASA was one of the first satellites with on-board digitization of observations. It was launched for the solar maximum of cycle 21 (1980) in order to study solar activity. It carried many instruments, such as coronagraphs, X and $γ$ ray detectors, an ultraviolet spectrometer and a radiometer. Ground-based support was offered by many institutes, such as Paris Meudon observatory, in the form of systematic observations or coordinated campaigns with specific instruments. We present here the Meudon Solar Tower (MST) and magnetograph, which made a major contribution in the eighties with observations of velocity and magnetic fields of the photosphere and chromosphere, while SMM was observing the transition region and corona above.
In patent prosecution, timely and effective responses to Office Actions (OAs) are crucial for securing patents. However, past automation and artificial intelligence research has largely overlooked this aspect. To bridge this gap, our study introduces the Patent Office Action Response Intelligence System (PARIS) and its advanced version, the Large Language Model (LLM) Enhanced PARIS (LE-PARIS). These systems are designed to enhance the efficiency of patent attorneys in handling OA responses through collaboration with AI. The systems' key features include the construction of an OA Topics Database, development of Response Templates, and implementation of Recommender Systems and LLM-based Response Generation. To validate the effectiveness of the systems, we have employed a multi-paradigm analysis using the USPTO Office Action database and longitudinal data based on attorney interactions with our systems over six years. Through five studies, we have examined the constructiveness of OA topics (studies 1 and 2) using topic modeling and our proposed Delphi process, the efficacy of our proposed hybrid LLM-based recommender system tailored for OA responses (study 3), the quality of generate
Urban decarbonization is one of the pillars of strategies to achieve carbon neutrality around the world. However, the current speed of urban decarbonization is insufficient to keep pace with efforts to achieve this goal. Rooftop PVs integrated with electric vehicles (EVs) as batteries are a promising technology capable of supplying CO2-free, affordable, and dispatchable electricity in urban environments (SolarEV City Concept). Here, we evaluated Paris, France for the decarbonization potential of rooftop PV + EV in comparison to the surrounding suburban area Ile-de-France and Kyoto, Japan. We assessed various scenarios by calculating the energy sufficiency, self-consumption, self-sufficiency, cost savings, and CO2 emission reduction of the PV + EV system or PV only system. The combination of EVs with PVs by V2H or V2B systems at the city or region level was found to be more effective in Ile-de-France than in Paris, suggesting that SolarEV City is more effective for a geographically larger area that includes Paris. If implemented at a significant scale, they can add substantial value to rooftop PV economics and keep a high self-consumption and self-sufficiency, which also allows bypassing the
The escalating sophistication of cyber-attacks and the widespread utilization of stealth tactics have led to significant security threats globally. Nevertheless, the existing static detection methods exhibit limited coverage, and traditional dynamic monitoring approaches encounter challenges in bypassing evasion techniques. Thus, it has become imperative to implement nuanced and dynamic analysis to achieve precise behavior detection in real time. There are two pressing concerns associated with current dynamic malware behavior detection solutions. Firstly, the collection and processing of data entail a significant amount of overhead, making it challenging to be employed for real-time detection on the end host. Secondly, these approaches tend to treat malware as a singular entity, thereby overlooking varied behaviors within one instance. To fill these gaps, we propose PARIS, an adaptive trace fetching, lightweight, real-time malicious behavior detection system. Specifically, we monitor malicious behavior with Event Tracing for Windows (ETW) and learn to selectively collect maliciousness-related APIs or call stacks, significantly reducing the data collection overhead. As a result, we
This paper evaluates the sustainability of Advanced Air Mobility (AAM) in urban and regional mobility, using Paris as a case study. Paris is committed to eco-friendly transportation and has introduced AAM, including electric Vertical Take-Off and Landing (eVTOL) air taxis for the 2024 Olympic Games. We assess eVTOL energy consumption and CO$_2$ emissions on urban and regional routes, comparing them with cars, public transport, and helicopters. Urban eVTOLs save around 23 minutes over cars and 22 minutes over public transport on 50 km routes. For regional routes (300 km), eVTOLs save 76 minutes over cars and 69 minutes over trains. However, eVTOLs' eco-friendliness depends on context. In urban areas, they consume more energy than electric cars, but beat traditional helicopters by 47%. For regional travel, eVTOLs outperform helicopters and some cars but lag behind electric vehicles and trains. To maximize AAM's sustainability in Paris, stakeholders must consider real-world operations and integrate eVTOLs into the broader transportation system. This approach can lead to greener urban and regional transportation.
We address the task of simultaneous part-level reconstruction and motion parameter estimation for articulated objects. Given two sets of multi-view images of an object in two static articulation states, we decouple the movable part from the static part and reconstruct shape and appearance while predicting the motion parameters. To tackle this problem, we present PARIS: a self-supervised, end-to-end architecture that learns part-level implicit shape and appearance models and optimizes motion parameters jointly without any 3D supervision, motion, or semantic annotation. Our experiments show that our method generalizes better across object categories, and outperforms baselines and prior work that are given 3D point clouds as input. Our approach improves reconstruction relative to state-of-the-art baselines with a Chamfer-L1 distance reduction of 3.94 (45.2%) for objects and 26.79 (84.5%) for parts, and achieves 5% error rate for motion estimation across 10 object categories. Video summary at: https://youtu.be/tDSrROPCgUc
The paper concerns an experimental study of the wind pressures over the surface of a world-famous Gothic cathedral: Notre Dame of Paris. The experimental tests have been conducted in the CRIACIV wind tunnel, Prato (Italy), on a model of the Cathedral at the scale 1:200, reproducing the atmospheric boundary layer. Two types of tests have been conducted: with and without a model of the surroundings, i.e., the part of the city of Paris near the Cathedral. This has been done, on the one hand, to evaluate the effect of the surrounding buildings on the wind pressure distribution on the Cathedral and, on the other hand, to obtain a wind pressure distribution that is plausible for any other cathedral with a similar shape. The tests have been done for all wind directions, and the mean and peak pressures have been recorded. The results emphasize that the complex geometry of this type of structure is responsible for a peculiar aerodynamic behavior that does not allow the wind loads on the various parts of the Cathedral to be estimated correctly from codes and standards, which are tailored for ordinary regular buildings.
The temperature targets in the Paris Agreement cannot be met without very rapid reduction of greenhouse gas emissions and removal of carbon dioxide from the atmosphere. The latter requires large, perhaps prohibitively large subsidies. The central estimate of the costs of climate policy, unrealistically assuming least-cost implementation, is 3.8-5.6\% of GDP in 2100. The central estimate of the benefits of climate policy, unrealistically assuming constant vulnerability, is 2.8-3.2\% of GDP. The uncertainty about the benefits is larger than the uncertainty about the costs. The Paris targets do not pass the cost-benefit test unless risk aversion is high and discount rate low.
Data series similarity search is a core operation for several data series analysis applications across many different domains. Nevertheless, even state-of-the-art techniques cannot provide the time performance required for large data series collections. We propose ParIS and ParIS+, the first disk-based data series indices carefully designed to inherently take advantage of multi-core architectures, in order to accelerate similarity search processing times. Our experiments demonstrate that ParIS+ completely removes the CPU latency during index construction for disk-resident data, and for exact query answering is up to 1 order of magnitude faster than the current state of the art index scan method, and up to 3 orders of magnitude faster than the optimized serial scan method. ParIS+ (which is an evolution of the ADS+ index) owes its efficiency to the effective use of multi-core and multi-socket architectures, in order to distribute and execute in parallel both index construction and query answering, and to the exploitation of the Single Instruction Multiple Data (SIMD) capabilities of modern CPUs, in order to further parallelize the execution of instructions inside each core.
We studied the ${\bar p}$ interactions with the nuclear medium within the 2009 version of the Paris ${\bar N}N$ potential model. We constructed the $\bar{p}$--nucleus optical potential using the Paris $S$- and $P$-wave ${\bar p}N$ scattering amplitudes and treated their strong energy and density dependence self-consistently. We considered a phenomenological $P$-wave term as well. We calculated $\bar{p}$ binding energies and widths of the $\bar{p}$ bound in various nuclei. The $P$-wave potential has a very small effect on the calculated ${\bar p}$ binding energies; however, it reduces the corresponding widths noticeably. Moreover, the $S$-wave potential based on the Paris amplitudes supplemented by a phenomenological $P$-wave term yields ${\bar p}$ binding energies and widths in very good agreement with those obtained within the RMF model consistent with ${\bar p}$-atom data.
One of the main challenges that the Semantic Web faces is the integration of a growing number of independently designed ontologies. In this work, we present PARIS, an approach for the automatic alignment of ontologies. PARIS aligns not only instances, but also relations and classes. Alignments at the instance level cross-fertilize with alignments at the schema level. Thereby, our system provides a truly holistic solution to the problem of ontology alignment. The heart of the approach is probabilistic, i.e., we measure degrees of matchings based on probability estimates. This allows PARIS to run without any parameter tuning. We demonstrate the efficiency of the algorithm and its precision through extensive experiments. In particular, we obtain a precision of around 90% in experiments with some of the world's largest ontologies.
Miller-Paris transformations are extensions of Euler's transformations for the Gauss hypergeometric functions to generalized hypergeometric functions of higher order having integral parameter differences (IPD). In our recent work we computed the degenerate versions of these transformations corresponding to the case when one parameter difference is equal to a negative integer. The purpose of this paper is to present an independent new derivation of both the general and the degenerate forms of Miller-Paris transformations. In doing so we employ the generalized Stieltjes transform representation of the generalized hypergeometric functions and some partial fraction expansions. Our approach leads to different forms of the characteristic polynomials, one of which appears noticeably simpler than the original form due to Miller and Paris. We further present two extensions of the degenerate transformations to the generalized hypergeometric functions with additional free parameters and additional parameters with negative integral differences.
Web archives preserve portions of the web, but quantifying their completeness remains challenging. Prior approaches have estimated the coverage of a crawl by either comparing the outcomes of multiple crawlers, or by comparing the results of a single crawl to external ground truth datasets. We propose a method to estimate the absolute coverage of a crawl using only the archive's own longitudinal data, i.e., the data collected by multiple subsequent crawls. Our key insight is that coverage can be estimated from the empirical URL overlaps between subsequent crawls, which are in turn well described by a simple urn process. The parameters of the urn model can then be inferred from longitudinal crawl data using linear regression. Applied to our focused crawl configuration of the German Academic Web, with 15 semi-annual crawls between 2013-2021, we find a coverage of approximately 46 percent of the crawlable URL space for the stable crawl configuration regime. Our method is extremely simple, requires no external ground truth, and generalizes to any longitudinal focused crawl.
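The intuition that overlap between repeated crawls carries information about the size of the unseen URL space can be illustrated with the classical Lincoln-Petersen capture-recapture estimator. This is explicitly not the urn model or the regression fit described in the abstract, only a simpler, well-known relative of the same idea. It assumes that both crawls sample independently from one fixed URL population, and the function name `lincoln_petersen_coverage` is an assumption made for this example.

```python
def lincoln_petersen_coverage(crawl_a, crawl_b):
    """Estimate URL-space size and the coverage of crawl_a from two crawls.

    Lincoln-Petersen estimate: N ~= |A| * |B| / |A & B|.
    Returns (estimated_population, estimated_coverage_of_crawl_a).
    """
    a, b = set(crawl_a), set(crawl_b)
    overlap = len(a & b)
    if overlap == 0:
        raise ValueError("crawls share no URLs; the estimate is undefined")
    population = len(a) * len(b) / overlap
    return population, len(a) / population
```

For instance, two crawls of 60 URLs each that share 30 URLs give an estimated population of 120 URLs and a coverage of 0.5 for either crawl. Real crawls violate the independence assumption (popular URLs are found more often), which is one reason a fitted model over many crawls, as the abstract proposes, is preferable to this two-sample estimate.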