共找到 20 条结果
The Harder-Narasimhan theory provides a canonical filtration of a vector bundle on a projective curve whose successive quotients are semistable with strictly decreasing slopes. In this article, we present the formalization of Harder-Narasimhan theory in the proof assistant Lean 4 with Mathlib. This formalization is based on a recent approach of Harder-Narasimhan theory by Chen and Jeannin, which reinterprets the theory in order-theoretic terms and avoids the classical dependence on algebraic geometry. As an application, we formalize the uniqueness of coprimary filtration of a finitely generated module over a noetherian ring, and the existence of the Jordan-Hölder filtration of a semistable Harder-Narasimhan game. Code available at: https://github.com/YijunYuan/HarderNarasimhan
Large Language Models (LLMs) excel in general tasks, but adapting them to specialized domains relies on high-quality supervised fine-tuning (SFT) data. Although existing methods can identify subsets of high-quality data and reduce training cost to some extent, their selection process still suffers from over-reliance on LLMs' internal knowledge, weak interpretability, and limited generalization. To address these limitations, we propose THTB (The Harder The Better), a cognitive science-inspired framework for instruction data selection and annotation guidance. THTB prioritizes higher-level cognitive instructions by combining quality filtering with intrinsic and extrinsic hardness scoring, offering interpretable and quantifiable criteria for efficient SFT, both in data selection and annotation guidance. Experiments show that THTB enables models trained on only 5% of the data to outperform full-dataset training, while achieving superior generalization compared with LLM-only selection. In addition, THTB provides effective annotation guidance in vertical domains, enabling a model trained on just 2% of the data to surpass models trained on much larger datasets, demonstrating strong potenti
We prove that for any simply connected isotropic reductive group G over a Dedekind domain D, any Zariski-locally trivial principal G-bundle over D is trivial. The corresponding result for quasi-split groups was proved in 1967 by G. Harder.
In this article, we study the notion of semi-stability and the Harder-Narasimhan filtration from a game-theoretic point of view. This allows us to provide a unified proof for the existence and uniqueness of the Harder-Narasimhan filtration in various settings and offer a conceptual interpretation for semi-stability conditions. As an application, we establish the existence and uniqueness of a coprimary filtration for modules of finite type over a commutative Noetherian ring and interpret it as a Harder-Narasimhan filtration.
For any flat family of pure-dimensional coherent sheaves on a family of projective schemes, the Harder-Narasimhan type (in the sense of Gieseker semistability) of its restriction to each fiber is known to vary semicontinuously on the parameter scheme of the family. This defines a stratification of the parameter scheme by locally closed subsets, known as the Harder-Narasimhan stratification. In this note, we show how to endow each Harder-Narasimhan stratum with the structure of a locally closed subscheme of the parameter scheme, which enjoys the universal property that under any base change the pullback family admits a relative Harder-Narasimhan filtration with a given Harder-Narasimhan type if and only if the base change factors through the schematic stratum corresponding to that Harder-Narasimhan type. The above schematic stratification induces a stacky stratification on the algebraic stack of pure-dimensional coherent sheaves. We deduce that coherent sheaves of a fixed Harder-Narasimhan type form an algebraic stack in the sense of Artin.
While generating better negative samples for contrastive learning has been widely studied in the areas of CV and NLP, very few work has focused on graph-structured data. Recently, Mixup has been introduced to synthesize hard negative samples in graph contrastive learning (GCL). However, due to the unsupervised learning nature of GCL, without the help of soft labels, directly mixing representations of samples could inadvertently lead to the information loss of the original hard negative and further adversely affect the quality of the newly generated harder negative. To address the problem, in this paper, we propose a novel method DropMix to synthesize harder negative samples, which consists of two main steps. Specifically, we first select some hard negative samples by measuring their hardness from both local and global views in the graph simultaneously. After that, we mix hard negatives only on partial representation dimensions to generate harder ones and decrease the information loss caused by Mixup. We conduct extensive experiments to verify the effectiveness of DropMix on six benchmark datasets. Our results show that our method can lead to better GCL performance. Our data and cod
Knowledge graph embedding~(KGE) aims to represent entities and relations into low-dimensional vectors for many real-world applications. The representations of entities and relations are learned via contrasting the positive and negative triplets. Thus, high-quality negative samples are extremely important in KGE. However, the present KGE models either rely on simple negative sampling methods, which makes it difficult to obtain informative negative triplets; or employ complex adversarial methods, which requires more training data and strategies. In addition, these methods can only construct negative triplets using the existing entities, which limits the potential to explore harder negative triplets. To address these issues, we adopt mixing operation in generating harder negative samples for knowledge graphs and introduce an inexpensive but effective method called MixKG. Technically, MixKG first proposes two kinds of criteria to filter hard negative triplets among the sampled negatives: based on scoring function and based on correct entity similarity. Then, MixKG synthesizes harder negative samples via the convex combinations of the paired selected hard negatives. Experiments on two p
The notion of Harder-Narasimhan filtration was firstly introduced by Harder and Narasimhan in the setting of vector bundles on a non-singular projective curve. Curiously, analogous constructions have been discovered in other branches of mathematics which motivate categorical constructions of Harder-Narasimhan filtration. In this article, we introduce a categorical construction of Harder-Narasimhan filtration via slope function method which does not need the additive condition in degree function. We also give a method to prove the existence and uniqueness theorem of the Harder-Narasimhan filtration intrinsically from a set E and a admissible collection of subsets of E, which does not need sub-quotient structure.
We propose a generalization of Quillen's exact category -- arithmetic exact category and we discuss conditions on such categories under which one can establish the notion of Harder-Narasimhan filtrations and Harder-Narsimhan polygons. Furthermore, we show the functoriality of Harder-Narasimhan filtrations (indexed by $\mathbb R$), which can not be stated in the classical setting of Harder and Narasimhan's formalism.
We establish in this article convergence results of normalized Harder-Narasimhan polygons both in geometric and in arithmetic frameworks by introducing the Harder-Narasimhan filtration indexed by $\mathbb R$ and the associated Borel probability measure.
Methods of Harder and Narasimhan from the theory of moduli of vector bundles are applied to moduli of quiver representations. Using the Hall algebra approach to quantum groups, an analog of the Harder-Narasimhan recursion is constructed inside the quantized enveloping algebra of a Kac-Moody algebra. This leads to a canonical orthogonal system, the HN system, in this algebra. Using a resolution of the recursion, an explicit formula for the HN system is given. As an application, explicit formulas for Betti numbers of the cohomology of quiver moduli are derived, generalizing several results on the cohomology of quotients in 'linear algebra type' situations.
The paper gives an example of a tree language G that is recognised by an unambiguous parity automaton and is analytic-complete as a set in Cantor space. This already shows that the unambiguous languages are topologically more complex than the deterministic ones, that are all coanalytic. Using set G as a building block we construct an unambiguous language that is topologically harder than any countable boolean combination of analytic and coanalytic sets. In particular the language is harder than any set in difference hierarchy of analytic sets considered by O.Finkel and P.Simonnet in the context of nondeterministic automata.
Triplet loss is a widely adopted loss function in ReID task which pulls the hardest positive pairs close and pushes the hardest negative pairs far away. However, the selected samples are not the hardest globally, but the hardest only in a mini-batch, which will affect the performance. In this report, a hard batch mining method is proposed to mine the hardest samples globally to make triplet harder. More specifically, the most similar classes are selected into a same mini-batch so that the similar classes could be pushed further away. Besides, an adversarial scene removal module composed of a scene classifier and an adversarial loss is used to learn scene invariant feature representations. Experiments are conducted on dataset MSMT17 to prove the effectiveness, and our method surpasses all of the previous methods and sets state-of-the-art result.
We present Activation- and Influence-Aware Ranks (AIR), an SVD-based LLM compression framework that guides each weight matrix's low-rank approximation with a backward-signal influence metric. Starting from the activation-aware optimum of SVD-LLM(W), AIR runs a single closed-form alternating least squares (ALS) sweep that integrates influence element-wise under a monotone-descent guarantee. AIR is layer-local and composes orthogonally with end-to-end methods: alone it exceeds ACIP, and AIR+LoRA outperforms it further. AIR improves perplexity over SVD-LLM(W) by >18% at <=60% parameter retention, matches its quality with ~90% less calibration data, and turns parameter savings into FLOP, peak-memory, and per-token latency gains.
The Copernicus Atmospheric Monitoring Service provides reanalysis products for atmospheric composition by combining model simulations with satellite observations. The quality of these products depends strongly on the availability of the observational data, which can vary over time as new satellite instruments become available or are discontinued, such as Carbon Monoxide (CO) observations of the Measurements Of Pollution In The Troposphere (MOPITT) satellite in early 2025. Machine learning offers a promising approach to compensate for such data losses by learning systematic discrepancies between model configurations. In this study, we investigate machine learning methods to predict monthly-mean total column of Carbon Monoxide re-analysis from a control model simulation.
The field of high-speed autonomous racing has seen significant advances in recent years, with the rise of competitions such as RoboRace and the Indy Autonomous Challenge providing a platform for researchers to develop software stacks for autonomous race vehicles capable of reaching speeds in excess of 170 mph. Ensuring the safety of these vehicles requires the software to continuously monitor for different faults and erroneous operating conditions during high-speed operation, with the goal of mitigating any unreasonable risks posed by malfunctions in sub-systems and components. This paper presents a comprehensive overview of the HALO safety architecture, which has been implemented on a full-scale autonomous racing vehicle as part of the Indy Autonomous Challenge. The paper begins with a failure mode and criticality analysis of the perception, planning, control, and communication modules of the software stack. Specifically, we examine three different types of faults - node health, data health, and behavioral-safety faults. To mitigate these faults, the paper then outlines HALO safety archetypes and runtime monitoring methods. Finally, the paper demonstrates the effectiveness of the
Scenario Description Languages (SDLs) provide structured, interpretable embeddings that represent traffic scenarios encountered by autonomous vehicles (AVs), supporting key tasks such as scenario similarity searches and edge case detection for safety analysis. This paper introduces the Trajectory-to-Action Pipeline (TAP), a scalable and automated method for extracting SDL labels from large trajectory datasets. TAP applies a rules-based cross-entropy optimization approach to learn parameters directly from data, enhancing generalization across diverse driving contexts. Using the Waymo Open Motion Dataset (WOMD), TAP achieves 30% greater precision than Average Displacement Error (ADE) and 24% over Dynamic Time Warping (DTW) in identifying behaviorally similar trajectories. Additionally, TAP enables automated detection of unique driving behaviors, streamlining safety evaluation processes for AV testing. This work provides a foundation for scalable scenario-based AV behavior analysis, with potential extensions for integrating multi-agent contexts.
This paper is concerned with probabilistic techniques for forecasting dynamical systems described by partial differential equations (such as, for example, the Navier-Stokes equations). In particular, it is investigating and comparing various extensions to the flow matching paradigm that reduce the number of sampling steps. In this regard, it compares direct distillation, progressive distillation, adversarial diffusion distillation, Wasserstein GANs and rectified flows. Moreover, experiments are conducted on a set of challenging systems. In particular, we also address the challenge of directly predicting 2D slices of large-scale 3D simulations, paving the way for efficient inflow generation for solvers.
We extend the mirror construction of singular Calabi-Yau double covers, introduced by Hosono, Lee, Lian, and Yau, to a broader class of singular Calabi-Yau $(\mathbb{Z}/2)^k$-Galois covers, and prove Hodge number duality for both the original and extended mirror pairs. A main tool in our approach is an analogue of the Cayley trick, which relates the de Rham complex of the branched covers to the twisted de Rham complex of certain Landau-Ginzburg models. In particular, it reveals direct relations between the Hodge numbers of the covers and the irregular Hodge numbers of the associated Landau-Ginzburg models. This construction is independent of mirror symmetry and may be of independent interest.
Deep learning offers promising capabilities for the statistical downscaling of climate and weather forecasts, with generative approaches showing particular success in capturing fine-scale precipitation patterns. However, most existing models are region-specific, and their ability to generalize to unseen geographic areas remains largely unexplored. In this study, we evaluate the generalization performance of generative downscaling models across diverse regions. Using a global framework, we employ ERA5 reanalysis data as predictors and IMERG precipitation estimates at $0.1^\circ$ resolution as targets. A hierarchical location-based data split enables a systematic assessment of model performance across 15 regions around the world.