搜索 — ResearchTracker

The goal of Open-Vocabulary Compositional Zero-Shot Learning (OV-CZSL) is to recognize attribute-object compositions in the open-vocabulary setting, where compositions of both seen and unseen attributes and objects are evaluated. Recently, prompt tuning methods have demonstrated strong generalization capabilities in the closed setting, where only compositions of seen attributes and objects are evaluated, i.e., Compositional Zero-Shot Learning (CZSL). However, directly applying these methods to OV-CZSL may not be sufficient to generalize to unseen attributes, objects and their compositions, as it is limited to seen attributes and objects. Normally, when faced with unseen concepts, humans adopt analogies with seen concepts that have the similar semantics thereby inferring their meaning (e.g., "wet" and "damp", "shirt" and "jacket"). In this paper, we experimentally show that the distribution of semantically related attributes or objects tends to form consistent local structures in the embedding space. Based on the above structures, we propose Structure-aware Prompt Adaptation (SPA) method, which enables models to generalize from seen to unseen attributes and objects. Specifically, in

Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning

arXiv2025-10-06作者：Xiaomeng Fan, Yuchuan Mao, Zhi Gao

Open-vocabulary learning requires modeling the data distribution in open environments, which consists of both seen-class and unseen-class data. Existing methods estimate the distribution in open environments using seen-class data, where the absence of unseen classes makes the estimation error inherently unidentifiable. Intuitively, learning beyond the seen classes is crucial for distribution estimation to bound the estimation error. We theoretically demonstrate that the distribution can be effectively estimated by generating unseen-class data, through which the estimation error is upper-bounded. Building on this theoretical insight, we propose a novel open-vocabulary learning method, which generates unseen-class data for estimating the distribution in open environments. The method consists of a class-domain-wise data generation pipeline and a distribution alignment algorithm. The data generation pipeline generates unseen-class data under the guidance of a hierarchical semantic tree and domain information inferred from the seen-class data, facilitating accurate distribution estimation. With the generated data, the distribution alignment algorithm estimates and maximizes the posterio

搜索结果：seen

Structure-aware Prompt Adaptation from Seen to Unseen for Open-Vocabulary Compositional Zero-Shot Learning

Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning

Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing

Seen-to-Scene: Keep the Seen, Generate the Unseen for Video Outpainting

Leveraging Seen and Unseen Semantic Relationships for Generative Zero-Shot Learning

The Impact of Model Scaling on Seen and Unseen Language Performance

BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects

The time-domain gamma-ray sky seen by the Fermi-LAT

Estimating See and Be Seen Performance with an Airborne Visual Acquisition Model

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning

Bias-Awareness for Zero-Shot Learning the Seen and Unseen

Distribution of angles to lattice points seen from a fast moving observer

Towards Zero-Shot Learning with Fewer Seen Class Examples

SEEN: Sharpening Explanations for Graph Neural Networks using Explanations from Neighborhoods

The structure and dynamics of a bright point as seen with Hinode, SoHO and TRACE

The $X(3960)$ seen in $D_{s}^{+} D_{s}^{-}$ as the $X(3930)$ state seen in $ D^{+} D^{-} $

Wobbly discs -- corrugations seen in the dust lanes of edge-on galaxies

Generic decoding of seen and imagined objects using hierarchical visual features

Convergence of the Environment Seen from Geodesics in Exponential Last-Passage Percolation

The Explosion in Orion-KL as Seen by Mosaicking the Magnetic Field with ALMA