Search - ResearchTracker

A key challenge in Visual Place Recognition (VPR) is matching query images against reference maps captured under diverse environmental conditions and viewpoints. While multiple reference traversals improve robustness, existing fusion strategies either aggregate references uniformly or rely on heuristic selection, without distinguishing descriptor variations that preserve stable place identity from those caused by changing conditions or viewpoints. In this paper, we propose DisPlace, a multi-reference VPR framework that fuses multiple reference descriptors into a single compact and discriminative place representation. DisPlace formulates descriptor fusion as a generalized eigenvalue problem that maximizes between-place separability while suppressing within-place variation across references, rather than preserving overall descriptor variance. Unlike existing multi-reference fusion methods, DisPlace exploits variation across reference traversals to identify which linear combinations of descriptor dimensions preserve place identity and which capture condition- or viewpoint-specific variation. We evaluate DisPlace on Oxford RobotCar, Nordland, Pittsburgh30k, and Google Landmarks v2 acro

Leveraging Symmetries in Pick and Place

arXiv, 2023-08-15. Authors: Haojie Huang, Dian Wang, Arsh Tangri

Robotic pick and place tasks are symmetric under translations and rotations of both the object to be picked and the desired place pose. For example, if the pick object is rotated or translated, then the optimal pick action should also rotate or translate. The same is true for the place pose; if the desired place pose changes, then the place action should also transform accordingly. A recently proposed pick and place framework known as Transporter Net captures some of these symmetries, but not all. This paper analytically studies the symmetries present in planar robotic pick and place and proposes a method of incorporating equivariant neural models into Transporter Net in a way that captures all symmetries. The new model, which we call Equivariant Transporter Net, is equivariant to both pick and place symmetries and can immediately generalize pick and place knowledge to different pick and place poses. We evaluate the new model empirically and show that it is much more sample efficient than the non-symmetric version, resulting in a system that can imitate demonstrated pick and place behavior using very few human demonstrations on a variety of imitation learning tasks.
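The symmetry described in this abstract (if the object is rotated or translated, the pick action should transform the same way) can be illustrated with a toy check. This is a minimal sketch, not the paper's Equivariant Transporter Net: `toy_pick`, `transform`, and `is_equivariant` are hypothetical names, and the "policy" is simply the centroid of the object's points, chosen because it is known to commute with planar rotations and translations.

```python
import math

def transform(point, theta, shift):
    """Rotate a 2D point by theta (radians) about the origin, then translate by shift."""
    x, y = point
    c, s = math.cos(theta), math.sin(theta)
    return (c * x - s * y + shift[0], s * x + c * y + shift[1])

def toy_pick(points):
    """Hypothetical pick policy (an assumption for illustration): grasp at the centroid."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)

def is_equivariant(points, theta, shift, tol=1e-9):
    """Check that picking after moving the object equals moving the original pick."""
    moved_then_picked = toy_pick([transform(p, theta, shift) for p in points])
    picked_then_moved = transform(toy_pick(points), theta, shift)
    return all(abs(a - b) <= tol for a, b in zip(moved_then_picked, picked_then_moved))

# A rectangular object described by its four corners.
obj = [(0.0, 0.0), (2.0, 0.0), (2.0, 1.0), (0.0, 1.0)]
print(is_equivariant(obj, math.pi / 3, (0.5, -1.0)))
```

A learned policy that lacks this property would have to see demonstrations at many object poses to cover them all, which is consistent with the abstract's claim that building the symmetry into the model improves sample efficiency.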

Search results: Place

DisPlace: Discriminative Place Projections for Multi-Reference Visual Place Recognition

Leveraging Symmetries in Pick and Place

In-Place Test-Time Training

Pick and Place Planning is Better than Pick Planning then Place Planning

Going Places: Place Recognition in Artificial and Natural Systems

Place-it-R1: Unlocking Environment-aware Reasoning Potential of MLLM for Video Object Insertion

Improving Condition- and Environment-Invariant Place Recognition with Semantic Place Categorization

Place recognition: An Overview of Vision Perspective

Learning to Place New Objects

Branching Place Bisimilarity

Task adaptation of Vision-Language-Action model: 1st Place Solution for the 2025 BEHAVIOR Challenge

Precise Pick-and-Place using Score-Based Diffusion Networks

Place Bisimilarity is Decidable, Indeed!

Place Deduplication with Embeddings

1st Place Solution of WWW 2025 EReL@MIR Workshop Multimodal CTR Prediction Challenge

Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS

Unsupervised Place Discovery for Place-Specific Change Classifier

Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation

Collective behavior of place and non-place neurons in the hippocampal network

3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW