搜索 — ResearchTracker

Dynamic 3D hand reconstruction from egocentric videos is essential for next-generation computing platforms such as AR/VR and AI glasses. Despite its importance, most prior works focus either on multi-view 3D hand reconstruction or on 4D human body reconstruction. Egocentric 4D hand reconstruction remains challenging due to fast head motion, rapid hand dynamics, severe occlusions, and inherent ambiguity from single-view observations. To address these challenges, we introduce Hand-4DGS, the first feed-forward framework for reconstructing dynamic 4D hands directly from egocentric videos, enabling both fast (~60 FPS) inference and strong generalization. Our approach incorporates a mesh-guided representation for structural priors and temporal convolutions to model dynamic motion. We evaluate our framework on two challenging egocentric datasets, H2O and ARCTIC, and demonstrate significant improvements over baselines. Our method benefits from the generalization capability of feed-forward networks and effective 2D image supervision through Gaussian splatting, without requiring expensive 3D hand pose ground-truth annotations.

Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation

arXiv2025-09-01作者：Lee Chae-Yeon, Nam Hyeon-Woo, Tae-Hyun Oh

3D hand pose estimation is a fundamental task in understanding human hands. However, accurately estimating 3D hand poses remains challenging due to the complex movement of hands, self-similarity, and frequent occlusions. In this work, we address two limitations: the inability of existing 3D hand pose estimation methods to estimate aleatoric (data) uncertainty, and the lack of uncertainty modeling that incorporates joint correlation knowledge, which has not been thoroughly investigated. To this end, we introduce aleatoric uncertainty modeling into the 3D hand pose estimation framework, aiming to achieve a better trade-off between modeling joint correlations and computational efficiency. We propose a novel parameterization that leverages a single linear layer to capture intrinsic correlations among hand joints. This is enabled by formulating the hand joint output space as a probabilistic distribution, allowing the linear layer to capture joint correlations. Our proposed parameterization is used as a task head layer, and can be applied as an add-on module on top of the existing models. Our experiments demonstrate that our parameterization for uncertainty modeling outperforms existing

搜索结果：Hand (New York, N.Y.)

Hand-4DGS: Feed-Forward 3D Gaussian Splatting for 4D Hand Reconstruction from Egocentric Videos

Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation

VGG Induced Deep Hand Sign Language Detection

HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition

Multi-view Hand Reconstruction with a Point-Embedded Transformer

Articulated Hand Pose Estimation Review

Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time

3D Object Reconstruction from Hand-Object Interactions

A Vision-Based Analysis of Congestion Pricing in New York City

Egocentric Hand Track and Object-based Human Action Recognition

New York Smells: A Large Multimodal Dataset for Olfaction

A System for General In-Hand Object Re-Orientation

V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation from a Single Depth Map

York time in JT gravity

Learning Dexterous In-Hand Manipulation

Doctor Imitator: Hand-Radiography-based Bone Age Assessment by Imitating Scoring Methods

Man, these New York Times games are hard! A computational perspective

Hot Hands, Streaks and Coin-flips: Numerical Nonsense in the New York Times

Denotational Semantics of Gradual Typing using Synthetic Guarded Domain Theory (Extended Version)

Estimating the Number of Street Vendors in New York City: Ratio Estimation with Point Process Data