搜索 — ResearchTracker

We propose NVS-HO, the first benchmark designed for novel view synthesis of handheld objects in real-world environments using only RGB inputs. Each object is recorded in two complementary RGB sequences: (1) a handheld sequence, where the object is manipulated in front of a static camera, and (2) a board sequence, where the object is fixed on a ChArUco board to provide accurate camera poses via marker detection. The goal of NVS-HO is to learn a NVS model that captures the full appearance of an object from (1), whereas (2) provides the ground-truth images used for evaluation. To establish baselines, we consider both a classical SfM pipeline and a state-of-the-art pre-trained feed-forward neural network (VGGT) as pose estimators, and train NVS models based on NeRF and Gaussian Splatting. Our experiments reveal significant performance gaps in current methods under unconstrained handheld conditions, highlighting the need for more robust approaches. NVS-HO thus offers a challenging real-world benchmark to drive progress in RGB-based novel view synthesis of handheld objects.

SelfHVD: Self-Supervised Handheld Video Deblurring

arXiv2025-08-12作者：Honglei Xu, Zhilu Zhang, Junjie Fan

Shooting video with handheld shooting devices often results in blurry frames due to shaking hands and other instability factors. Although previous video deblurring methods have achieved impressive progress, they still struggle to perform satisfactorily on real-world handheld video due to the blur domain gap between training and testing data. To address the issue, we propose a self-supervised method for handheld video deblurring, which is driven by sharp clues in the video. First, to train the deblurring model, we extract the sharp clues from the video and take them as misalignment labels of neighboring blurry frames. Second, to improve the deblurring ability of the model, we propose a novel Self-Enhanced Video Deblurring (SEVD) method to create higher-quality paired video data. Third, we propose a Self-Constrained Spatial Consistency Maintenance (SCSCM) method to regularize the model, preventing position shifts between the output and input frames. Moreover, we construct synthetic and real-world handheld video datasets for handheld video deblurring. Extensive experiments on these and other common real-world datasets demonstrate that our method significantly outperforms existing self

搜索结果：Handheld

NVS-HO: A Benchmark for Novel View Synthesis of Handheld Objects

SelfHVD: Self-Supervised Handheld Video Deblurring

Diffusion Autoencoder for Unsupervised Artifact Restoration in Handheld Fundus Images

Training-free image style alignment for self-adapting domain shift on handheld ultrasound devices

IFNet: Deep Imaging and Focusing for Handheld SAR with Millimeter-wave Signals

WeeCare: Towards Handheld Bladder Fullness Sensing with a Conformable Pad

The Handheld and Hand Powered Homopolar Generator

A Fast and Accurate 3-D Reconstruction Algorithm for Near-Range Microwave Imaging with Handheld Synthetic Aperture Radar

GHAR: GeoPose-based Handheld Augmented Reality for Architectural Positioning, Manipulation and Visual Exploration

FoodTrack: Estimating Handheld Food Portions with Egocentric Video

Rydberg atom reception of a handheld UHF frequency-modulated two-way radio

Portable Biomechanics Laboratory: Clinically Accessible Movement Analysis from a Handheld Smartphone

LUMIA: A Handheld Vision-to-Music System for Real-Time, Embodied Composition

Towards Intuitive Drone Operation Using a Handheld Motion Controller

Generating Fit Check Videos with a Handheld Camera

MFSR-GAN: Multi-Frame Super-Resolution with Handheld Motion Modeling

Haptic Stylus vs. Handheld Controllers: A Comparative Study for Surface Visualization Interactions

Perceptually Equivalent Resolution in Handheld Devices for Streaming Bandwidth Saving

Handheld Video Document Scanning: A Robust On-Device Model for Multi-Page Document Scanning

Handheld Haptic Device with Coupled Bidirectional Input