Pose Estimation of Transparent Objects via Depth Completion and Confidence-Guided Registration

Anonymous Authors
Dataset overview with registered objects.

Depth completion and confidence-guided registration improve pose estimation for transparent objects under diverse fill-state and material conditions.

Abstract

we propose VIUGIC, a Vision- and Uncertainty-Guided ICP framework that integrates shape recovery from large-scale depth estimation models with a confidence-aware registration stage.

Transparent objects pose persistent challenges for robotic perception due to refraction, specular reflection, and unreliable depth sensing, all of which degrade geometry reconstruction and pose estimation. Existing datasets seldom account for the transparency variations caused by fill state or mixed materials, limiting generalization to the diverse appearances found in real environments.

To address this, we introduce the HCTD dataset, a controlled transparent-object dataset that systematically varies transparency through material and fill-state conditions. Building on this resource, we propose VIUGIC, a Vision- and Uncertainty-Guided ICP framework that integrates shape recovery from large-scale depth estimation models with a confidence-aware registration stage. VIUGIC leverages dense geometry and per-point uncertainty provided by these models, incorporating the uncertainty into a weighted ICP objective that down-weights unreliable predictions near refractive boundaries and missing-depth regions. Experiments across multiple objects and transparency levels demonstrate that combining controlled data with uncertainty-guided registration substantially improves the stability and accuracy of transparent-object pose estimation.

HCTD Dataset

Figures extracted from the paper show the capture setup, object categories, fill-rate distribution, and annotation workflow.

Transparent-object capture setup.
Capture Setup
Transparent object dataset overview with masks.
Dataset Overview
Transparent object model categories.
Object Models
Data counts by object category and fill rate.
Fill-rate Counts
Blender annotation add-on from the paper.
Annotation Tool
Input RGB sample from the paper.
RGB Sample

Method Overview

VIUGIC first recovers dense object geometry from RGB observations using a large-scale depth estimation model. The recovered geometry is then registered to object models through a confidence-weighted ICP objective that reduces the influence of unreliable depth near transparent boundaries and missing regions.

Overview of the VIUGIC depth estimation, merge, confidence, and robust ICP pipeline.
1

RGB Observation

Capture transparent objects under controlled material and fill-state variations.

2

Depth Completion

Recover dense geometry and per-point uncertainty from a large-scale depth model.

3

Weighted ICP

Register object models while down-weighting uncertain refractive and missing-depth regions.

Registration Results

Ablation results chart extracted from the paper.

Printed Objects

MethodOverall
Ours46.58
w/o Confidence ICP48.78
Robust-ICP49.62
ICP52.37

Scanned Objects

MethodOverall
Ours10.46
w/o Confidence ICP12.02
Robust-ICP13.02
ICP14.28

BibTeX

@article{anonymous2026viugic,
  title  = {Pose Estimation of Transparent Objects via Depth Completion and Confidence-Guided Registration},
  author = {Anonymous Authors},
  year   = {2026}
}