Van Nguyen NGUYEN

I recently completed my PhD at IMAGINE team, École des Ponts ParisTech, advised by Prof. Vincent Lepetit. During my PhD, I also worked closely with Dr. Mathieu Salzmann and Dr. Thibault Groueix.

I was fortunate to spend time with great teams at Meta Reality Labs, worked with Tomas Hodan in 2024, and Pierre Moulon in 2022. I also actively participated in co-organizing BOP challenge 2024 and 9th R6D Workshop.

I graduated from the Master Mathématiques, Vision and Apprentissage (MVA) from École Normale Supérieure Paris-Saclay (with highest honours) and Engineering Program in Applied Mathematics at INSA Toulouse. I also had chance to spend time at EPFL, SIEMENS.

I am currently on the job market and actively looking opportunities starting from March 2025.

Email CV Github Scholar Linkedin

News

12/2024 PhD defended!
06/2024 BOP challenge 2024 has been opened! Check out the updates in CNOS and GigaPose.
06/2024 I am happy to join Meta Reality Labs as a research intern, working with Dr. Tomas Hodan and Dr. Christian Forster.
04/2024 I visited Prof. Ko Nishino at Kyoto University, Japan for one month!
04/2024 Accepted to CVPR 2024 Doctoral Consortium!
04/2024 BOP challenge 2023 report is out and accepted to CVPRW 2024.
02/2024 Our papers OSV5M, GigaPose, and NOPE are accepted to CVPR 2024.
10/2023 Our work CNOS is accepted to ICCV 2023 R6D. CNOS has been awarded as the best method for 2D detection method of unseen objects at BOP challenge.
08/2023 BOP challenge 2023 has been opened!

PhD thesis


Publications

BOP Challenge 2024 on Model-free, Model-based Detection, and Pose Estimation of Unseen Rigid Objects

Van Nguyen Nguyen, Stephen Tyree, Andrew Guo, Médéric Fourmy, Anas Gouda, Taeyeop Lee, Sungphill Moon, Hyeontae Son, Lukas Ranftl, Jonathan Tremblay, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Stan Birchfield, Jiri Matas, Yann Labbé, Martin Sundermeyer, Tomáš Hodaň

We introduce a new model-free variant of all tasks, define a new 6D object detection task, and introduce three new publicly available datasets

BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects

Tomas Hodan, Martin Sundermeyer, Yann Labbé, Van Nguyen Nguyen, Gu Wang, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Jiri Matas
CVPRW 2024

The report of BOP challenge 2023 on state-of-the-art methods for seen and unseen object pose estimation.

GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence

CVPR 2024

A "hybrid" template-patch correspondence approach that is fast, robust, and more accurate to estimate 6D pose of novel objects in RGB images. GigaPose predicts 6D object pose from a single 2D-to-2D correspondence.

NOPE: Novel Object Pose Estimation from a Single Image

CVPR 2024

A method that can estimate relative pose of unseen objects given only a single reference image. Our method also predicts 3D pose distribution which can be used to address pose ambiguities due to symmetries.

OpenStreetView-5M: The Many Roads to Global Visual Geolocation

Guillaume Astruc, Nicolas Dufour, Ioannis Siglidis, Constantin Aronssohn, Nacim Bouia, Stephanie Fu, Romain Loiseau, Van Nguyen Nguyen, Charles Raude, Elliot Vincent, Lintao Xu, Hongyu Zhou, Loic Landrieu
CVPR 2024

A new benchmark for visual geolocation (~Geoguessr).

CNOS: A Strong Baseline for CAD-based Novel Object Segmentation

ICCVW 2023

A method that can segment novel objects for a given RGB image from only their CAD models. Based on Segmenting Anything, DINOv2, CNOS is a strong baseline for Task 5 and 6 in the BOP challenge 2023.

PIZZA: A Powerful Image-only Zero-Shot Zero-CAD Approach to 6DoF Tracking

3DV Oral

A method for tracking the 6D motion of objects in RGB video sequences when neither training images nor even the 3D geometry of the objects is available.

Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions

CVPR 2022

A method that can recognize objects and estimate their 3D pose in color images even under partial occlusions. Our method requires neither a training phase on these objects nor real images depicting them, only their CAD models.

Academic services

Reviewing: CVPR (2024 2023 2022), ICCV (2023), ECCV (2024, 2022), IROS (2024), 3DV (2024), TPAMI (2022), BMVC (2021).

Teaching: Teaching: Image Processing and Artificial Vision (M1) at Ecole des Ponts ParisTech in 2021 & 2022