[Week 1] Research Project Journey - 6 Degree of Freedom Object Pose Estimation

Ty Nguyen
2 min readDec 10, 2020

After working on various projects during my Ph.D., I decided to write down my experience on a new project that I’ve just started for a few weeks and share it publically.

Vision-based car rotation and translation estimation pipeline. Courtesy of Wu et al.[3]

A brief introduction about myself: I’m a 5th-year Ph.D. student in Computer Science at Penn. Throughout my Ph.D., I’ve worked on different machine perception problems, from tomography estimation, depth estimation to object detection. I’ve recently published a paper on multi-agent localization using pair-wise measurement. Describing myself as an applied science researcher, I enjoyed working on different projects that aim to equip robots and intelligent agents with the capability to sense the world using cameras and LiDARs. This time, my goal is to estimate 6Dof of an object pose using monocular RGB images.

This week, I’ve spent time reading a couple of papers including:

  1. He, Yisheng, et al. “PVN3D: A deep point-wise 3D keypoints voting network for 6DoF pose estimation.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020.
  2. Li, Zhigang, et al. “Robust RGB-based 6-DoF Pose Estimation without Real Pose Annotations.” arXiv preprint arXiv:2008.08391 (2020).
  3. Wu, Di, et al. “6d-vnet: End-to-end 6-dof vehicle pose estimation from monocular rgb images.” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 2019.

I found that Wu et.all[3] look closest to what I want to solve. Therefore, I’ve read it in more details and started cloning their codebase to experiment with. I’ll update on the experiment later.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Ty Nguyen
Ty Nguyen

Written by Ty Nguyen

A Ph.D. in CS at the University of Pennsylvania. I have been working on deep learning for robotics machine perception

No responses yet

Write a response