Graph-Theoretic Outlier Rejection: From Instance to Category-Level Perception

In this thesis, we study the problem of outlier pruning for robust estimation. Robust estimation is the workhorse for many perception problems, from object pose estimation to robot localization and mapping. In these problems, the robot has to estimate quantities of interest in the face of outliers....

Full description

Bibliographic Details
Main Author: Shi, Jingnan
Other Authors: Carlone, Luca
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access:https://hdl.handle.net/1721.1/139117
_version_ 1826205238978150400
author Shi, Jingnan
author2 Carlone, Luca
author_facet Carlone, Luca
Shi, Jingnan
author_sort Shi, Jingnan
collection MIT
description In this thesis, we study the problem of outlier pruning for robust estimation. Robust estimation is the workhorse for many perception problems, from object pose estimation to robot localization and mapping. In these problems, the robot has to estimate quantities of interest in the face of outliers. Such outliers can be the result of incorrect data association, and it is not unusual to have problems where more than 90% of the input measurements are outliers. Our first contribution is ROBIN (Reject Outliers Based on INvariants), a graphtheoretic approach that employs invariance to find mutually compatible measurements and prune outliers. ROBIN captures the mutual compatibility information by modeling measurements as vertices and mutual compatibility as edges in a compatibility graph. We generalize existing results showing that the inliers form a clique in this graph and typically belong to the maximum clique. We also provide a general definition of invariance for noisy measurements. We test ROBIN in various instance-level perception problems such as single rotation averaging and 3D point cloud registration. ROBIN boosts robustness of existing solvers (making them robust to more than 95% outliers), while running in milliseconds in large problems. With ROBIN developed, we then consider a category-level perception problem, where one is given 3D sensor data picturing an object of a given category (e.g., a car), and has to reconstruct the pose and shape of the object despite intra-class variability (i.e., different car models have different shapes). To solve this problem, we develop the first certifiably optimal solver for pose and shape estimation. We demonstrate that ROBIN can also be applied in this scenario, using compatibility checks based on convex hulls. We evaluate our approach through extensive experiments on both simulated and real datasets (PASCAL3D+ and ApolloScape), demonstrating that the resulting approach improves over the state of the art.
first_indexed 2024-09-23T13:09:52Z
format Thesis
id mit-1721.1/139117
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T13:09:52Z
publishDate 2022
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1391172022-01-15T03:28:40Z Graph-Theoretic Outlier Rejection: From Instance to Category-Level Perception Shi, Jingnan Carlone, Luca Massachusetts Institute of Technology. Department of Aeronautics and Astronautics In this thesis, we study the problem of outlier pruning for robust estimation. Robust estimation is the workhorse for many perception problems, from object pose estimation to robot localization and mapping. In these problems, the robot has to estimate quantities of interest in the face of outliers. Such outliers can be the result of incorrect data association, and it is not unusual to have problems where more than 90% of the input measurements are outliers. Our first contribution is ROBIN (Reject Outliers Based on INvariants), a graphtheoretic approach that employs invariance to find mutually compatible measurements and prune outliers. ROBIN captures the mutual compatibility information by modeling measurements as vertices and mutual compatibility as edges in a compatibility graph. We generalize existing results showing that the inliers form a clique in this graph and typically belong to the maximum clique. We also provide a general definition of invariance for noisy measurements. We test ROBIN in various instance-level perception problems such as single rotation averaging and 3D point cloud registration. ROBIN boosts robustness of existing solvers (making them robust to more than 95% outliers), while running in milliseconds in large problems. With ROBIN developed, we then consider a category-level perception problem, where one is given 3D sensor data picturing an object of a given category (e.g., a car), and has to reconstruct the pose and shape of the object despite intra-class variability (i.e., different car models have different shapes). To solve this problem, we develop the first certifiably optimal solver for pose and shape estimation. We demonstrate that ROBIN can also be applied in this scenario, using compatibility checks based on convex hulls. We evaluate our approach through extensive experiments on both simulated and real datasets (PASCAL3D+ and ApolloScape), demonstrating that the resulting approach improves over the state of the art. S.M. 2022-01-14T14:50:58Z 2022-01-14T14:50:58Z 2021-06 2021-06-16T13:27:08.153Z Thesis https://hdl.handle.net/1721.1/139117 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Shi, Jingnan
Graph-Theoretic Outlier Rejection: From Instance to Category-Level Perception
title Graph-Theoretic Outlier Rejection: From Instance to Category-Level Perception
title_full Graph-Theoretic Outlier Rejection: From Instance to Category-Level Perception
title_fullStr Graph-Theoretic Outlier Rejection: From Instance to Category-Level Perception
title_full_unstemmed Graph-Theoretic Outlier Rejection: From Instance to Category-Level Perception
title_short Graph-Theoretic Outlier Rejection: From Instance to Category-Level Perception
title_sort graph theoretic outlier rejection from instance to category level perception
url https://hdl.handle.net/1721.1/139117
work_keys_str_mv AT shijingnan graphtheoreticoutlierrejectionfrominstancetocategorylevelperception