The USC Andrew and Erna Viterbi School of Engineering USC Signal and Image Processing Institute USC Ming Hsieh Department of Electrical and Computer Engineering University of Southern California

Technical Report USC-SIPI-459

“Green Learning for 3D Point Cloud Data Processing”

by Pranav Kadam

May 2023

3D Point Cloud processing and analysis has attracted a lot of attention in present times due to the numerous applications such as in autonomous driving, computer graphics, and robotics. In this dissertation, we focus on the problems of point cloud registration, pose estimation, rotation invariant classification, odometry and scene flow estimation. These tasks are important in the realization of a 3D vision system. Rigid registration aims at finding a 3D transformation consisting of rotation and translation that optimally aligns two point clouds. The next two tasks focus on object-level analysis. For pose estimation, we predict the 6-DOF pose of an object with respect to a chosen frame of reference. Rotation invariant classification aims at classifying 3D objects which are arbitrarily rotated. The latter two problems are for outdoor environments. In odometry, we want to estimate the incremental motion of an object using the point cloud scans captured by it at every instance. While the scene flow estimation task aims at determining the point-wise flow between two consecutive point clouds. 3D perception using point clouds is dominated by deep learning methods nowadays. However, large scale learning on point clouds with deep learning techniques has several issues which are often overlooked. This research is based on the green learning (GL) paradigm and focuses on interpretability, smaller training times and smaller model size. Using GL, we separate the feature learning process from the decision. Features are derived in an unsupervised feedforward manner from the statistics of the training data. For the decision part, we mainly use well established model-free techniques which are optimized during inference. When the decision process involves classification, a lightweight classifier is trained. Overall, the proposed methods can be trained within an hour on CPUs and the number of model parameters are much fewer than deep learning methods. These advantages are promising keeping in mind applications that demand low power and complexity, such as in edge computing.

To download the report in PDF format click here: USC-SIPI-459.pdf (15.7Mb)