
Shantanu Singh – Monocular RGB

Shantanu Singh received his Master of Science in Computer Science and Engineering (CSE). His research work was supervised by Prof. Madhava Krishna. Here’s a summary of his research work on Extended Indoor Layout Estimation using Monocular RGB for Efficient Path Planning and Navigation:

In this work, we propose IndoLayout, a novel real-time approach for generating high-quality occupancy maps from an RGB image for indoor scenes. Such occupancy maps are crucial for path planning and mapping in indoor environments, but they are often built using only the information contained in the ego view. In contrast, our approach also predicts occupancy values beyond immediately visible regions from just a monocular image, leveraging learnt priors from indoor scenes. Hence, our proposed network can produce a hallucinated, amodal scene layout that includes areas occluded in the RGB image, such as navigable floor behind a desk. Specifically, we propose a novel architecture that uses self-attention and adversarial learning to vastly improve the quality of the predicted layout. We evaluate our model on several photorealistic indoor datasets and outperform previous relevant work on all metrics that measure layout quality, including newly adopted ones. Finally, we demonstrate the effectiveness of our method by showing that using IndoLayout yields significant improvements on the Point Goal navigation task over similar approaches.

 

June 2023
