Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/91632
Title: Using objects and scene attributes to estimate depth from single monocular images : relative depth estimation between two objects using semantic features
Authors: Cassar, Stefan (2021)
Keywords: Neural networks (Computer science)
Three-dimensional imaging
Computer vision
Issue Date: 2021
Citation: Cassar, S. (2021). Using objects and scene attributes to estimate depth from single monocular images : relative depth estimation between two objects using semantic features (Master’s dissertation).
Abstract: The depth of objects plays an integral part of several computer vision tasks. Using just a single image it is very difficult to obtain an accurate depth level without matching the scene with a Three-Dimensional (3D) representation of the image using any additional feature information of the scene. The state of the art systems in this area are complex Neural Network models trained on stereo image data to predict depth per pixel. Fortunately, in some tasks, only the relative depth between objects is required. This dissertation moves away from other approaches that make use of multiple images or scene reconstruction and explores the use of semantic information in the learning process of machine learning models to aid the learning and prediction of relative depth. The problem tackled in this dissertation is addressed using a classification approach where geometrical features based on the object bounding boxes, object labels and scene attributes are computed and extracted and used as inputs to pattern recognition machine learning models models to predict relative depth between the objects (i.e object is behind, in-front or objects are at equal depth). We evaluate the solution with the monodepth Convolutional Neural Network (CNN) trained on the Cityscapes dataset, and note a positive boost of 14% accuracy over the results reported by the monodepth depth prediction CNN. This result demonstrates that data driven depth estimation models are capable of producing results that are superior to other complex depth prediction mechanisms.
Description: M.Sc.(Melit.)
URI: https://www.um.edu.mt/library/oar/handle/123456789/91632
Appears in Collections:Dissertations - FacICT - 2021
Dissertations - FacICTAI - 2021

Files in This Item:
File Description SizeFormat 
21MAIPT006.pdf
  Restricted Access
3.17 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.