Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/18507
Title: | Predicting spatial relationships for image descriptions |
Authors: | Birmingham, Brandon |
Keywords: | Image processing Algorithms Decision trees |
Issue Date: | 2016 |
Abstract: | This dissertation studies and demonstrates how machine generated spatial relationships between image objects can improve the accuracy in retrieval-based image caption generation systems. To this end, a Random Forest Tree based spatial preposition predictive model is developed. This model outperforms current best preposition prediction accuracy rates. The main contribution of this dissertation lies on a proposed image description framework that casts the generation of image descriptions as a combination of generation-retrieval process. In contrast to all current retrieval-based methods, the suggested novel approach is designed to extract image descriptions from the endless multimedia content found on the Web. The system is evaluated by both human and computational methods. Results show that the proposed image description system is highly competitive to current state-of-art retrieval-based image description algorithms. Good results were achieved when describing images containing two image objects with object labels connected by spatial prepositions, while the web-retrieval based approach was notably effective when describing single-object images. |
Description: | M.SC.COMPUTER SCIENCE |
URI: | https://www.um.edu.mt/library/oar//handle/123456789/18507 |
Appears in Collections: | Dissertations - FacICT - 2016 Dissertations - FacICTCS - 2016 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
16MCSFT005.pdf Restricted Access | 4.14 MB | Adobe PDF | View/Open Request a copy |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.