Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/85819
Title: Face2Text : collecting an annotated image description corpus for the generation of rich face descriptions
Authors: Gatt, Albert
Tanti, Marc
Muscat, Adrian
Paggio, Patrizia Pagg
Farrugia, Reuben A.
Borg, Claudia
Camilleri, Kenneth P.
Rosner, Michael
van der Plas, Lonneke
Keywords: Face perception
Natural language generation (Computer science)
Crowdsourcing
Issue Date: 2018
Publisher: LREC
Citation: Gatt, A., Tanti, M., Muscat, A., Paggio, P., Farrugia, R. A., Borg, C., ... & Van der Plas, L. (2018). Face2text: Collecting an annotated image description corpus for the generation of rich face descriptions. International Conference on Language Resources and Evaluation, Miyazaki. 3323-3328.
Abstract: The past few years have witnessed renewed interest in NLP tasks at the interface between vision and language. One intensively-studied problem is that of automatically generating text from images. In this paper, we extend this problem to the more specific domain of face description. Unlike scene descriptions, face descriptions are more fine-grained and rely on attributes extracted from the image, rather than objects and relations. Given that no data exists for this task, we present an ongoing crowdsourcing study to collect a corpus of descriptions of face images taken ‘in the wild’. To gain a better understanding of the variation we find in face description and the possible issues that this may raise, we also conducted an annotation study on a subset of the corpus. Primarily, we found descriptions to refer to a mixture of attributes, not only physical, but also emotional and inferential, which is bound to create further challenges for current image-to-text methods.
URI: https://www.um.edu.mt/library/oar/handle/123456789/85819
Appears in Collections:Scholarly Works - FacICTCCE

Files in This Item:
File Description SizeFormat 
L18-1525.pdf
  Restricted Access
316.84 kBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.