Face2Text : collecting an annotated image description corpus for the generation of rich face descriptions

Gatt, Albert; Tanti, Marc; Muscat, Adrian; Paggio, Patrizia Pagg; Farrugia, Reuben A.; Borg, Claudia; Camilleri, Kenneth P.; Rosner, Michael; van der Plas, Lonneke

Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/85819

Full metadata record

DC Field	Value	Language
dc.contributor.author	Gatt, Albert	-
dc.contributor.author	Tanti, Marc	-
dc.contributor.author	Muscat, Adrian	-
dc.contributor.author	Paggio, Patrizia Pagg	-
dc.contributor.author	Farrugia, Reuben A.	-
dc.contributor.author	Borg, Claudia	-
dc.contributor.author	Camilleri, Kenneth P.	-
dc.contributor.author	Rosner, Michael	-
dc.contributor.author	van der Plas, Lonneke	-
dc.date.accessioned	2021-12-20T10:56:27Z	-
dc.date.available	2021-12-20T10:56:27Z	-
dc.date.issued	2018	-
dc.identifier.citation	Gatt, A., Tanti, M., Muscat, A., Paggio, P., Farrugia, R. A., Borg, C., ... & Van der Plas, L. (2018). Face2text: Collecting an annotated image description corpus for the generation of rich face descriptions. International Conference on Language Resources and Evaluation, Miyazaki. 3323-3328.	en_GB
dc.identifier.uri	https://www.um.edu.mt/library/oar/handle/123456789/85819	-
dc.description.abstract	The past few years have witnessed renewed interest in NLP tasks at the interface between vision and language. One intensively-studied problem is that of automatically generating text from images. In this paper, we extend this problem to the more specific domain of face description. Unlike scene descriptions, face descriptions are more fine-grained and rely on attributes extracted from the image, rather than objects and relations. Given that no data exists for this task, we present an ongoing crowdsourcing study to collect a corpus of descriptions of face images taken ‘in the wild’. To gain a better understanding of the variation we find in face description and the possible issues that this may raise, we also conducted an annotation study on a subset of the corpus. Primarily, we found descriptions to refer to a mixture of attributes, not only physical, but also emotional and inferential, which is bound to create further challenges for current image-to-text methods.	en_GB
dc.language.iso	en	en_GB
dc.publisher	LREC	en_GB
dc.rights	info:eu-repo/semantics/restrictedAccess	en_GB
dc.subject	Face perception	en_GB
dc.subject	Natural language generation (Computer science)	en_GB
dc.subject	Crowdsourcing	en_GB
dc.title	Face2Text : collecting an annotated image description corpus for the generation of rich face descriptions	en_GB
dc.type	conferenceObject	en_GB
dc.rights.holder	The copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.	en_GB
dc.bibliographicCitation.conferencename	International Conference on Language Resources and Evaluation	en_GB
dc.bibliographicCitation.conferenceplace	Miyazaki, Japan, May 2018	en_GB
dc.description.reviewed	peer-reviewed	en_GB
Appears in Collections:	Scholarly Works - FacICTCCE

Files in This Item:

File	Description	Size	Format
L18-1525.pdf Restricted Access		316.84 kB	Adobe PDF	View/Open Request a copy

Show simple item record Statistics