Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/16950
Title: | Amilcare : adaptive information extraction for document annotation |
Authors: | Ciravegna, Fabio Dingli, Alexiei Wilks, Yorick Petrelli, Daniela |
Keywords: | Knowledge management Corpora (Linguistics) Natural language processing (Computer science) Self-adaptive software Expert systems (Computer science) |
Issue Date: | 2002 |
Publisher: | The ACM Digital Library |
Citation: | Ciravegna, F., Dingli, A., Wilks, Y., & Petrelli, D. (2002). Amilcare: adaptive information extraction for document annotation. 25th ACM/SIGIR International Conference on Research and Development in Information Retrieval, Tampere. 367-368. |
Abstract: | The traditional process of document annotation for knowledge identification and extraction in Knowledge Management (KM) is complex and time consuming, as it requires manual annotation by domain experts. In the typical scenario a domain expert:(1) builds an ontology describing the application domain: (2) annotates a number of texts in order to identify instances of elements (i.e., concepts and relations) in the texts. There is a strong interest in Text Mining technologies (and in particular in Human Language- based Technologies), for reducing the burden of text annotation for KM [1]. In this paper we show how Information Extraction from texts (IE) can provide support for document enrichment and make the text annotation process more effective and efficient. The main challenge to be addressed by IE researchers in this framework is portability of IE systems to new applications with no knowledge of HLT. As a matter of fact, in terms of IE each annotation task (e.g. tagging texts about failures in cars in order to identify faults and involved car parts) requires to port the IE system to a new application domain. IE is just one of the many technologies required by complex KM environments: wide use of IE tools will come only when the definition of such application domain will not require any specific IE skill apart from notions of KM. Moreover there is the need to port across different text types without major recoding of system resources, as documents in KM can range from free texts (technical reports, newspaper-like texts) to structured marked up texts (e.g. highly structured XML/HTML documents) and even a mixture of them [2]. |
URI: | https://www.um.edu.mt/library/oar//handle/123456789/16950 |
ISSN: | 01635840 |
Appears in Collections: | Scholarly Works - FacICTAI |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Conference paper - Amilcare Adaptive information extraction for document annotation.pdf Restricted Access | Amilcare : adaptive information extraction for document annotation | 784.42 kB | Adobe PDF | View/Open Request a copy |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.