Mining web sites using adaptive information extraction

Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/16967

Title:	Mining web sites using adaptive information extraction
Authors:	Dingli, Alexiei Ciravegna, Fabio Guthrie, David Wilks, Yorick
Keywords:	Semantic Web Information retrieval -- Automation Data mining Self-adaptive software Web sites
Issue Date:	2003
Publisher:	Association for Computational Linguistics
Citation:	Dingli, A., Ciravegna, F., Guthrie, D., & Wilks, Y. (2003). Mining web sites using adaptive information extraction. 10th Conference on European Chapter of the Association for Computational Linguistics, Budapest. 75-78.
Abstract:	Adaptive Information Extraction systems (IES) are currently used by some Semantic Web (SW) annotation tools as support to annotation (Handschuh et al., 2002; Vargas-Vera et al., 2002). They are generally based on fully supervised methodologies requiring fairly intense domain-specific annotation. Unfortunately, selecting representative examples may be difficult and annotations can be incorrect and require time. In this paper we present a methodology that drastically reduce (or even remove) the amount of manual annotation required when annotating consistent sets of pages. A very limited number of user-defined examples are used to bootstrap learning. Simple, high precision (and possibly high recall) IE patterns are induced using such examples, these patterns will then discover more examples which will in turn discover more patterns, etc.
URI:	https://www.um.edu.mt/library/oar//handle/123456789/16967
Appears in Collections:	Scholarly Works - FacICTAI

Files in This Item:

File	Description	Size	Format
OA-Mining websites.pdf	Mining web sites using adaptive information extraction	191.42 kB	Adobe PDF	View/Open