Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/92176
Title: | Architectural layout for a text classifier for electronic mail |
Authors: | Muscat, Andre (2000) |
Keywords: | Electronic mail systems Computer software -- Development Machine learning |
Issue Date: | 2000 |
Citation: | Muscat, A. (2000). Architectural layout for a text classifier for electronic mail (Bachelor's dissertation). |
Abstract: | This document is summary of the investigative work done into the various components, which make up a text classification system for e-mail documents. The final intent of this research is the creation an application (TextCat) that uses machine learning techniques to classify an e-mail document into predefined semantic classes. The resulting software program performs text classification, based on distributional approximation of feature word terms, extracted from a very large corpus of labelled texts. It enables document categorisations based on a full-text of articles. Using probability theory as a formal foundation, a Machine Learning based method was developed to allow electronic mail document collections to be automatically organized at a topical level. The classification decisions are made by considering only the presence (or absence) of a small number of features (words) in each document. Various feature selection methods were investigated. |
Description: | B.Sc. IT (Hons)(Melit.) |
URI: | https://www.um.edu.mt/library/oar/handle/123456789/92176 |
Appears in Collections: | Dissertations - FacICT - 1999-2009 Dissertations - FacICTCS - 1999-2007 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
B.SC.(HONS)IT_Muscat_Andre_2000.PDF Restricted Access | 43.35 MB | Adobe PDF | View/Open Request a copy | |
Muscat_Andre_acc.material.pdf Restricted Access | 215.46 kB | Adobe PDF | View/Open Request a copy |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.