Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/92176
Title: Architectural layout for a text classifier for electronic mail
Authors: Muscat, Andre (2000)
Keywords: Electronic mail systems
Computer software -- Development
Machine learning
Issue Date: 2000
Citation: Muscat, A. (2000). Architectural layout for a text classifier for electronic mail (Bachelor's dissertation).
Abstract: This document is summary of the investigative work done into the various components, which make up a text classification system for e-mail documents. The final intent of this research is the creation an application (TextCat) that uses machine learning techniques to classify an e-mail document into predefined semantic classes. The resulting software program performs text classification, based on distributional approximation of feature word terms, extracted from a very large corpus of labelled texts. It enables document categorisations based on a full-text of articles. Using probability theory as a formal foundation, a Machine Learning based method was developed to allow electronic mail document collections to be automatically organized at a topical level. The classification decisions are made by considering only the presence (or absence) of a small number of features (words) in each document. Various feature selection methods were investigated.
Description: B.Sc. IT (Hons)(Melit.)
URI: https://www.um.edu.mt/library/oar/handle/123456789/92176
Appears in Collections:Dissertations - FacICT - 1999-2009
Dissertations - FacICTCS - 1999-2007

Files in This Item:
File Description SizeFormat 
B.SC.(HONS)IT_Muscat_Andre_2000.PDF
  Restricted Access
43.35 MBAdobe PDFView/Open Request a copy
Muscat_Andre_acc.material.pdf
  Restricted Access
215.46 kBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.