Investigating the use of Bi-words for learning SPAM

Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/92896

Title:	Investigating the use of Bi-words for learning SPAM
Authors:	Bartolo, Mark (2006)
Keywords:	Spam (Electronic mail) Internet -- Security measures Computer security
Issue Date:	2006
Citation:	Bartolo, M. (2006). Investigating the use of Bi-words for learning SPAM (Bachelor's dissertation).
Abstract:	The target behind the artifact was to develop an anti-spam mail-client that would be able to classify e-mails according to whether they are spam or not. This was achieved by implementing two anti-spam algorithms that separately classified the e-mails in the best way possible, in order to avoid false-positive and false-negative e-mails. The results from the two anti-spam algorithms were quite satisfactory, especially that achieved by the first algorithm, which based it's classification on the occurrences of word-pairs in each e-mail. This algorithm achieved a maximum rate of correct ham classification of 92.6% and a maximum rate of correct spam classification of 98.4%. The second algorithm, which used only the bi-words which had low entropy, performed less well than algorithm one, especially when the system was not trained enough. The algorithms were left for the user's choice, from the application's main-window, that has the basic functionalities of popular mail-clients. The final windows-application, thoroughly showed the aims of the artifact.
Description:	B.Sc. IT (Hons)(Melit.)
URI:	https://www.um.edu.mt/library/oar/handle/123456789/92896
Appears in Collections:	Dissertations - FacICT - 1999-2009 Dissertations - FacICTCS - 1999-2007

Files in This Item:

File	Description	Size	Format
B.SC.(HONS)IT_Bartolo_Mark_2006.pdf Restricted Access		9.85 MB	Adobe PDF	View/Open Request a copy

Show full item record Statistics