Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/92198
Full metadata record
DC FieldValueLanguage
dc.date.accessioned2022-03-24T13:41:05Z-
dc.date.available2022-03-24T13:41:05Z-
dc.date.issued2021-
dc.identifier.citationGrech, F. (2021). Using evolutionary algorithms for DFA learning (Bachelor’s dissertation).en_GB
dc.identifier.urihttps://www.um.edu.mt/library/oar/handle/123456789/92198-
dc.descriptionB.Sc. IT (Hons)(Melit.)en_GB
dc.description.abstractGrammatical inference is the task of learning a formal grammar or class description from a set of training examples. In this research, we tackle the task of finding the minimum state Deterministic Finite-state Automaton (DFA) that is consistent with the training data, which is an NP-Complete problem. The problem instances that we are concerned with consist of an unknown DFA whose alphabet is binary, similar to those used in the Abbadingo One competition. A dataset of strings is labelled by this DFA and given to the learner. The learner constructs a hypothesis (DFA) with this dataset, whose accuracy is calculated using another set of strings, called the testing set. A common and successful approach to solving this problem is by building a maximally specific hypothesis called an Augmented Prefix Tree Acceptor (APTA) that embeds the training data exactly. States in a DFA can be merged to reduce its size. From the APTA, many merges are possible, and heuristics such as the Abbadingo-winning Evidence-Driven State Merging (EDSM) algorithm can be used to score the merges. EDSM does this by comparing labels of states which are to be merged. The highest-scoring merge is performed, and this process repeats until no more merges are possible, resulting in the final DFA. In this project, we evolve these types of heuristics using a Genetic Program (GP). Running this GP multiple times results in multiple viable heuristics. The performance of these heuristics can be increased by forming an ensemble, which when faced with a problem instance, each heuristic in it generates a hypothesis. Then, these DFAs parse the testing set using one of many modes, such as a fair majority vote. In such case, each string is parsed by each DFA, and its label is determined by the majority’s verdict. In this project, we design and implement a GP with several features that ensure proper representation and diversity of heuristics. We also experiment with different types of GP runs that focus on evolving heuristics for other purposes. Several heuristics were evolved which in turn were used to form ensembles of different sizes. Furthermore, the several ensemble modes are also evaluated using a number of problem instances. Over 16-, 32-, 64- and 128-state problem instances, the evolved individual heuristics perform roughly as well as W-EDSM, a variation of EDSM. However, when forming an ensemble with these heuristics, we see a significant boost in performance, with some ensembles performing nearly three times better than W-EDSM. Furthermore, smaller ensembles of sizes 10 and 20 achieve 1.5 – 2 times the performance of W-EDSM in some cases while still keeping the execution time linear in the size of the ensemble.en_GB
dc.language.isoenen_GB
dc.rightsinfo:eu-repo/semantics/restrictedAccessen_GB
dc.subjectGenetic programming (Computer science)en_GB
dc.subjectSequential machine theoryen_GB
dc.subjectAlgorithmsen_GB
dc.subjectHeuristic algorithmsen_GB
dc.subjectEvolutionary computationen_GB
dc.titleUsing evolutionary algorithms for DFA learningen_GB
dc.typebachelorThesisen_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.en_GB
dc.publisher.institutionUniversity of Maltaen_GB
dc.publisher.departmentFaculty of ICT. Department of Artificial Intelligenceen_GB
dc.description.reviewedN/Aen_GB
dc.contributor.creatorGrech, Francesco (2021)-
Appears in Collections:Dissertations - FacICT - 2021
Dissertations - FacICTAI - 2021

Files in This Item:
File Description SizeFormat 
21BITAI025.pdf
  Restricted Access
1.09 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.