DFA learning using SAT solvers

Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/107909

Full metadata record

DC Field	Value	Language
dc.date.accessioned	2023-03-28T16:16:09Z	-
dc.date.available	2023-03-28T16:16:09Z	-
dc.date.issued	2022	-
dc.identifier.citation	Formosa, L. (2022). DFA learning using SAT solvers (Bachelor's dissertation).	en_GB
dc.identifier.uri	https://www.um.edu.mt/library/oar/handle/123456789/107909	-
dc.description	B.Sc. IT (Hons)(Melit.)	en_GB
dc.description.abstract	Regular inference is the task of inferring a deterministic finite-state automaton (DFA) from a training set of positive and negative strings which, respectively, belong and do not belong to a regular language. The regular inference task is usually formulated as finding the minimum-state DFA that is consistent with the training data; this problem is known to be NP-complete and is one of the more heavily studied areas in the broader field of grammatical inference [1]. One of the most successful approaches is the family of so-called state-merging algorithms, in which a highly specific hypothesis called a prefix tree acceptor (PTA) is created from the training data, and pairs of states are iteratively selected and merged to compact and generalise the hypothesis. Another interesting approach reduces this problem to Boolean satisfiability (SAT). DFA learning is first translated to graph colouring and then to SAT, allowing a SAT solver to infer a hypothesis from a training set of positive and negative strings. The number of clauses written for large problems proves to be too much for certain SAT solvers to handle; the augmented prefix tree acceptor (APTA) is therefore pre-processed using a pre-existing state-merging algorithm such as EDSM to obtain a partially identified DFA and reduce the size of the problem. In this FYP, we study the DFASAT algorithm and perform a comparative analysis with current state-of-the-art state-merging algorithms such as EDSM, windowed-EDSM, and blue-fringe. Different experiments using 512 problem instances of Abbadingo- and StaMinA-style DFAs were set up. Results show that DFASAT outperforms the other algorithms on 16-state problems with a binary alphabet and can infer the target DFA at a higher rate. DFASAT is also able to infer multiple DFAs from the same clauses through different truth assignments. It was also found to be very reliant on the pre-processing performed. We propose two new approaches for possible improvement.
The first approach makes use of DFASAT’s ability to find multiple non-isomorphic DFAs and combines them into a single ensemble that accepts and rejects strings under a voting scheme. The second proposed approach identifies other algorithms that have been shown to work very well on sparse training data and aims to produce high-quality initial merges. These can also be extended with the previous approach.	en_GB
dc.language.iso	en	en_GB
dc.rights	info:eu-repo/semantics/restrictedAccess	en_GB
dc.subject	Sequential machine theory	en_GB
dc.subject	Heuristic algorithms	en_GB
dc.subject	Algebra, Boolean	en_GB
dc.title	DFA learning using SAT solvers	en_GB
dc.type	bachelorThesis	en_GB
dc.rights.holder	The copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.	en_GB
dc.publisher.institution	University of Malta	en_GB
dc.publisher.department	Faculty of Information and Communication Technology. Department of Artificial Intelligence	en_GB
dc.description.reviewed	N/A	en_GB
dc.contributor.creator	Formosa, Logan (2022)	-
Appears in Collections:	Dissertations - FacICT - 2022 Dissertations - FacICTAI - 2022
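
Illustrative note: the abstract describes building a prefix tree acceptor from the positive and negative training strings before any merging or SAT encoding takes place. The following is a minimal Python sketch of that first step only, assuming strings are given as ordinary Python strings over any alphabet. The names PTA, build_pta and classify are hypothetical and are not taken from the dissertation or from any DFASAT implementation.

```python
from dataclasses import dataclass, field


@dataclass
class PTA:
    """Hypothetical augmented prefix tree acceptor (illustration only)."""
    # transitions[state][symbol] -> next state; state 0 is the root (empty prefix)
    transitions: list = field(default_factory=lambda: [{}])
    # labels[state] is True (accepting), False (rejecting) or None (unlabelled)
    labels: list = field(default_factory=lambda: [None])

    def add(self, string, accepted):
        """Insert one training string, creating states for unseen prefixes."""
        state = 0
        for symbol in string:
            nxt = self.transitions[state].get(symbol)
            if nxt is None:
                nxt = len(self.transitions)
                self.transitions.append({})
                self.labels.append(None)
                self.transitions[state][symbol] = nxt
            state = nxt
        if self.labels[state] is not None and self.labels[state] != accepted:
            raise ValueError(f"{string!r} is labelled both positive and negative")
        self.labels[state] = accepted


def build_pta(positive, negative):
    """Build the tree from positive (accepted) and negative (rejected) strings."""
    pta = PTA()
    for s in positive:
        pta.add(s, True)
    for s in negative:
        pta.add(s, False)
    return pta


def classify(pta, string):
    """Return True/False for labelled strings, None if the tree says nothing."""
    state = 0
    for symbol in string:
        state = pta.transitions[state].get(symbol)
        if state is None:
            return None
    return pta.labels[state]


if __name__ == "__main__":
    example = build_pta(positive=["a", "abb"], negative=["", "ab"])
    print(len(example.transitions), "states")
    print(classify(example, "abb"), classify(example, "ab"), classify(example, "b"))
```

The tree is fully specific: it only answers for strings seen in training. State merging and the SAT-based search described in the abstract both start from such a structure and generalise it by identifying states with one another.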

Files in This Item:

File	Description	Size	Format
2208ICTICT390905069315_1.PDF Restricted Access		1.14 MB	Adobe PDF
