Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/117595
Full metadata record
DC FieldValueLanguage
dc.contributor.authorGeorgakilas, Georgios K.-
dc.contributor.authorGrioni, Andrea-
dc.contributor.authorLiakos, Konstantinos G.-
dc.contributor.authorChalupova, Eliska-
dc.contributor.authorPlessas, Fotis C.-
dc.contributor.authorAlexiou, Panagiotis-
dc.date.accessioned2024-01-18T15:28:09Z-
dc.date.available2024-01-18T15:28:09Z-
dc.date.issued2020-
dc.identifier.citationGeorgakilas, G. K., Grioni, A., Liakos, K. G., Chalupova, E., Plessas, F. C., & Alexiou, P. (2020). Multi-branch convolutional neural network for identification of small non-coding RNA genomic loci. Scientific reports, 10(1), 9486.en_GB
dc.identifier.urihttps://www.um.edu.mt/library/oar/handle/123456789/117595-
dc.description.abstractGenomic regions that encode small RNA genes exhibit characteristic patterns in their sequence, secondary structure, and evolutionary conservation. Convolutional Neural Networks are a family of algorithms that can classify data based on learned patterns. Here we present MuStARD an application of Convolutional Neural Networks that can learn patterns associated with user-defined sets of genomic regions, and scan large genomic areas for novel regions exhibiting similar characteristics. We demonstrate that MuStARD is a generic method that can be trained on different classes of human small RNA genomic loci, without need for domain specific knowledge, due to the automated feature and background selection processes built into the model. We also demonstrate the ability of MuStARD for inter-species identification of functional elements by predicting mouse small RNAs (pre-miRNAs and snoRNAs) using models trained on the human genome. MuStARD can be used to filter small RNA-Seq datasets for identification of novel small RNA loci, intra- and inter- species, as demonstrated in three use cases of human, mouse, and fly pre-miRNA prediction. MuStARD is easy to deploy and extend to a variety of genomic classification questions. Code and trained models are freely available at gitlab.com/RBP_Bioinformatics/mustard.en_GB
dc.language.isoenen_GB
dc.publisherNature Publishing Group UKen_GB
dc.rightsinfo:eu-repo/semantics/openAccessen_GB
dc.subjectComputational biologyen_GB
dc.subjectBioinformaticsen_GB
dc.subjectMachine learningen_GB
dc.titleMulti-branch convolutional neural network for identification of small non-coding RNA genomic locien_GB
dc.typearticleen_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holderen_GB
dc.description.reviewedpeer-revieweden_GB
dc.identifier.doi10.1038/s41598-020-66454-3-
dc.publication.titleScientific reportsen_GB
Appears in Collections:Scholarly Works - FacHScABS



Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.