Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/117420
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.date.accessioned | 2024-01-17T06:53:32Z | - |
dc.date.available | 2024-01-17T06:53:32Z | - |
dc.date.issued | 2022 | - |
dc.identifier.citation | Martinek, V., Cechak, D., Gresova, K., Alexiou, P., & Simecek, P. (2022). Fine-Tuning Transformers For Genomic Tasks. bioRxiv, 2022-02. | en_GB |
dc.identifier.uri | https://www.um.edu.mt/library/oar/handle/123456789/117420 | - |
dc.description.abstract | Transformers are a type of neural network architecture that has been successfully used to achieve state-of-the-art performance in numerous natural language processing tasks. However, what about DNA, the language life written in the four-letter alphabet? In this paper, we review the current state of Transformers usage in genomics and molecular biology in general, introduce a collection of benchmark datasets for the classification of genomic sequences, and compare the performance of several model architectures on those benchmarks, including a BERT-like model for DNA sequences DNABERT as implemented in HuggingFace (armheb/DNA_bert_6 model). In particular, we explore the effect of pre-training on a large DNA corpus vs training from scratch (with randomized weights). The results presented here can be used for identification of functional elements in human and other genomes. | en_GB |
dc.language.iso | en | en_GB |
dc.publisher | Cold Spring Harbor Laboratory | en_GB |
dc.rights | info:eu-repo/semantics/restrictedAccess | en_GB |
dc.subject | Data sets | en_GB |
dc.subject | Genomics -- Case studies | en_GB |
dc.subject | Deep learning (Machine learning) | en_GB |
dc.subject | Convolutions (Mathematics) | en_GB |
dc.subject | Neural networks (Computer science) | en_GB |
dc.title | Fine-tuning transformers for genomic tasks | en_GB |
dc.type | article | en_GB |
dc.rights.holder | The copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder | en_GB |
dc.description.reviewed | non peer-reviewed | en_GB |
dc.identifier.doi | 10.1101/2022.02.07.479412 | - |
dc.publication.title | bioRxiv | en_GB |
dc.contributor.creator | Martinek, Vlastimil | - |
dc.contributor.creator | Cechak, David | - |
dc.contributor.creator | Gresova, Katarina | - |
dc.contributor.creator | Alexiou, Panagiotis | - |
dc.contributor.creator | Simecek, Petr | - |
Appears in Collections: | Scholarly Works - FacHScABS |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Fine_tuning_transformers_for_genomic_tasks.pdf Restricted Access | 184.23 kB | Adobe PDF | View/Open Request a copy |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.