Machine learning techniques applied to biomolecular data prediction
DOI:
https://doi.org/10.33448/rsd-v15i3.50648Keywords:
Machine learning, Deep learning, AI.Abstract
The objective of this study was to analyze the application of machine learning (ML) techniques in the evaluation of biomolecular prediction data in the genetic field, comparing different approaches regarding accuracy, efficiency, and applicability in the analysis of biomolecular data, as well as identifying and discussing the main limitations and advances of these techniques. In the era of Industry 4.0, there is a significant growth in the volume of digital data, including health information, social media, and the Internet of Things, whose analysis demands the use of artificial intelligence (AI), especially ML and deep learning (DL). AI enables computational systems to perform tasks traditionally associated with human intelligence, being widely applied to the analysis of big data in healthcare. Among its advantages are flexibility, scalability, and the ability to integrate heterogeneous data, such as demographic information, laboratory tests, images, and clinical texts, contributing to diagnosis, prognosis, and risk prediction. ML has wide application in molecular biology and drug discovery, while DL, based on deep neural networks, proves particularly efficient in analyzing large volumes of data, such as in cancer diagnosis and the identification of therapeutic targets. However, challenges persist, including the need for robust computational infrastructure, limited interpretability of models, high computational costs, and ethical issues related to data privacy and security.
References
Crampon, K., Giorkallos, A., Deldossi, M., Baud, S., & Steffenel, L. A. (2022). Machine-learning methods for ligand–protein molecular docking. Drug Discovery Today, 27(1), 151–164. https://doi.org/10.1016/j.drudis.2021.09.007
Crossetti, M. G. O. (2012). Revisão integrativa de pesquisa na enfermagem o rigor cientifico que lhe é exigido. Rev Gaúcha Enferm. 33(2):8-9.
Dias, M. F. R., et al. (2021). Application of PolyPRep tools on HIV protease polyproteins using molecular docking. Brazilian Journal of Biology, 84, e245592. https://doi.org/10.1590/1519-6984.245592
Elgart, M., Redline, S., & Sofer, T. (2021). Machine and deep learning in molecular and genetic aspects of sleep research. Neurotherapeutics, 18(1), 228–243. https://doi.org/10.1007/s13311-021-01014-9
Gupta, R., Srivastava, D., Sahu, M., Tiwari, S., Ambasta, R. K., & Kumar, P. (2021). Artificial intelligence to deep learning: Machine intelligence approach for drug discovery. Molecular Diversity, 25, 1315–1360. https://doi.org/10.1007/s11030-021-10217-3
Hudson, I. L. (2021). Data integration using advances in machine learning in drug discovery and molecular biology. In Artificial Neural Networks (pp. 167–184). Springer. https://doi.org/10.1007/978-1-0716-0826-5_7
Issa, N. T., et al. (2021). Machine and deep learning approaches for cancer drug repurposing. In Seminars in Cancer Biology (pp. 132–142). Academic Press. https://doi.org/10.1016/j.semcancer.2019.12.011
Ngiam, K. Y., & Khor, I. W. (2019). Big data and machine learning algorithms for health-care delivery. The Lancet Oncology, 20(5), 262–273. : https://doi.org/10.1016/S1470-2045(19)30149-4
Pereira, A. S. et al. (2018). Metodologia da pesquisa científica. Editora da UFSM. https://repositorio.ufsm.br/bitstream/handle/1/15824/Lic_Computacao_Metodologia-Pesquisa-Cientifica.pdf?sequence=1.
Risemberg, R. I. C. et al. (2026). A importância da metodologia científica no desenvolvimento de artigos científicos. E-Acadêmica. 7(1), e0171675. https://eacademica.org/eacademica/article/view/675.
Snyder, H. (2019). Literature review as a research methodology: An overview and guidelines. Journal of Business Research. 104, 333-9. https://doi.org/10.1016/j.jbusres.2019.07.039.
Sarker, I. H. (2021). Machine learning: Algorithms, real-world applications and research directions. SN Computer Science, 2(3), 1–21. https://doi.org/10.1007/s42979-021-00592-x
Theodosiou, A. A., & Read, R. C. (2023). Artificial intelligence, machine learning and deep learning: Potential resources for the infection clinician. Journal of Infection, 87(4). https://www.sciencedirect.com/science/article/pii/S0163445323003791
Tran, K. A., et al. (2021). Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Medicine, 13, Article 1–17. https://doi.org/10.1186/s13073-021-00968-x
Wessels, H. H., Stirn, A., Méndez-Mancilla, A., Kim, E. J., Hart, S. K., Knowles, D. A., & Sanjana, N. E. (2024). Prediction of on-target and off-target activity of CRISPR–Cas13d guide RNAs using deep learning. Nature Biotechnology, 42(4), 628–637. https://doi.org/10.1038/s41587-023-01830-8
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Yasmin Vitória da Silva Pedroso, Jéssica Karoline da Cunha, Joyce Nascimento Santos, Paulo Bandiera Paiva

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
1) Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
2) Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
3) Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
