Machine learning techniques applied to biomolecular data prediction

Authors

DOI:

https://doi.org/10.33448/rsd-v15i3.50648

Keywords:

Machine learning, Deep learning, AI.

Abstract

The objective of this study was to analyze the application of machine learning (ML) techniques in the evaluation of biomolecular prediction data in the genetic field, comparing different approaches regarding accuracy, efficiency, and applicability in the analysis of biomolecular data, as well as identifying and discussing the main limitations and advances of these techniques. In the era of Industry 4.0, there is a significant growth in the volume of digital data, including health information, social media, and the Internet of Things, whose analysis demands the use of artificial intelligence (AI), especially ML and deep learning (DL). AI enables computational systems to perform tasks traditionally associated with human intelligence, being widely applied to the analysis of big data in healthcare. Among its advantages are flexibility, scalability, and the ability to integrate heterogeneous data, such as demographic information, laboratory tests, images, and clinical texts, contributing to diagnosis, prognosis, and risk prediction. ML has wide application in molecular biology and drug discovery, while DL, based on deep neural networks, proves particularly efficient in analyzing large volumes of data, such as in cancer diagnosis and the identification of therapeutic targets. However, challenges persist, including the need for robust computational infrastructure, limited interpretability of models, high computational costs, and ethical issues related to data privacy and security.

References

Crampon, K., Giorkallos, A., Deldossi, M., Baud, S., & Steffenel, L. A. (2022). Machine-learning methods for ligand–protein molecular docking. Drug Discovery Today, 27(1), 151–164. https://doi.org/10.1016/j.drudis.2021.09.007

Crossetti, M. G. O. (2012). Revisão integrativa de pesquisa na enfermagem o rigor cientifico que lhe é exigido. Rev Gaúcha Enferm. 33(2):8-9.

Dias, M. F. R., et al. (2021). Application of PolyPRep tools on HIV protease polyproteins using molecular docking. Brazilian Journal of Biology, 84, e245592. https://doi.org/10.1590/1519-6984.245592

Elgart, M., Redline, S., & Sofer, T. (2021). Machine and deep learning in molecular and genetic aspects of sleep research. Neurotherapeutics, 18(1), 228–243. https://doi.org/10.1007/s13311-021-01014-9

Gupta, R., Srivastava, D., Sahu, M., Tiwari, S., Ambasta, R. K., & Kumar, P. (2021). Artificial intelligence to deep learning: Machine intelligence approach for drug discovery. Molecular Diversity, 25, 1315–1360. https://doi.org/10.1007/s11030-021-10217-3

Hudson, I. L. (2021). Data integration using advances in machine learning in drug discovery and molecular biology. In Artificial Neural Networks (pp. 167–184). Springer. https://doi.org/10.1007/978-1-0716-0826-5_7

Issa, N. T., et al. (2021). Machine and deep learning approaches for cancer drug repurposing. In Seminars in Cancer Biology (pp. 132–142). Academic Press. https://doi.org/10.1016/j.semcancer.2019.12.011

Ngiam, K. Y., & Khor, I. W. (2019). Big data and machine learning algorithms for health-care delivery. The Lancet Oncology, 20(5), 262–273. : https://doi.org/10.1016/S1470-2045(19)30149-4

Pereira, A. S. et al. (2018). Metodologia da pesquisa científica. Editora da UFSM. https://repositorio.ufsm.br/bitstream/handle/1/15824/Lic_Computacao_Metodologia-Pesquisa-Cientifica.pdf?sequence=1.

Risemberg, R. I. C. et al. (2026). A importância da metodologia científica no desenvolvimento de artigos científicos. E-Acadêmica. 7(1), e0171675. https://eacademica.org/eacademica/article/view/675.

Snyder, H. (2019). Literature review as a research methodology: An overview and guidelines. Journal of Business Research. 104, 333-9. https://doi.org/10.1016/j.jbusres.2019.07.039.

Sarker, I. H. (2021). Machine learning: Algorithms, real-world applications and research directions. SN Computer Science, 2(3), 1–21. https://doi.org/10.1007/s42979-021-00592-x

Theodosiou, A. A., & Read, R. C. (2023). Artificial intelligence, machine learning and deep learning: Potential resources for the infection clinician. Journal of Infection, 87(4). https://www.sciencedirect.com/science/article/pii/S0163445323003791

Tran, K. A., et al. (2021). Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Medicine, 13, Article 1–17. https://doi.org/10.1186/s13073-021-00968-x

Wessels, H. H., Stirn, A., Méndez-Mancilla, A., Kim, E. J., Hart, S. K., Knowles, D. A., & Sanjana, N. E. (2024). Prediction of on-target and off-target activity of CRISPR–Cas13d guide RNAs using deep learning. Nature Biotechnology, 42(4), 628–637. https://doi.org/10.1038/s41587-023-01830-8

Published

2026-03-07

Issue

Section

Health Sciences

How to Cite

Machine learning techniques applied to biomolecular data prediction. Research, Society and Development, [S. l.], v. 15, n. 3, p. e2315350648, 2026. DOI: 10.33448/rsd-v15i3.50648. Disponível em: https://www.rsdjournal.org/rsd/article/view/50648. Acesso em: 15 mar. 2026.