Application of Artificial Intelligence in Biochemistry Research: A Review
DOI:
https://doi.org/10.54987/jobimb.v11i2.835Keywords:
Artificial Intelligence, Biochemistry Research, Data, Genomics, Protein foldingAbstract
The exploration of Biochemistry research has been expanded by artificial intelligence (AI) and its ability to analyse immense and intricate datasets in a way that would be unattainable by human effort alone. This review delves into the most recent examples of AI breakthroughs that had a transformative impact on key aspects of biochemistry. AI has now led to the creation and improvement of drug molecules and the capability to predict which new proteins could be targeted for repurposing with current drugs. When it comes to protein structures, algorithms such as AlphaFold have made great strides in resolving the protein folding problem that has been a challenge for so long. Reliably identifying proteins and metabolites from spectral data is now possible with deep learning models. Meanwhile, AI can classify sequences and spot gene expression patterns in massive genomics and transcriptomics datasets with ease. The remarkable capabilities of AI to automate the analysis of medical images and natural language descriptions of patient symptoms have a promising potential for transforming disease diagnosis and treatment. Nevertheless, obstacles such as data availability, interpretability of AI models, ethical considerations, and generalization must be tackled as these technologies evolve. The collaboration between AI and biochemistry appears to be optimistic, with biochemical data powering the development of more robust AI systems that can extract new knowledge from vast datasets beyond human reach. Thus, this mutually beneficial relationship has the potential to vastly expedite discovery across molecular biology.
References
Nelson DL, Cox MM. Lehninger Principles of Biochemistry 8th Edition. Macmillan Learning; 2021.
Khatri P, Sirota M, Butte AJ. Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol. 2012; 8(2):e1002375.
Libbrecht MW, Noble WS. Machine learning applications in genetics and genomics. Nat Rev Genet. 2015; 16(6):321-32.
Jordan MI, Mitchell TM. Machine learning: Trends, perspectives, and prospects. Sci. 2015; 349(6245):255-60.
Sarker IH. Machine Learning: Algorithms, Real-World Applications and Research Directions. SN Comput Sci. 2021; 2(3).
Min S, Lee B, Yoon S. Deep learning in bioinformatics. Brief Bioinformatics. 2017; 18(5):851-69.
Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021; 596(7873):583-9.
Matsuta Y, Ito M, Tohsato Y. ECOH: An Enzyme Commission number predictor using mutual information and a support vector machine. Bioinformatics. 2012; 29(3):365-72.
Baum ZJ, Yu X, Ayala PY, Zhao Y, Watkins SP, Zhou Q. Artificial intelligence in chemistry: current trends and future directions. J Chem Inf Model. 2021; 61(7):3197-212.
Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, et al. Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface. 2018; 15(141).
AlQuraishi M. AlphaFold at CASP13. Bioinformatics. 2019; 35(22):4862-5.
Dasgupta A, Nath A. Classification of machine learning algorithms. Int J Innov Res Adv Eng. 2016; 3(3):6-11.
Liu Y, Esan OC, Pan Z, An L. Machine learning for advanced energy materials. Energ AI. 2021; 3:100049.
Ayodele TO. Types of machine learning algorithms. 2010; 3:19-48.
Han J. M. Kamber in J. Pei, Data mining: Concepts and techniques: Concepts and techniques, 3. izd. Amsterdam, Netherlands: Elsevier; 2011.
Sarker IH, Kayes A, Badsha S, Alqahtani H, Watters P, Ng A. Cybersecurity data science: an overview from machine learning perspective. J Big Data. 2020; 7:1-29.
Nasteski V. An overview of the supervised machine learning methods. Horizons Int Sci J. 2017; 4:51-62.
Patel L, Shukla T, Huang X, Ussery DW, Wang S. Machine learning methods in drug discovery. Molecules. 2020; 25(22):5277.
Ramadass S, Abraham A. Unsupervised control paradigm for performance evaluation. Int J Comput Appl. 2012; 975:8887.
Gupta R, Srivastava D, Sahu M, Tiwari S, Ambasta RK, Kumar P. Artificial intelligence to deep learning: machine intelligence approach for drug discovery. Mol Divers. 2021; 25:1315-60.
Puterman ML. Markov decision processes: discrete stochastic dynamic programming: John Wiley & Sons; 2014.
Xu Y, Wang C, Liang J, Yue K, Li W, Zheng S, et al. Deep Reinforcement Learning Based Decision Making for Complex Jamming Waveforms. Entropy. 2022; 24(10):1441.
Polydoros AS, Nalpantidis L. Survey of model-based reinforcement learning: Applications on robotics. J Intell Robot Syst. 2017; 86(2):153-73.
Çal???r S, Pehlivano?lu MK, editors. Model-free reinforcement learning algorithms: A survey. 2019 27th Signal Processing and Communications Applications Conference (SIU); 2019: IEEE.
Bengio Y, Lecun Y, Hinton G. Deep learning for AI. Commun ACM. 2021; 64(7):58-65.
Nichols JA, Herbert Chan HW, Baker MA. Machine learning: applications of artificial intelligence to imaging and diagnosis. Biophys Rev. 2019; 11:111-8.
Hamilton WL. Graph representation learning. Synth Lect Artif Intell Mach Learn. 2020; 14(3):1-159.
Wu Z, Pan S, Chen F, Long G, Zhang C, Philip SY. A comprehensive survey on graph neural networks. IEEE Trans Neural Netw Learn Syst. 2020; 32(1):4-24.
Ahmed KT, Park S, Jiang Q, Yeu Y, Hwang T, Zhang W. Network-based drug sensitivity prediction. BMC Med Genomics. 2020; 13(11):1-10.
Bove P, Micheli A, Milazzo P, Podda M, editors. Prediction of Dynamical Properties of Biochemical Pathways with Graph Neural Networks. Bioinformatics; 2020.
Yamashita R, Nishio M, Do RKG, Togashi K. Convolutional neural networks: an overview and application in radiology. Insights Imaging. 2018; 9:611-29.
Chauhan R, Ghanshala KK, Joshi R, editors. Convolutional neural network (CNN) for image detection and recognition. 2018 first international conference on secure cyber computing and communication (ICSCCC); 2018: IEEE.
Mohamed C, Nsiri B, Abdelmajid S, Brahim B, editors. Deep convolutional networks for image segmentation: application to optic disc detection. 2020 International Conference on Electrical and Information Technologies (ICEIT); 2020: IEEE.
Askr H, Elgeldawi E, Aboul Ella H, Elshaier YA, Gomaa MM, Hassanien AE. Deep learning in drug discovery: an integrative review and future challenges. Artif Intell Rev. 2022:1-63.
Zia T, Zahid U. Long short-term memory recurrent neural network architectures for Urdu acoustic modeling. Int J Speech Technol. 2019; 22:21-30.
Eliasy A, Przychodzen J. The role of AI in capital structure to enhance corporate funding strategies. Array. 2020; 6:100017.
Hewamalage H, Bergmeir C, Bandara K. Recurrent neural networks for time series forecasting: Current status and future directions. Int J Forecast. 2021; 37(1):388-427.
Vickram A, Kamini AR, Das R, Pathy MR, Parameswari R, Archana K, et al. Validation of artificial neural network models for predicting biochemical markers associated with male infertility. Syst Biol Reprod Med. 2016; 62(4):258-65.
Aggarwal A, Mittal M, Battineni G. Generative adversarial network: An overview of theory and applications. Int J Inf Manag Data Insights. 2021; 1(1):100004.
Repecka D, Jauniskis V, Karpus L, Rembeza E, Rokaitis I, Zrimec J, et al. Expanding functional protein sequence spaces using generative adversarial networks. Nat Mach Intell. 2021; 3(4):324-33.
Kazeminia S, Baur C, Kuijper A, van Ginneken B, Navab N, Albarqouni S, et al. GANs for medical image analysis. Artif Intell Med. 2020; 109:101938.
Khurana D, Koli A, Khatter K, Singh S. Natural language processing: state of the art, current trends and challenges. Multimed Tools Appl. 2023; 82(3):3713-44.
Thessen AE, Cui H, Mozzherin D. Applications of Natural Language Processing in Biodiversity Science. Adv Bioinformatics. 2012; 2012:391574.
Chen X, Xie H, Cheng G, Poon LKM, Leng M, Wang FL. Trends and Features of the Applications of Natural Language Processing Techniques for Clinical Trials Text Analysis. Appl Sci. 2020; 10(6):2157.
Gallego V, Naveiro R, Roca C, Ríos Insua D, Campillo NE. AI in drug development: a multidisciplinary perspective. Mol Divers. 2021; 25:1461-79.
Lin E, Lin C-H, Lane H-Y. Relevant applications of generative adversarial networks in drug design and discovery: molecular de novo design, dimensionality reduction, and de novo peptide and protein design. Molecules. 2020; 25(14):3250.
Grisoni F, Moret M, Lingwood R, Schneider G. Bidirectional molecule generation with recurrent neural networks. J Chem Inf Model. 2020; 60(3):1175-83.
Pogány P, Arad N, Genway S, Pickett SD. De novo molecule design by translating from reduced graphs to SMILES. J Chem Inf Model. 2018; 59(3):1136-46.
Kell DB, Samanta S, Swainston N. Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently. Biochem J. 2020; 477(23):4559-80.
Prykhodko O, Johansson SV, Kotsias P-C, Arús-Pous J, Bjerrum EJ, Engkvist O, et al. A de novo molecular generation method using latent vector based generative adversarial network. J Cheminform. 2019; 11(1):1-13.
Schneider P, Walters WP, Plowright AT, Sieroka N, Listgarten J, Goodnow Jr RA, et al. Rethinking drug design in the artificial intelligence era. Nat Rev Drug Discov. 2020; 19(5):353-64.
Kolluri S, Lin J, Liu R, Zhang Y, Zhang W. Machine Learning and Artificial Intelligence in Pharmaceutical Research and Development: a Review. AAPS J. 2022; 24(1):19.
Beck BR, Shin B, Choi Y, Park S, Kang K. Predicting commercially available antiviral drugs that may act on the novel coronavirus (SARS-CoV-2) through a drug-target interaction deep learning model. Comput Struct Biotechnol J. 2020; 18:784-90.
Cheng F, Kovács IA, Barabási A-L. Network-based prediction of drug combinations. Nat Commun. 2019; 10(1):1197.
Dey S, Luo H, Fokoue A, Hu J, Zhang P. Predicting adverse drug reactions through interpretable deep learning framework. BMC Bioinformatics. 2018; 19(21):1-13.
Parvathaneni M, Awol AK, Kumari M, Lan K, Lingam M. Application of Artificial Intelligence and Machine Learning in Drug Discovery and Development. J Drug Deliv Ther. 2023; 13(1):151-8.
Kendrew JC. The three-dimensional structure of a protein molecule. Sci Am. 1961; 205(6):96-111.
Dill KA, Ozkan SB, Shell MS, Weikl TR. The Protein Folding Problem. Annu Rev Biophys. 2008; 37(1):289-316.
Service RF. 'The game has changed.'AI triumphs at protein folding. American Association for the Advancement of Science; 2020.
Al-Janabi A. Has DeepMind's AlphaFold solved the protein folding problem? : Future Science; 2022.
Pereira J, Simpkin AJ, Hartmann MD, Rigden DJ, Keegan RM, Lupas AN. High?accuracy protein structure prediction in CASP14. Proteins. 2021; 89(12):1687-99.
Senior AW, Evans R, Jumper J, Kirkpatrick J, Sifre L, Green T, et al. Improved protein structure prediction using potentials from deep learning. Nature. 2020; 577(7792):706-10.
Varadi M, Anyango S, Deshpande M, Nair S, Natassia C, Yordanova G, et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022; 50(D1):D439-D44.
Mariani V, Biasini M, Barbato A, Schwede T. lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests. Bioinformatics. 2013; 29(21):2722-8.
Baek M, DiMaio F, Anishchenko I, Dauparas J, Ovchinnikov S, Lee GR, et al. Accurate prediction of protein structures and interactions using a three-track neural network. Sci. 2021; 373(6557):871-6.
Pandey D, Onkara Perumal P. A scoping review on deep learning for next-generation RNA-Seq. data analysis. Funct Integr Genomics. 2023; 23(2):134.
Mathema VB, Sen P, Lamichhane S, Oreši? M, Khoomrung S. Deep learning facilitates multi-data type analysis and predictive biomarker discovery in cancer precision medicine. Comput Struct Biotechnol. 2023.
Schmidt B, Hildebrandt A. Deep learning in next-generation sequencing. Drug Discov Today. 2021; 26(1):173-80.
Corsaro C, Vasi S, Neri F, Mezzasalma AM, Neri G, Fazio E. NMR in Metabolomics: From Conventional Statistics to Machine Learning and Neural Network Approaches. Appl Sci. 2022; 12(6):2824.
Wei W, Liao Y, Wang Y, Wang S, Du W, Lu H, et al. Deep Learning-Based Method for Compound Identification in NMR Spectra of Mixtures. Molecules. 2022; 27(12).
Hennessy A, Clarke K, Lewis M. Generative Adversarial Network Synthesis of Hyperspectral Vegetation Data. Remote Sens. 2021; 13(12):2243.
Cox J. Prediction of peptide mass spectral libraries with machine learning. Nat Biotechnol. 2023; 41(1):33-43.
Chen C, Hou J, Tanner JJ, Cheng J. Bioinformatics methods for mass spectrometry-based proteomics data analysis. Int J Mol Sci. 2020; 21(8):2873.
Helmy M, Smith D, Selvarajoo K. Systems biology approaches integrated with artificial intelligence for optimized metabolic engineering. Metab Eng Commun. 2020; 11:e00149.
You Y, Lai X, Pan Y, Zheng H, Vera J, Liu S, et al. Artificial intelligence in cancer target identification and drug discovery. Signal Transduct Target Ther. 2022; 7(1):156.
Kaur C, Garg U. Artificial intelligence techniques for cancer detection in medical image processing: A review. Mater Today Proc. 2021.
Keenan TD, Clemons TE, Domalpally A, Elman MJ, Havilio M, Agrón E, et al. Retinal specialist versus artificial intelligence detection of retinal fluid from OCT: age-related eye disease study 2: 10-year follow-on study. Ophthalmology. 2021; 128(1):100-9.
Belkacem AN, Ouhbi S, Lakas A, Benkhelifa E, Chen C. End-to-end AI-based point-of-care diagnosis system for classifying respiratory illnesses and early detection of COVID-19: a theoretical framework. Front Med. 2021; 8:585578.
Zhang H, Zheng J. The Application Analysis of Medical Chatbots and Virtual Assistant. Front Soc Sci Technol. 2021; 3:11-6.
Magrabi F, Ammenwerth E, McNair JB, De Keizer NF, Hyppönen H, Nykänen P, et al. Artificial intelligence in clinical decision support: challenges for evaluating AI and practical implications. Yearb Med Inform. 2019; 28(01):128-34.
Noorbakhsh-Sabet N, Zand R, Zhang Y, Abedi V. Artificial intelligence transforms the future of health care. Am J Med. 2019; 132(7):795-801.
John-Mathews J-M. Some critical and ethical perspectives on the empirical turn of AI interpretability. Technol Forecast Soc Change. 2022; 174:121209.
Ying X, editor An overview of overfitting and its solutions. Journal of physics: Conference series; 2019: IOP Publishing.
Lo Piano S. Ethical principles in machine learning and artificial intelligence: cases from the field and possible ways forward. Hum Soc Sci Commun. 2020; 7(1):1-7.
Chesterman S. Artificial intelligence and the problem of autonomy. Notre Dame J Emerg Tech. 2020; 1:210.
Lentzos F. AI and Biological Weapons. Armament, Arms Control and Artificial Intelligence: The Janus-faced Nature of Machine Learning in the Military Realm: Springer; 2022. p. 91-100.
Hassoun S, Jefferson F, Shi X, Stucky B, Wang J, Rosa Jr E. Artificial intelligence for biology. Integr Comp Biol. 2021; 61(6):2267-75.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Journal of Biochemistry, Microbiology and Biotechnology

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).