Machine learning-driven polygenic risk scores for bipolar disorder, depression, and panic disorder

https://doi.org/10.55214/25768484.v8i6.2338

Authors

  • Sara Benoumhani AI Research Center, College of Engineering, Alfaisal University, Riyadh, Saudi Arabia, and Software Engineering Department, College of Engineering, Alfaisal University, Riyadh, Saudi Arabia
  • Saima Jabeen AI Research Center, College of Engineering, Alfaisal University, Riyadh, Saudi Arabia, and Software Engineering Department, College of Engineering, Alfaisal University, Riyadh, Saudi Arabia
  • Mariam M AlEissa AI Research Center, College of Engineering, Alfaisal University, Riyadh, Saudi Arabia, and Molecular Genetics Laboratory, Public Health Authority, Riyadh, Saudi Arabia, and College of Medicine, Alfaisal University, Riyadh, Saudi Arabia

Polygenic Risk Score (PRS) is a computational tech- nique that uses various genomic data to simultaneously analyze an individ- ual’s genetic risk for particular illnesses or traits. However, the traditional PRS computation has a few weaknesses, including its limited capacity to account for just a portion of trait variance, susceptibility to overfitting, and insufficient ability to discriminate among the larger population. Machine Learning (ML) methods offer a promising alternative to the traditional method by avoiding the problem of overfitting and improving accuracy. This study aims to develop an ML model for improved PRS calculation. We used the summary statistics for three mentals diseases, bipolar, depression, and panic disorder, from the Psychiatric Genomics Consortium (PGC) as a disease reference. We also obtained actual genotype data of individuals from OpenSNP, which includes both case and control samples. This data is used for predicting scores. The suggested approach, called Polygenic Risk Score Neural Network (PRSNN), calculates the PRS using weight vectors that estimate the relevance of each single nucleotide polymorphism (SNP) with a particular phenotype by deep learning model as an alternative to the traditional method. This study aims to develop a machine learning model, called PRSNN, for improved calculation of Polygenic Risk Scores (PRS). The PRSNN method outperforms the conventional method in identifying individuals at risk of mental disease. A novel deep-learning approach, named as PRSNN, is proposed for generating PRSs. The results demonstrate that it outperforms the traditional method of computing PRS for complex diseases. Further upgrades for this tool are required to overcome the current limitations, including lack of validation with external data from different ancestries, which may limit the applicability of the PRSNN method across diverse populations, and the small sample size, which may affect the results.

Section

How to Cite

Benoumhani, S. ., Jabeen, S. ., & AlEissa, M. M. . (2024). Machine learning-driven polygenic risk scores for bipolar disorder, depression, and panic disorder. Edelweiss Applied Science and Technology, 8(6), 1758–1773. https://doi.org/10.55214/25768484.v8i6.2338

Downloads

Download data is not yet available.

Dimension Badge

Download

Downloads

Issue

Section

Articles

Published

2024-10-15