Publication detail
SCHWARZEROVÁ, J. HURTA, M. WECKWERTH, W. WALTHER, D.
Original Title
Decoding the Hidden Secrets of SNP Data: Revealing Ancestral Origins, Genomic Predictions, and Polygenic Risk Score
Type
abstract
Language
English
Original Abstract
The study of Single Nucleotide Polymorphism (SNP) data provides a gateway to insights into ancestral origins, genomic predictions, and polygenic risk scores. We explored SNP data using two distinct datasets available for the plant Arabidopsis thaliana. First, we used a constructed artificial dataset to simulate and analyse mutation increments over multiple generations. This artificial dataset allowed us to investigate the dynamics of genetic variation and its implications for ancestral lineage tracing by Principal Component Analysis (PCA). Second, we incorporated real data comprising 27,081 non-redundant SNPs. With this dataset, we explored the genetic landscape of Arabidopsis thaliana to reveal details about population structure, genetic diversity, and the potential functional implications of identified SNPs in relation to various metabolites. We developed PGine, a Polygenic Risk Score (PRS) tool implemented in Python and provided as a Py/Bioconda software package, for calculating polygenic risk scores in plants; it may be useful for new breeding strategies. Subsequently, we integrated diverse computational approaches to build Genomic Prediction models. Our study reveals information embedded in SNP data, improving our understanding of genetic variation in general and its implications for fields including predictive modelling, population genetics and evolutionary biology.
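As a rough illustration of what a polygenic risk score computes, the sketch below uses the common additive formulation: for each accession, effect-allele dosages are multiplied by per-SNP effect sizes and summed. This is a minimal sketch, not PGine's interface; the function name `polygenic_score`, the SNP identifiers, the accession names, and the effect sizes are all hypothetical and chosen only for illustration.

```python
from typing import Dict


def polygenic_score(dosages: Dict[str, int], weights: Dict[str, float]) -> float:
    """Additive score: sum over SNPs of effect size times effect-allele dosage."""
    # Assumption: SNPs without a genotype call are skipped rather than imputed.
    return sum(weights[snp] * dosages[snp] for snp in weights if snp in dosages)


# Hypothetical per-SNP effect sizes (e.g. taken from an association study).
weights = {"snp_1": 0.5, "snp_2": -0.25, "snp_3": 1.0}

# Hypothetical genotypes coded as effect-allele counts (0, 1 or 2).
accessions = {
    "acc_A": {"snp_1": 2, "snp_2": 0, "snp_3": 0},
    "acc_B": {"snp_1": 0, "snp_2": 2, "snp_3": 2},
}

scores = {name: polygenic_score(geno, weights) for name, geno in accessions.items()}
print(scores)
```

Skipping uncalled SNPs keeps the sketch short; a real pipeline would more likely impute missing genotypes or normalise by the number of SNPs actually scored.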
Keywords
Arabidopsis thaliana, Single Nucleotide Polymorphisms (SNPs), Machine learning, Genetic Variation
Authors
SCHWARZEROVÁ, J.; HURTA, M.; WECKWERTH, W.; WALTHER, D.
Released
12. 9. 2023
Location
Germany
Pages count
1
BibTex
@misc{BUT184936,
  author="Jana {Schwarzerová} and Martin {Hurta} and Wolfram {Weckwerth} and Dirk {Walther}",
  title="Decoding the Hidden Secrets of SNP Data: Revealing Ancestral Origins, Genomic Predictions, and Polygenic Risk Score",
  year="2023",
  pages="1",
  address="Germany",
  note="abstract"
}