DOI | Resolve DOI: https://doi.org/10.1504/IJDMB.2021.116881 |
---|
Author | Search for: Dagasso, Gabrielle; Search for: Yan, N.A.; Search for: Wang, Lipu; Search for: Li, Longhai; Search for: Kutcher, Randy; Search for: Zhang, Wentao1; Search for: Jin, Lingling |
---|
Affiliation | - National Research Council of Canada. Aquatic and Crop Resource Development
|
---|
Format | Text, Article |
---|
Subject | genome-wide association studies; machine learning; population structure analysis; cross-validation; LASSO; fusarium head blight |
---|
Abstract | Genome-Wide Association Studies (GWAS) has demonstrated its power in discovering genetic variations to particular traits related to agronomically important features in crops. The typical output of a GWAS program includes a series of Single Nucleotide Polymorphisms (SNPs) and their significance. Currently, there is no standard way to compare results across different programs or to select the most ‘significant’ results uniformly and consistently. To obtain a comprehensive and accurate set of SNPs associated with a trait of interest, we present a novel automated pipeline that leverages machine learning for GWAS discoveries. The pipeline first performs population structure analysis, then executes multiple GWAS software and combines their results into a single SNP set. After that, it selects SNPs from the set with high individual and/or joint effects with the Least Absolute Shrinkage and Selection Operator analysis. Finally, the predictivity of the model is assessed using cross-validation. |
---|
Publication date | 2021 |
---|
Publisher | Inderscience |
---|
In | |
---|
Language | English |
---|
Peer reviewed | Yes |
---|
Export citation | Export as RIS |
---|
Report a correction | Report a correction (opens in a new tab) |
---|
Record identifier | 45ec777b-155c-4950-961f-c86d97b9bb25 |
---|
Record created | 2023-01-23 |
---|
Record modified | 2023-01-23 |
---|