Abstract
This study aimed to investigate the effects of incidence rate, heritability, and polygenic variance on the statistical power of genome-wide association studies (GWAS) for threshold traits. Different incidence rates of threshold trait (1, 3, 5, 10, 25, 40, 50, 60, 75 and 90%), heritability (10 and 25%), and polygenic variance ratio (0 and 25%) were simulated separately for common (MAF ≥ 0.05), low-frequency (0.05 > MAF ≥ 0.01), and rare (MAF < 0.01) variants. Association studies were performed by logistic and linear mixed models. The highest statistical powers were observed in common and low-frequency variants with an incidence of 25–50% and 10–40%, respectively, but for rare variants, the highest statistical power was observed at low incidence. For all causal variant frequencies, the estimated heritability decline with an increase in incidence rate. We found high statistical power for traits with high heritability. In contrast, those with a high polygenic variance ratio have lower statistical power to detect common causal variants using a linear mixed model. These results demonstrate that the incidence rate of threshold traits, heritability, and polygenic variance may affect the statistical power of GWAS. Therefore, it is recommended that the effect of incidence rate, heritability, and polygenic variance be considered in designing GWAS for threshold traits.
Similar content being viewed by others
Data availability
The authors affirm that all simulated data for confirming the conclusions of the current article are available from the corresponding author on reasonable request.
References
Abdoli R, Mirhoseini S, Ghavi Hossein-Zadeh N, Zamani P, Gondro C (2018) Genome-wide association study to identify genomic regions affecting prolificacy in Lori-Bakhtiari sheep. Anim Genet 49:488–491. https://doi.org/10.1111/age.12700.Epub
Behera SK (2007) Study of heritability of threshold character. MSc, Indian Agricultural Statistics Research
Bush WS, Moore JH (2012) Genome-wide association studies. PLoS Comput Biol 8:e1002822. https://doi.org/10.1371/journal.pcbi.1002822
Carvajal-Rodríguez A (2008) GENOMEPOP: a program to simulate genomes in populations. BMC Bioinform 9:223. https://doi.org/10.1186/1471-2105-9-223
Chung RH, Tsai WY, Hsieh CH, Hung KY, Hsiung CA, Hauser ER (2015) SeqSIMLA2: simulating correlated quantitative traits accounting for shared environmental effects in user-specified pedigree structure. Genet Epidemiol 39:20–24. https://doi.org/10.1002/gepi.21850
Damgaard LH, Korsgaard IR (2006) A bivariate quantitative genetic model for a threshold trait and a survival trait. Genet Sel Evol 38:565. https://doi.org/10.1186/1297-9686-38-6-565
Falconer D (1989) Introduction to quantitative genetics, 3rd edn. Longman Scientific & Technical, Harlow
Fang L, Sahana G, Su G, Yu Y, Zhang S, Lund MS, Sørensen P (2017) Integrating sequence-based GWAS and RNA-Seq provides novel insights into the genetic basis of mastitis and milk production in dairy cattle. Sci Rep 7:45560. https://doi.org/10.1038/srep45560
Gilmour A, Anderson R, Rae A (1985) The analysis of binomial data by a generalized linear mixed model. Biometrika 72:593–599. https://doi.org/10.1093/BIOMET/72.3.593
Hossein-Zadeh NG, Nejati-Javaremi A, Miraei-Ashtiani SR, Mehrabani-Yeganeh H (2007) Effect of the threshold nature of traits on heritability estimates obtained by linear model. Pakistan J Biol Sci 10:145–147. https://doi.org/10.3923/pjbs.2007.145.147
Khanzadeh H, Hossein-Zadeh NG, Ghovvati S (2021) Statistical power and heritability in whole-genome association studies for quantitative traits. Meta Gene 28:100869. https://doi.org/10.1016/j.mgene.2021.100869
Lai F-N et al (2016) Whole-genome scanning for the litter size trait associated genes and SNPs under selection in dairy goat (Capra hircus). Sci Rep 6:38096. https://doi.org/10.1038/srep38096
Lee SH, Wray NR, Goddard ME, Visscher PM (2011) Estimating missing heritability for disease from genome-wide association studies. Am J Hum Genet 88:294–305. https://doi.org/10.1016/j.ajhg.2011.02.002
Li Y, Levran O, Kim J, Zhang T, Chen X, Suo C (2019) Extreme sampling design in genetic association mapping of quantitative trait loci using balanced and unbalanced case-control samples. Sci Rep 9:1–9. https://doi.org/10.1038/s41598-019-51790-w
Lippert C, Listgarten J, Liu Y, Kadie CM, Davidson RI, Heckerman D (2011) FaST linear mixed models for genome-wide association studies. Nat Methods 8:833–835. https://doi.org/10.1038/nmeth.1681
Moorad JA, Linksvayer TA (2008) Levels of selection on threshold characters. Genetics 179:899–905. https://doi.org/10.1534/genetics.108.086959
Purcell S et al (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81:559–575. https://doi.org/10.1086/519795
Reddy MPL et al (2011) Association between type 1 diabetes and GWAS SNPs in the southeast US Caucasian population. Genes Immun 12:208. https://doi.org/10.1038/gene.2010.70.Epub
Roff DA, Stirling G, Fairbairn DJ (1997) The evolution of threshold traits: a quantitative genetic analysis of the physiological and life-history correlates of wing dimorphism in the sand cricket. Evolution 51:1910–1919. https://doi.org/10.1111/j.1558-5646.1997.tb05113.x
SAS Institute I (2003) SAS 9.1. 3 for Windows. SAS Institute Cary, NC.
Shin J, Lee C (2015) Statistical power for identifying nucleotide markers associated with quantitative traits in genome-wide association analysis using a mixed model. Genomics 105:1–4. https://doi.org/10.1016/j.ygeno.2014.11.001
Wang C et al (2013) Bayesian methods for estimating GEBVs of threshold traits. Heredity 110:213. https://doi.org/10.1038/hdy.2012.65
Wang C, Li X, Qian R, Su G, Zhang Q, Ding X (2017) Bayesian methods for jointly estimating genomic breeding values of one continuous and one threshold trait. PLoS ONE 12:e0175448. https://doi.org/10.1371/journal.pone.0175448
Xiang Y, Xiang X, Li Y (2020) Identifying rare variants for quantitative traits in extreme samples of population via Kullback-Leibler distance. BMC Genet 21:1–9. https://doi.org/10.1186/s12863-020-00951-2
Xu S-S et al (2018) Genome-wide association analyses highlight the potential for different genetic mechanisms for litter size among sheep breeds. Front Genet 9:118. https://doi.org/10.3389/fgene.2018.00118
Zhou X, Carbonetto P, Stephens M (2013) Polygenic modeling with bayesian sparse linear mixed models. PLoS Genet 9:e1003264. https://doi.org/10.1371/journal.pgen.1003264
Acknowledgements
The authors declare that they have no acknowledgments.
Funding
The authors did not receive support from any organization for the submitted work. The authors have no relevant financial or non-financial interests to disclose.
Author information
Authors and Affiliations
Contributions
HK simulated, analyzed, and interpreted the scenarios and wrote the manuscript. NGHZ gave scientific advice, and was a major contributor in planning and writing the manuscript. SG gave scientific advice. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Khanzadeh, H., Ghavi Hossein-Zadeh, N. & Ghovvati, S. The statistical power of genome-wide association studies for threshold traits with different frequencies of causal variants. Genetica 150, 51–57 (2022). https://doi.org/10.1007/s10709-021-00140-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10709-021-00140-8