Genome-wide prediction of breeding values and mapping of quantitative trait loci in stratified and admixed populations

ShaarbafToosi, Ali

Genome-wide prediction of breeding values and mapping of quantitative trait loci in stratified and admixed populations

dc.contributor.advisor	Rohan L. Fernando
dc.contributor.author	ShaarbafToosi, Ali
dc.contributor.department	Animal Science
dc.date	2018-08-11T09:09:11.000
dc.date.accessioned	2020-06-30T02:44:57Z
dc.date.available	2020-06-30T02:44:57Z
dc.date.copyright	Sun Jan 01 00:00:00 UTC 2012
dc.date.embargo	2013-06-05
dc.date.issued	2012-01-01
dc.description.abstract	<p>Ideally genome-wide association studies require homogenous samples originating from randomly mating populations with minimal pedigree relationship. However, in reality such samples are very hard to collect. Non-random mating combined with artificial selection has created complex pattern of population structure and relationship in commercial crop and livestock populations. This requires proper modeling of population structure and kinship a necessary step of all genome-wide association studies. Otherwise, the risk of both false-positives (declaring a marker as significant without it be linked to a QTL) and false-negatives (markers linked to a QTL declared as non-significant) increases dramatically.</p> <p>In this thesis, we first applied genomic selection (GS) approach to develop equations for prediction of breeding values of purebred candidates based on a model trained on an admixed or crossbred population. In this approach all markers effects are treated as random and are fitted simultaneously. It was hypothesized that given a high-density marker data and using the GS approach; training in a crossbred or admixed population could be as accurate as training in a purebred population that is the target of selection. In a stochastic simulation study, it was shown that both crossbred and admixed populations could predict breeding values of a purebred population, without the need for explicitly modeling of breed composition and pedigree relationship. However, accuracy of GS was greatly reduced when genes from the target pure breed were not included in the admixed or crossbred training population. In addition, it was shown that the accuracy of GS depends on the genetic distance between the training and validation population, the closer the relationship between the two the higher was the prediction accuracy. Further, increasing of marker density improved the accuracy of prediction especially when a crossbred population has been used as the training dataset. Considering haplotypes with weak linkage disequilibrium (LD), the crossbreds showed extensive LD, whereas the LD in the purebreds was confined to smaller segments. In contrast, examination of the length of haplotypes with strong LD indicated that these haplotypes are much shorter in crossbreds than that in purebreds. Our results showed that in crossbred populations the number of haplotypes with strong LD is less than that in the purebred populations. The findings of this research suggested that the crossbred populations are more suitable for QTL fine mapping than the purebreds.</p> <p>In addition, in another simulation study we compared power, false-positive rate, accuracy and positive predictive value of QTL mapping in an admixed population with and without modeling of breed composition. The performance of ordinary least square (OLS) and mixed model methods (MLM), both fitting one-marker-at-a-time, were compared to that of a Bayesian multiple-regression (BMR) method that fitted all markers simultaneously. The OLS method showed the highest rate of false-positives due to ignoring breed composition and pedigree relationship. The MLM approach showed spurious false-positives when breed composition was not accounted for. The BMR outperformed both OLS and MLM approaches. It was shown that BMR could mitigate the confounding effects of breed composition and relationship without compromising its power. In contrast to the MLM where fitting of breed composition reduced both its power and false-positive rates, when breed composition was considered in the BMR it resulted in loss of power without a change of false-positive rate. It was concluded that the BMR is able to self-correct for the effects of population structure and relatedness.</p>
dc.format.mimetype	application/pdf
dc.identifier	archive/lib.dr.iastate.edu/etd/12756/
dc.identifier.articleid	3763
dc.identifier.contextkey	4186522
dc.identifier.doi	https://doi.org/10.31274/etd-180810-2228
dc.identifier.s3bucket	isulib-bepress-aws-west
dc.identifier.submissionpath	etd/12756
dc.identifier.uri	https://dr.lib.iastate.edu/handle/20.500.12876/26945
dc.language.iso	en
dc.source.bitstream	archive/lib.dr.iastate.edu/etd/12756/ShaarbafToosi_iastate_0097E_13016.pdf\|\|\|Fri Jan 14 19:29:09 UTC 2022
dc.subject.disciplines	Genetics
dc.subject.keywords	Admixed populations
dc.subject.keywords	Genome-wide association study
dc.subject.keywords	Genomic selection
dc.subject.keywords	QTL mapping
dc.title	Genome-wide prediction of breeding values and mapping of quantitative trait loci in stratified and admixed populations
dc.type	article
dc.type.genre	dissertation
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	85ecce08-311a-441b-9c4d-ee2a3569506f
thesis.degree.level	dissertation
thesis.degree.name	Doctor of Philosophy

File

Original bundle

Now showing 1 - 1 of 1

Name:: ShaarbafToosi_iastate_0097E_13016.pdf
Size:: 4.55 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Theses and Dissertations