Copy number variation detection using next generation sequencing read counts Wang, Heng Nettleton, Dan Nettleton, Dan Ying, Kai
dc.contributor.department Statistics 2019-08-22T08:01:36.000 2020-07-02T06:57:06Z 2020-07-02T06:57:06Z 2014-01-01
dc.description.abstract <p>Background: A copy number variation (CNV) is a difference between genotypes in the number of copies of a genomic region. Next generation sequencing (NGS) technologies provide sensitive and accurate tools for detecting genomic variations that include CNVs. However, statistical approaches for CNV identification using NGS are limited. We propose a new methodology for detecting CNVs using NGS data. This method (henceforth denoted by m-HMM) is based on a hidden Markov model with emission probabilities that are governed by mixture distributions. We use the Expectation-Maximization (EM) algorithm to estimate the parameters in the model.</p> <p>Results: A simulation study demonstrates that our proposed m-HMM approach has greater power for detecting copy number gains and losses relative to existing methods. Furthermore, application of our m-HMM to DNA sequencing data from the two maize inbred lines B73 and Mo17 to identify CNVs that may play a role in creating phenotypic differences between these inbred lines provides results concordant with previous array-based efforts to identify CNVs.</p> <p>Conclusions: The new m-HMM method is a powerful and practical approach for identifying CNVs from NGS data.</p>
dc.description.comments <p>This article is published as Wang, Heng, Dan Nettleton, and Kai Ying. "Copy number variation detection using next generation sequencing read counts." <em>BMC bioinformatics</em> 15 (2014): 109. doi: <a href="">10.1186/1471-2105-15-109</a>.</p>
dc.format.mimetype application/pdf
dc.identifier archive/
dc.identifier.articleid 1194
dc.identifier.contextkey 14814671
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath stat_las_pubs/192
dc.language.iso en
dc.source.bitstream archive/|||Fri Jan 14 21:53:32 UTC 2022
dc.source.uri 10.1186/1471-2105-15-109
dc.subject.disciplines Bioinformatics
dc.subject.disciplines Genetics and Genomics
dc.subject.disciplines Plant Breeding and Genetics
dc.subject.disciplines Statistical Models
dc.subject.keywords Count data
dc.subject.keywords Gamma-Poisson mixture
dc.subject.keywords Hidden Markov model
dc.subject.keywords Plant genomics
dc.subject.keywords Poisson mixture model
dc.title Copy number variation detection using next generation sequencing read counts
dc.type article
dc.type.genre article
dspace.entity.type Publication
relation.isAuthorOfPublication 7d86677d-f28f-4ab1-8cf7-70378992f75b
relation.isOrgUnitOfPublication 264904d9-9e66-4169-8e11-034e537ddbca
Original bundle
Now showing 1 - 1 of 1
1.38 MB
Adobe Portable Document Format