Updating Quality Scores During HMM-Based Correction of Illumina Next Generation Sequencing Data

dc.contributor.author Zhang, Haijuan
dc.contributor.department Statistics (LAS)
dc.contributor.majorProfessor Karin Dorman
dc.date 2021-06-11T15:19:58.000
dc.date.accessioned 2021-08-14T03:35:23Z
dc.date.available 2021-08-14T03:35:23Z
dc.date.copyright Fri Jan 01 00:00:00 UTC 2021
dc.date.embargo 2021-10-19
dc.date.issued 2021-01-01
dc.description.abstract <p>There are many error correction tools to remove the base calling errors made by Illumina technology, but most do not update the quality scores, even after correcting the errors. The quality score is an important metric quantifying the trustworthiness of the corresponding base call that is used by many downstream sequence analysis tools. This research proposes a method to update quality scores of corrected errors when using PREMIER, a fully-probabilistic error correction method for Illumina sequencing data. I then test the quality of the updates to see if the updated quality scores better reflect the actual probability of error in an Illumina dataset.</p>
dc.format.mimetype PDF
dc.identifier archive/lib.dr.iastate.edu/creativecomponents/829/
dc.identifier.articleid 1836
dc.identifier.contextkey 22558984
dc.identifier.doi https://doi.org/10.31274/cc-20240624-1537
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath creativecomponents/829
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/2vaZWG3r
dc.source.bitstream archive/lib.dr.iastate.edu/creativecomponents/829/creativecomponent.pdf|||Sat Jan 15 02:09:05 UTC 2022
dc.subject.disciplines Bioinformatics
dc.subject.keywords Next Generation Sequencing
dc.subject.keywords Illumina
dc.subject.keywords Probability of Error
dc.subject.keywords Quality Score
dc.subject.keywords Chi-Squared Test
dc.subject.keywords Post-Hoc Analysis
dc.title Updating Quality Scores During HMM-Based Correction of Illumina Next Generation Sequencing Data
dc.type creative component
dc.type.genre creative component
dspace.entity.type Publication
relation.isOrgUnitOfPublication 264904d9-9e66-4169-8e11-034e537ddbca
thesis.degree.discipline Statistics
thesis.degree.level creativecomponent
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
creativecomponent.pdf
Size:
298.99 KB
Format:
Adobe Portable Document Format
Description: