Updating Quality Scores During HMM-Based Correction of Illumina Next Generation Sequencing Data
dc.contributor.author | Zhang, Haijuan | |
dc.contributor.department | Statistics (LAS) | |
dc.contributor.majorProfessor | Karin Dorman | |
dc.date | 2021-06-11T15:19:58.000 | |
dc.date.accessioned | 2021-08-14T03:35:23Z | |
dc.date.available | 2021-08-14T03:35:23Z | |
dc.date.copyright | Fri Jan 01 00:00:00 UTC 2021 | |
dc.date.embargo | 2021-10-19 | |
dc.date.issued | 2021-01-01 | |
dc.description.abstract | Many error correction tools remove the base-calling errors made by Illumina technology, but most do not update the quality scores, even after correcting the errors. The quality score, an important metric that quantifies the trustworthiness of the corresponding base call, is used by many downstream sequence analysis tools. This research proposes a method to update the quality scores of corrected errors when using PREMIER, a fully probabilistic error correction method for Illumina sequencing data. I then test the updates to see whether the updated quality scores better reflect the actual probability of error in an Illumina dataset. | |
dc.format.mimetype | ||
dc.identifier | archive/lib.dr.iastate.edu/creativecomponents/829/ | |
dc.identifier.articleid | 1836 | |
dc.identifier.contextkey | 22558984 | |
dc.identifier.doi | https://doi.org/10.31274/cc-20240624-1537 | |
dc.identifier.s3bucket | isulib-bepress-aws-west | |
dc.identifier.submissionpath | creativecomponents/829 | |
dc.identifier.uri | https://dr.lib.iastate.edu/handle/20.500.12876/2vaZWG3r | |
dc.source.bitstream | archive/lib.dr.iastate.edu/creativecomponents/829/creativecomponent.pdf|||Sat Jan 15 02:09:05 UTC 2022 | |
dc.subject.disciplines | Bioinformatics | |
dc.subject.keywords | Next Generation Sequencing | |
dc.subject.keywords | Illumina | |
dc.subject.keywords | Probability of Error | |
dc.subject.keywords | Quality Score | |
dc.subject.keywords | Chi-Squared Test | |
dc.subject.keywords | Post-Hoc Analysis | |
dc.title | Updating Quality Scores During HMM-Based Correction of Illumina Next Generation Sequencing Data | |
dc.type | creative component | |
dc.type.genre | creative component | |
dspace.entity.type | Publication | |
relation.isOrgUnitOfPublication | 264904d9-9e66-4169-8e11-034e537ddbca | |
thesis.degree.discipline | Statistics | |
thesis.degree.level | creativecomponent |
File
Original bundle
- Name: creativecomponent.pdf
- Size: 298.99 KB
- Format: Adobe Portable Document Format