Probabilistic insertion, deletion and substitution error correction using Markov inference in next generation sequencing reads

dc.contributor.advisor Aditya Ramamoorthy
dc.contributor.author Noroozi, Vahid
dc.contributor.department Electrical and Computer Engineering
dc.date 2018-08-11T13:16:21.000
dc.date.accessioned 2020-06-30T03:01:24Z
dc.date.available 2020-06-30T03:01:24Z
dc.date.copyright Fri Jan 01 00:00:00 UTC 2016
dc.date.embargo 2001-01-01
dc.date.issued 2016-01-01
dc.description.abstract Error correction of noisy reads obtained from high-throughput DNA sequencers is an important problem, since read quality significantly affects downstream analyses such as the detection of genetic variation and the complexity and success of sequence assembly. Most current error correction algorithms can correct only substitution errors. This work presents Pindel, an algorithm that simultaneously corrects insertion, deletion and substitution errors in reads from next generation DNA sequencing platforms. Pindel models the sequencer output as emissions of an appropriately defined Hidden Markov Model (HMM) and corrects each read to its maximum likelihood path using a suitably modified Viterbi algorithm. When compared with Karect and Fiona, the top two current algorithms capable of correcting insertion, deletion and substitution errors, Pindel exhibits superior accuracy across a range of datasets.
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/etd/15097/
dc.identifier.articleid 6104
dc.identifier.contextkey 8882691
dc.identifier.doi https://doi.org/10.31274/etd-180810-4702
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath etd/15097
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/29281
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/etd/15097/0-premierindel_master_9d9ec960e35b2ed7709b3c59730861064ca55372.zip|||Fri Jan 14 20:35:34 UTC 2022
dc.source.bitstream archive/lib.dr.iastate.edu/etd/15097/Noroozi_iastate_0097M_15592.pdf|||Fri Jan 14 20:35:36 UTC 2022
dc.subject.disciplines Bioinformatics
dc.subject.disciplines Computer Sciences
dc.subject.disciplines Electrical and Electronics
dc.subject.keywords Electrical Engineering
dc.subject.keywords DNA Sequencing
dc.subject.keywords Error Correction
dc.subject.keywords Hidden Markov Model
dc.subject.keywords Insertion and deletion
dc.subject.keywords Next Generation Sequencing
dc.subject.keywords Probabilistic Modeling
dc.supplemental.bitstream premierindel_master_9d9ec960e35b2ed7709b3c59730861064ca55372.zip
dc.title Probabilistic insertion, deletion and substitution error correction using Markov inference in next generation sequencing reads
dc.type article
dc.type.genre thesis
dspace.entity.type Publication
relation.isOrgUnitOfPublication a75a044c-d11e-44cd-af4f-dab1d83339ff
thesis.degree.discipline Electrical Engineering
thesis.degree.level thesis
thesis.degree.name Master of Science
File
Original bundle
Name: Noroozi_iastate_0097M_15592.pdf
Size: 589.5 KB
Format: Adobe Portable Document Format

Name: 0-premierindel_master_9d9ec960e35b2ed7709b3c59730861064ca55372.zip
Size: 74.16 KB
Format: Unknown data format