Prediction of protein secondary structure by mining structural fragment database Cheng, Haitao Jernigan, Robert Sen, Taner Kloczkowski, Andrzej Margaritis, Dimitris Jernigan, Robert
dc.contributor.department Biochemistry, Biophysics and Molecular Biology
dc.contributor.department Computer Science
dc.contributor.department Computer Science 2018-02-19T04:20:36.000 2020-06-29T23:46:10Z 2020-06-29T23:46:10Z Sat Jan 01 00:00:00 UTC 2005 2005-05-26
dc.description.abstract <p>A new method for predicting protein secondary structure from amino acid sequence has been developed. The method is based on multiple sequence alignment of the query sequence with all other sequences with known structure from the protein data bank (PDB) by using BLAST. The fragments of the alignments belonging to proteins from the PBD are then used for further analysis. We have studied various schemes of assigning weights for matching segments and calculated normalized scores to predict one of the three secondary structures: α-helix, β-sheet, or coil. We applied several artificial intelligence techniques: decision trees (DT), neural networks (NN) and support vector machines (SVM) to improve the accuracy of predictions and found that SVM gave the best performance. Preliminary data show that combining the fragment mining approach with GOR V (Kloczkowski et al, Proteins 49 (2002) 154–166) for regions of low sequence similarity improves the prediction accuracy.</p>
dc.description.comments <p>This is a manuscript of an article published as Cheng, Haitao, Taner Z. Sen, Andrzej Kloczkowski, Dimitris Margaritis, and Robert L. Jernigan. "Prediction of protein secondary structure by mining structural fragment database." Polymer 46, no. 12 (2005): 4314-4321. doi: <a href="" target="_blank">10.1016/j.polymer.2005.02.040</a>. Posted with permission.</p>
dc.format.mimetype application/pdf
dc.identifier archive/
dc.identifier.articleid 1179
dc.identifier.contextkey 11160621
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath bbmb_ag_pubs/171
dc.language.iso en
dc.source.bitstream archive/|||Fri Jan 14 21:16:11 UTC 2022
dc.source.uri 10.1016/j.polymer.2005.02.040
dc.subject.disciplines Biochemistry, Biophysics, and Structural Biology
dc.subject.disciplines Bioinformatics
dc.subject.disciplines Computer Sciences
dc.subject.disciplines Molecular Biology
dc.subject.keywords Secondary structure
dc.subject.keywords Sequence
dc.subject.keywords Cut-off
dc.title Prediction of protein secondary structure by mining structural fragment database
dc.type article
dc.type.genre article
dspace.entity.type Publication
relation.isAuthorOfPublication 50d10ea7-68f5-4cc5-8858-375cef177ed2
relation.isOrgUnitOfPublication c70f85ae-e0cd-4dce-96b5-4388aac08b3f
relation.isOrgUnitOfPublication f7be4eb9-d1d0-4081-859b-b15cee251456
Original bundle
Now showing 1 - 1 of 1
945.51 KB
Adobe Portable Document Format