Bioinformatics in maize genome research

Guo, Ling

Bioinformatics in maize genome research

dc.contributor.advisor	Patrick S. Schnable
dc.contributor.advisor	Daniel A. Ashlock
dc.contributor.author	Guo, Ling
dc.contributor.department	Theses & dissertations (Interdisciplinary)
dc.date	2018-08-22T22:52:58.000
dc.date.accessioned	2020-06-30T07:48:24Z
dc.date.available	2020-06-30T07:48:24Z
dc.date.copyright	Mon Jan 01 00:00:00 UTC 2007
dc.date.issued	2007-01-01
dc.description.abstract	<p>Motivation. Clustering has become an integral part of microarray data analysis and interpretation. It is helpful to reduce the scale of information generated by microarray experiment to the level that biologists can generate hypothesis. There is a danger that artifacts induced by clustering methods can cause misinterpretation of the data. Clustering method that can accurately capture the natural structure of the data would be a useful tool for biologists to discovery the biological meaning buried in the data. To this end, a new clustering algorithm, called K-means multiclustering, is introduced. The method can avoid the artifacts induced by distance or similarity metrics by amalgamating the results of many K-means clusterings. Results. The multiclustering algorithm is a model-free clustering method. It is found to be reliable and consist in capturing the underlying data structure with high accuracy that is competitive with model based clustering and superior to other methods on synthetic micorarry data generated in a manner consistent with the hypothesis of model based clustering. The algorithm has a high level of immunity to artifacts introduced by the metric used to measure the distance between data points. It can successfully cluster data sets which are designed to have different shapes and variation and cannot be correctly clustered by traditional clustering method. The cut plot computed by this method is a very simple and useful summary of the data structure. A detailed view of the formation of clustering can also be generated by the method to reveal the underlying hierarchical structure of data set.</p>
dc.format.mimetype	application/pdf
dc.identifier	archive/lib.dr.iastate.edu/rtd/15933/
dc.identifier.articleid	16932
dc.identifier.contextkey	7051648
dc.identifier.doi	https://doi.org/10.31274/rtd-180813-17131
dc.identifier.s3bucket	isulib-bepress-aws-west
dc.identifier.submissionpath	rtd/15933
dc.identifier.uri	https://dr.lib.iastate.edu/handle/20.500.12876/69616
dc.language.iso	en
dc.source.bitstream	archive/lib.dr.iastate.edu/rtd/15933/3274875.PDF\|\|\|Fri Jan 14 20:48:51 UTC 2022
dc.subject.disciplines	Bioinformatics
dc.subject.keywords	Genetics
dc.subject.keywords	development and cell biology;Bioinformatics and computational biology;
dc.title	Bioinformatics in maize genome research
dc.type	article
dc.type.genre	dissertation
dspace.entity.type	Publication
thesis.degree.discipline	Bioinformatics and Computational Biology
thesis.degree.level	dissertation
thesis.degree.name	Doctor of Philosophy

File

Original bundle

Now showing 1 - 1 of 1

Name:: 3274875.PDF
Size:: 3 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Theses and Dissertations