Initializing Partition-Optimization Algorithms

Date
2009-01-01
Authors
Maitra, Ranjan
Department
Statistics
Abstract

Clustering data sets is a challenging problem that arises in a wide array of applications. Partition-optimization approaches, such as the k-means or expectation-maximization (EM) algorithms, are suboptimal: they find solutions in the vicinity of their initialization. This paper proposes a staged approach to specifying initial values by finding a large number of local modes and then obtaining representatives from the most separated ones. Results on test experiments are excellent. We also provide a detailed comparative assessment of the suggested algorithm against many initialization approaches commonly used in the literature. Finally, the methodology is applied to two data sets, one on diurnal microarray gene expressions and one on industrial releases of mercury.
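
The two-stage idea in the abstract can be illustrated with a rough sketch. This is not the paper's algorithm: the abstract does not say how local modes are found or how separation is measured, so the sketch assumes that stage one over-clusters the data with k-means from `n_modes` random starts, and that stage two picks `k` well-separated modes with a greedy maximin rule. The names `staged_init`, `kmeans`, `sq_dist`, and `n_modes` are hypothetical and chosen only for illustration.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, centers, iters=100):
    """Lloyd's k-means from the given starting centers.

    Returns (centers, labels). The result depends on the starting centers,
    which is why the choice of initial values matters.
    """
    centers = [tuple(c) for c in centers]
    labels = None
    for _ in range(iters):
        new_labels = [
            min(range(len(centers)), key=lambda j: sq_dist(p, centers[j]))
            for p in points
        ]
        if new_labels == labels:  # assignments are stable: converged
            break
        labels = new_labels
        for j in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return centers, labels


def staged_init(points, k, n_modes, rng):
    """Return k starting centers chosen from many candidate modes.

    Stage 1 (assumption): over-cluster with n_modes >= k random starts and
    treat the resulting non-empty cluster centers as candidate local modes.
    Stage 2 (assumption): greedily keep the k most mutually separated modes.
    """
    starts = rng.sample(points, n_modes)
    modes, labels = kmeans(points, starts)
    sizes = [labels.count(j) for j in range(len(modes))]
    candidates = [m for m, s in zip(modes, sizes) if s > 0]

    # Begin with the mode that owns the most points, then repeatedly add the
    # candidate whose nearest already-chosen mode is farthest away (maximin).
    chosen = [max(zip(sizes, modes))[1]]
    while len(chosen) < k:
        nxt = max(candidates, key=lambda m: min(sq_dist(m, c) for c in chosen))
        chosen.append(nxt)
    return chosen


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic data (illustrative only): three Gaussian blobs in the plane.
    blobs = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
    data = [(rng.gauss(cx, 0.5), rng.gauss(cy, 0.5))
            for cx, cy in blobs for _ in range(50)]
    init = staged_init(data, k=3, n_modes=12, rng=rng)
    centers, labels = kmeans(data, init)
    print(centers)
```

The final `kmeans` call starts from the selected modes rather than from random points, so each start is more likely to sit in a distinct region of the data. Other mode finders or selection rules could be swapped in without changing the overall structure.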

Comments

This is a manuscript of an article from IEEE/ACM Transactions on Computational Biology and Bioinformatics 6 (2009): 144, doi: 10.1109/TCBB.2007.70244. Posted with permission. Copyright 2009 IEEE.

Copyright
2009