Merging K‐means with hierarchical clustering for identifying general‐shaped groups

dc.contributor.author Peterson, Anna
dc.contributor.author Ghosh, Arka
dc.contributor.author Maitra, Ranjan
dc.contributor.department Statistics
dc.date 2019-06-27T09:40:37.000
dc.date.accessioned 2020-07-02T06:56:55Z
dc.date.available 2020-07-02T06:56:55Z
dc.date.copyright Mon Jan 01 00:00:00 UTC 2018
dc.date.issued 2018-01-01
dc.description.abstract <p>Clustering partitions a dataset such that observations placed together in a group are similar but different from those in other groups. Hierarchical and <em>K</em>‐means clustering are two approaches but have different strengths and weaknesses. For instance, hierarchical clustering identifies groups in a tree‐like structure but suffers from computational complexity in large datasets, while <em>K</em>‐means clustering is efficient but designed to identify homogeneous spherically shaped clusters. We present a hybrid non‐parametric clustering approach that amalgamates the two methods to identify general‐shaped clusters and that can be applied to larger datasets. Specifically, we first partition the dataset into spherical groups using <em>K</em>‐means. We next merge these groups using hierarchical methods with a data‐driven distance measure as a stopping criterion. Our proposal has the potential to reveal groups with general shapes and structure in a dataset. We demonstrate good performance on several simulated and real datasets.</p>
dc.description.comments <p>This is the peer-reviewed version of the following article: Peterson, Anna D., Arka P. Ghosh, and Ranjan Maitra. "Merging K‐means with hierarchical clustering for identifying general‐shaped groups." <em>Stat</em> 7, no. 1 (2018): e172, which has been published in final form at DOI: <a href="http://dx.doi.org/10.1002/sta4.172" target="_blank">10.1002/sta4.172</a>. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving. </p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/stat_las_pubs/165/
dc.identifier.articleid 1170
dc.identifier.contextkey 14423481
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath stat_las_pubs/165
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/90471
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/stat_las_pubs/165/2018_MaitraRanjan_MergingKmeans.pdf|||Fri Jan 14 21:01:26 UTC 2022
dc.source.uri 10.1002/sta4.172
dc.subject.disciplines Categorical Data Analysis
dc.subject.disciplines Statistics and Probability
dc.subject.keywords complete linkage
dc.subject.keywords distance measure
dc.subject.keywords hierarchical clustering
dc.subject.keywords K‐means algorithm
dc.subject.keywords single linkage
dc.title Merging K‐means with hierarchical clustering for identifying general‐shaped groups
dc.type article
dc.type.genre article
dspace.entity.type Publication
relation.isOrgUnitOfPublication 264904d9-9e66-4169-8e11-034e537ddbca
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
2018_MaitraRanjan_MergingKmeans.pdf
Size:
7.24 MB
Format:
Adobe Portable Document Format
Description:
Collections