Clustering with semi metrics

Thumbnail Image
Date
2001-01-01
Authors
Koul, Neeraj
Major Professor
Advisor
Committee Member
Journal Title
Journal ISSN
Volume Title
Publisher
Altmetrics
Authors
Research Projects
Organizational Units
Organizational Unit
Computer Science

Computer Science—the theory, representation, processing, communication and use of information—is fundamentally transforming every aspect of human endeavor. The Department of Computer Science at Iowa State University advances computational and information sciences through; 1. educational and research programs within and beyond the university; 2. active engagement to help define national and international research, and 3. educational agendas, and sustained commitment to graduating leaders for academia, industry and government.

History
The Computer Science Department was officially established in 1969, with Robert Stewart serving as the founding Department Chair. Faculty were composed of joint appointments with Mathematics, Statistics, and Electrical Engineering. In 1969, the building which now houses the Computer Science department, then simply called the Computer Science building, was completed. Later it was named Atanasoff Hall. Throughout the 1980s to present, the department expanded and developed its teaching and research agendas to cover many areas of computing.

Dates of Existence
1969-present

Related Units

Journal Issue
Is Version Of
Versions
Series
Department
Abstract

With the advent of Human Genome Project and other genome sequencing efforts, we are now faced with the challenge of developing not only new methods of data analysis but also improving the already existing methods of data analysis so that they can be better used to take advantage of the data. Here we revisit clustering as a tool for large-scale gene expression(or any other data) analysis. Distance measures are an integral part of any clustering algorithm as a means of checking how similar two objects are. The present distance measures are not flexible enough to capture user defined similarities. We define a similarity measure called Universal Distance Measure (UDM) that is flexible enough to capture any abstract notion of similarity. UDM incorporates the user view about two or more objects being same to come up with a well defined distance measure. Further we also investigate how sub clusters interact as a group to form new clusters. We call this interaction Methodology of Joining (MOJ) and study various aspects of it. We believe the use of UDM and MOJ will help us to capture more complex relations that we are looking for in the data.

Comments
Description
Keywords
Citation
Source
Copyright
Mon Jan 01 00:00:00 UTC 2001