Summary
Cluster analysis is the art of finding groups in data.
It is applied in many domains, sometimes under other names such as numerical taxonomy or automatic data classification.
Finding Groups in Data: An Introduction to Cluster Analysis presents a small set of clustering methods that have many applications.
The authors show the general user, who may not have a mathematical or statistical background, how to use this powerful tool.
The first chapter discusses the various types of data (including interval-scaled and binary variables, as well as similarity data) and shows how to choose an appropriate clustering method.
The remaining six chapters cover six different clustering methods, and can be read independently of one another.
Each chapter follows a common format.
The first three sections give a short description of the clustering method, explain how to use it, and analyze a set of examples.
The two following sections (which may be skipped without loss of understanding) discuss the algorithm and its implementation, and some related methods in the literature.
The authors present three partitioning methods and three hierarchical techniques.
These six procedures have been chosen for their robustness, consistency, and general applicability.
Some of the methods are new, such as the approach for partitioning large data sets, and the L1 method for fuzzy clustering.
Also, the clusterings are accompanied by graphical displays and corresponding quality coefficients, which help the user to select the number of clusters and to see whether the method has found groups that were actually present in the data.
The programs described here are for the IBM PC, but the source code is very portable and has been run on several types of mainframes.
The programs, together with their sources and the data sets used in the book, are available on floppy disks by writing to the authors.
Finding Groups in Data: An Introduction to Cluster Analysis should prove useful to applied statisticians, students, and anyone using quantitative methods.
|