Clustering for Data Mining: A Data Recovery Approach and over one million other books are available for Amazon Kindle. Learn more


or
Sign in to turn on 1-Click ordering.
or
Amazon Prime Free Trial required. Sign up when you check out. Learn More
Sell Back Your Copy
For a $3.61 Gift Card
Trade in
More Buying Choices
Have one to sell? Sell yours here
Clustering for Data Mining: A Data Recovery Approach (Chapman & Hall/CRC Computer Science & Data Analysis)
 
 
Start reading Clustering for Data Mining: A Data Recovery Approach on your Kindle in under a minute.

Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.

Clustering for Data Mining: A Data Recovery Approach (Chapman & Hall/CRC Computer Science & Data Analysis) [Hardcover]

Boris Mirkin (Author)
5.0 out of 5 stars  See all reviews (2 customer reviews)

List Price: $104.95
Price: $90.37 & this item ships for FREE with Super Saver Shipping. Details
You Save: $14.58 (14%)
  Special Offers Available
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.
Only 3 left in stock--order soon (more on the way).
Want it delivered Tuesday, January 31? Choose One-Day Shipping at checkout. Details
Textbook Student FREE Two-Day Shipping for Students. Learn more

Formats

Amazon Price New from Used from
Kindle Edition $67.96  
Hardcover $90.37  
There is a newer edition of this item:
Clustering: A Data Recovery Approach, Second Edition (Chapman & Hall/CRC Computer Science & Data Analysis) Clustering: A Data Recovery Approach, Second Edition (Chapman & Hall/CRC Computer Science & Data Analysis) 5.0 out of 5 stars (2)
$93.30
Available for Pre-order

Book Description

1584885343 978-1584885344 April 29, 2005 1
Often considered more as an art than a science, the field of clustering has been dominated by learning through examples and by techniques chosen almost through trial-and-error. Even the most popular clustering methods--K-Means for partitioning the data set and Ward's method for hierarchical clustering--have lacked the theoretical attention that would establish a firm relationship between the two methods and relevant interpretation aids.

Rather than the traditional set of ad hoc techniques, Clustering for Data Mining: A Data Recovery Approach presents a theory that not only closes gaps in K-Means and Ward methods, but also extends them into areas of current interest, such as clustering mixed scale data and incomplete clustering. The author suggests original methods for both cluster finding and cluster description, addresses related topics such as principal component analysis, contingency measures, and data visualization, and includes nearly 60 computational examples covering all stages of clustering, from data pre-processing to cluster validation and results interpretation.

This author's unique attention to data recovery methods, theory-based advice, pre- and post-processing issues that are beyond the scope of most texts, and clear, practical instructions for real-world data mining make this book ideally suited for virtually all purposes: for teaching, for self-study, and for professional reference.

Special Offers and Product Promotions

  • Buy $50 in qualifying physical textbooks, get $5 in Amazon MP3 Credit. Here's how (restrictions apply)

Customers Who Viewed This Item Also Viewed


Editorial Reviews

Review

The particular decomposition studied in this book is the decomposition of the total sum of squares matrix into between and within cluster components, and the book develops this decomposition, and its associated diagnostics, further than I have seen them developed for cluster analysis before. Overall, the book presents an unusual, perhaps even rather idiosyncratic approach to cluster analysis, from the perspective of someone who is clearly an enthusiast for the insights these tools can bring to understanding data.
-D.J. Hand, Short Book Reviews of the ISI

About the Author

Boris Mirkin is a professor of computer science at the University of London, UK.

--This text refers to an alternate Hardcover edition.

Product Details

  • Hardcover: 296 pages
  • Publisher: Chapman and Hall/CRC; 1 edition (April 29, 2005)
  • Language: English
  • ISBN-10: 1584885343
  • ISBN-13: 978-1584885344
  • Product Dimensions: 9.2 x 6.4 x 0.9 inches
  • Shipping Weight: 1.2 pounds (View shipping rates and policies)
  • Average Customer Review: 5.0 out of 5 stars  See all reviews (2 customer reviews)
  • Amazon Best Sellers Rank: #1,942,572 in Books (See Top 100 in Books)

More About the Author

Discover books, learn about writers, read author blogs, and more.

 

Customer Reviews

2 Reviews
5 star:
 (2)
4 star:    (0)
3 star:    (0)
2 star:    (0)
1 star:    (0)
 
 
 
 
 
Average Customer Review
5.0 out of 5 stars (2 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

8 of 9 people found the following review helpful:
5.0 out of 5 stars Very USEFUL, September 10, 2005
This review is from: Clustering for Data Mining: A Data Recovery Approach (Chapman & Hall/CRC Computer Science & Data Analysis) (Hardcover)
This book gives a smooth, motivated and example-rich
introduction to clustering, which is innovative in many aspects.
Answers to important questions that are very rarely addressed if
addressed at all, are provided.
Examples:
(a) what to do if the user has no idea of the number
of clusters and/or their location - use what is called intelligent k-means;
(b) what to do if the data contain both numeric and categorical
features - use what is called three-step standardization procedure;
(c) how to catch anomalous patterns, (d) how to validate clusters, etc.
Some of these may be subject to criticism, however some motivation is always
supplied, and the results are always reproducible thus testable.
The book introduces a number
of non-conventional cluster interpretation aids derived from a data
geometry view accepted by the author and based on what is referred
the contribution weights - basically showing those elements of cluster
structures that distinguish clusters from the rest. These contribution
weights, applied to categorical data, appear to be highly compatible
with what statisticians such as A. Quetelet and K. Pearson were developing
in the past couple of centuries, which is a highly original and welcome
development. The book reviews a rich set of approaches being accumulated
in such hot areas as text mining and bioinformatics, and shows that
clustering is not just a set of naive methods for data processing but
forms an evolving area of data science.
I adopted the book as a text for my courses in data mining for bachelor
and master degrees.

Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


7 of 11 people found the following review helpful:
5.0 out of 5 stars Clusters of Data, Not Micro Computer Clusters, June 2, 2005
This review is from: Clustering for Data Mining: A Data Recovery Approach (Chapman & Hall/CRC Computer Science & Data Analysis) (Hardcover)
First, understand that the type of clustering being discussed in this book is the statistical technique of finding clusters of data in a collection, where the collection is typically a database. This is not about clustered micro computers being used to work on big computational tasks as though it is a supercomputer.

Clusters of customers is a key area in data mining and knowledge discovery. You are usually trying to find groups of people with similar buying patterns but not necessarily identical. For instance if you have a group of people that have purchased a book on PHP, you might want to try to sell them a book on MySQL, or Apache, or Linnux. These programs fit together, but are not identical. Still the customer who purchased the PHP book is more likely to want a MySQL book than he is to want an audio CD of a murder mystery.

In this book, two of the most popular clustering techniques, K-Means and Ward's Method are presented. They are presented for a reader interested in the technical aspects of data mining as a theoretician or a practitioner. It is intended (the author says) that the material be useful to a reader with no mathematical background beyond high school. But the author also says, it might be of help if the reader is acquainted with basic notions of calculus, statistics, matrix algebra, graph theory and logic. (The author went to a different high school than I).

Clustering is described in this book to be used in a wide variety of applications, most of which are oriented to discovering social patterns, biological taxonomies, machine learning, etc. The book discusses the various techniques that have been developed and gives examples where they have been used in a wide variety of applications.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No

Share your thoughts with other customers: Create your own review
 
 
 
Only search this product's reviews



Inside This Book (learn more)
First Sentence:
Clustering is a discipline devoted to revealing and describing homogeneous groups of entities, that is, clusters, in data sets. Read the first page
Key Phrases - Statistically Improbable Phrases (SIPs): (learn more)
entire entity set, scatter decomposition, tightness function, category utility function, divisive clustering algorithm, silhouette width, contingency data, data scatter, initial centroids, conjunctive description, maximal clusters, cluster hierarchy, towns data, recovery approach, interpretation aids, clustering goals, impurity function, confusion data, recovery framework, merged cluster, categorical features, anomalous pattern, intelligent version, original data table, town data
Key Phrases - Capitalized Phrases (CAPs): (learn more)
Leo Tolstoy, Oliver Twist, Mark Twain, Charles Dickens, Method Real, Row-point Centroid
New!
Books on Related Topics | Concordance | Text Stats
Browse Sample Pages:
Front Cover | Table of Contents | First Pages | Index | Back Cover | Surprise Me!
Search Inside This Book:





Suggested Tags from Similar Products

 (What's this?)
Be the first one to add a relevant tag (keyword that's strongly related to this product).
 

Your tags: Add your first tag
 

Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 


Active discussions in related forums
Search Customer Discussions
Search all Amazon discussions
   
Related forums


Listmania!


So You'd Like to...


Create a guide


Look for Similar Items by Category


Look for Similar Items by Subject