Join Amazon Prime and ship Two-Day for free and Overnight for $3.99. Already a member? Sign in.
Clustering for Data Mining: A Data Recovery Approach and over 300,000 other books are available for Amazon Kindle – Amazon’s new wireless reading device. Learn more

 

or
Sign in to turn on 1-Click ordering.
 
 
More Buying Choices
23 used & new from $63.41

Have one to sell? Sell yours here
 
   
Clustering for Data Mining: A Data Recovery Approach (Computer Science and Data Analysis)
 
 
Start reading Clustering for Data Mining: A Data Recovery Approach on your Kindle in under a minute.

Don’t have a Kindle? Get yours here.
 
  

Clustering for Data Mining: A Data Recovery Approach (Computer Science and Data Analysis) (Hardcover)

by Boris Mirkin (Author) "Clustering is a discipline devoted to revealing and describing homogeneous groups of entities, that is, clusters, in data sets..." (more)
Key Phrases: entire entity set, scatter decomposition, tightness function, Leo Tolstoy, Oliver Twist, Mark Twain (more...)
5.0 out of 5 stars See all reviews (2 customer reviews)

List Price: $93.95
Price: $63.41 & this item ships for FREE with Super Saver Shipping. Details
You Save: $30.54 (33%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.

Only 3 left in stock--order soon (more on the way).

Want it delivered Monday, July 20? Choose One-Day Shipping at checkout. Details
14 new from $63.41 9 used from $79.67
Also Available in: List Price: Our Price: Other Offers:
Kindle Edition (Kindle Book) $57.07

Frequently Bought Together

Clustering for Data Mining: A Data Recovery Approach (Computer Science and Data Analysis) + Introduction to Clustering Large and High-Dimensional Data + Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability)
Price For All Three: $207.10

Show availability and shipping details


Customers Who Bought This Item Also Bought



Editorial Reviews

Review
The particular decomposition studied in this book is the decomposition of the total sum of squares matrix into between and within cluster components, and the book develops this decomposition, and its associated diagnostics, further than I have seen them developed for cluster analysis before. Overall, the book presents an unusual, perhaps even rather idiosyncratic approach to cluster analysis, from the perspective of someone who is clearly an enthusiast for the insights these tools can bring to understanding data.
-D.J. Hand, Short Book Reviews of the ISI

Product Description
Often considered more as an art than a science, the field of clustering has been dominated by learning through examples and by techniques chosen almost through trial-and-error. Even the most popular clustering methods--K-Means for partitioning the data set and Ward's method for hierarchical clustering--have lacked the theoretical attention that would establish a firm relationship between the two methods and relevant interpretation aids.

Rather than the traditional set of ad hoc techniques, Clustering for Data Mining: A Data Recovery Approach presents a theory that not only closes gaps in K-Means and Ward methods, but also extends them into areas of current interest, such as clustering mixed scale data and incomplete clustering. The author suggests original methods for both cluster finding and cluster description, addresses related topics such as principal component analysis, contingency measures, and data visualization, and includes nearly 60 computational examples covering all stages of clustering, from data pre-processing to cluster validation and results interpretation.

This author's unique attention to data recovery methods, theory-based advice, pre- and post-processing issues that are beyond the scope of most texts, and clear, practical instructions for real-world data mining make this book ideally suited for virtually all purposes: for teaching, for self-study, and for professional reference. ---------------------Features--------------------- · Introduces classical clustering methods extended, via the data recovery approach, to modern data mining tasks · Describes the theory that leads to these methods and relevant interpretation aids, fills gaps in the established theory, and corrects common misconceptions · Treats the two most popular methods, K-Means and Ward clustering, offering the first theoretically motivated instructions for automating all steps of data mining with clustering · Offers an up-to-date description of current data mining issues, such as feature selection and cluster validation · Presents a wealth of computational examples covering all stages of clustering


Product Details

  • Hardcover: 296 pages
  • Publisher: Chapman & Hall/CRC; 1 edition (April 29, 2005)
  • Language: English
  • ISBN-10: 1584885343
  • ISBN-13: 978-1584885344
  • Product Dimensions: 9.2 x 6.4 x 0.9 inches
  • Shipping Weight: 1.2 pounds (View shipping rates and policies)
  • Average Customer Review: 5.0 out of 5 stars See all reviews (2 customer reviews)
  • Amazon.com Sales Rank: #1,077,326 in Books (See Bestsellers in Books)

Inside This Book (learn more)




Suggested Tags from Similar Products

 (What's this?)
Be the first one to add a relevant tag (keyword that's strongly related to this product).
Check a corresponding box or enter your own tags in the field below.

Your tags: Add your first tag
 
Help others find this product — tag it for Amazon search
No one has tagged this product for Amazon search yet. Why not be the first to suggest a search for which it should appear?

Sell a Digital Version of This Book in the Kindle Store

If you are a publisher or author and hold the digital rights to a book, you can sell a digital version of it in our Kindle Store. Learn more

 

Customer Reviews

2 Reviews
5 star:
 (2)
4 star:    (0)
3 star:    (0)
2 star:    (0)
1 star:    (0)
 
 
 
 
 
Average Customer Review
5.0 out of 5 stars (2 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

 
7 of 8 people found the following review helpful:
5.0 out of 5 stars Very USEFUL, September 10, 2005
This book gives a smooth, motivated and example-rich
introduction to clustering, which is innovative in many aspects.
Answers to important questions that are very rarely addressed if
addressed at all, are provided.
Examples:
(a) what to do if the user has no idea of the number
of clusters and/or their location - use what is called intelligent k-means;
(b) what to do if the data contain both numeric and categorical
features - use what is called three-step standardization procedure;
(c) how to catch anomalous patterns, (d) how to validate clusters, etc.
Some of these may be subject to criticism, however some motivation is always
supplied, and the results are always reproducible thus testable.
The book introduces a number
of non-conventional cluster interpretation aids derived from a data
geometry view accepted by the author and based on what is referred
the contribution weights - basically showing those elements of cluster
structures that distinguish clusters from the rest. These contribution
weights, applied to categorical data, appear to be highly compatible
with what statisticians such as A. Quetelet and K. Pearson were developing
in the past couple of centuries, which is a highly original and welcome
development. The book reviews a rich set of approaches being accumulated
in such hot areas as text mining and bioinformatics, and shows that
clustering is not just a set of naive methods for data processing but
forms an evolving area of data science.
I adopted the book as a text for my courses in data mining for bachelor
and master degrees.

Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
7 of 11 people found the following review helpful:
5.0 out of 5 stars Clusters of Data, Not Micro Computer Clusters, June 2, 2005
First, understand that the type of clustering being discussed in this book is the statistical technique of finding clusters of data in a collection, where the collection is typically a database. This is not about clustered micro computers being used to work on big computational tasks as though it is a supercomputer.

Clusters of customers is a key area in data mining and knowledge discovery. You are usually trying to find groups of people with similar buying patterns but not necessarily identical. For instance if you have a group of people that have purchased a book on PHP, you might want to try to sell them a book on MySQL, or Apache, or Linnux. These programs fit together, but are not identical. Still the customer who purchased the PHP book is more likely to want a MySQL book than he is to want an audio CD of a murder mystery.

In this book, two of the most popular clustering techniques, K-Means and Ward's Method are presented. They are presented for a reader interested in the technical aspects of data mining as a theoretician or a practitioner. It is intended (the author says) that the material be useful to a reader with no mathematical background beyond high school. But the author also says, it might be of help if the reader is acquainted with basic notions of calculus, statistics, matrix algebra, graph theory and logic. (The author went to a different high school than I).

Clustering is described in this book to be used in a wide variety of applications, most of which are oriented to discovering social patterns, biological taxonomies, machine learning, etc. The book discusses the various techniques that have been developed and gives examples where they have been used in a wide variety of applications.
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)


Share your thoughts with other customers: Create your own review
 
 
Ad
 
Only search this product's reviews



Customer Discussions

 Beta (What's this?)
New! See all customer communities, and bookmark your communities to keep track of them.
This product's forum (0 discussions)
  Discussion Replies Latest Post
  No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
  [Cancel]


Active discussions in related forums
   


Product Information from the Amapedia Community

Beta (What's this?)


Look for Similar Items by Category


Cut Wood Down to Size

Cut Wood Down to Size

Split wood with ease using a log splitter from the Outdoor Power & Lawn Equipment Store.

Shop all log splitters

 

Best Books of 2008

Best of 2008
Find our top 100 editors' picks as well as customers' favorites in dozens of categories in our Best Books of 2008 Store.
 

Buy Three Books, Get a Fourth Free

4-for-3 Books
Order any four eligible books under $10 and get the lowest-price book free in our 4-for-3 Books Store. See more details.
 

Smooth Operator

Shop for planers
With a planer every workpiece in your project can be a perfect match.

Shop for planers

 
Ad

 

Feedback

If you need help or have a question for Customer Service, contact us.
 Would you like to update product info or give feedback on images?
Is there any other feedback you would like to provide?

Your comments can help make our site better for everyone.


Where's My Stuff?

Shipping & Returns

Need Help?

Your Recent History

  (What's this?)
You have no recently viewed items or searches.

After viewing product detail pages or search results, look here to find an easy way to navigate back to pages you are interested in.

Look to the right column to find helpful suggestions for your shopping session.

Continue shopping: Top Sellers
Free
Free by Chris Anderson
Paranoia
Paranoia by Joseph Finder
My Soul to Lose
My Soul to Lose by Rachel Vincent
Glenn Beck's Common Sense

Conditions of Use | Privacy Notice © 1996-2009, Amazon.com, Inc. or its affiliates