or
Sign in to turn on 1-Click ordering.
 
 
Express Checkout with PayPhrase
What's this? | Create PayPhrase
More Buying Choices
34 used & new from $27.67

Have one to sell? Sell yours here
 
   
Principles of Data Mining (Adaptive Computation and Machine Learning)
 
 
Tell the Publisher!
I’d like to read this book on Kindle

Don’t have a Kindle? Get your Kindle here.
 
  

Principles of Data Mining (Adaptive Computation and Machine Learning) (Hardcover)

~ David J. Hand (Author), Heikki Mannila (Author), Padhraic Smyth (Author)
3.6 out of 5 stars  See all reviews (16 customer reviews)

List Price: $68.00
Price: $43.87 & this item ships for FREE with Super Saver Shipping. Details
You Save: $24.13 (35%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.

Want it delivered Monday, December 21? Choose One-Day Shipping at checkout. Details
Ordering for Christmas? To ensure delivery by December 24, choose FREE Super Saver Shipping at checkout. Read more about holiday shipping.

19 new from $29.75 15 used from $27.67

Frequently Bought Together

Principles of Data Mining (Adaptive Computation and Machine Learning) + The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition (Springer Series in Statistics) + Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Price For All Three: $138.28

Show availability and shipping details


Customers Who Bought This Item Also Bought

The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition (Springer Series in Statistics)

The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition (Springer Series in Statistics)

by Robert Tibshirani
3.9 out of 5 stars (35)  $49.90
Data Mining: Concepts and Techniques, Second Edition (The Morgan Kaufmann Series in Data Management Systems)

Data Mining: Concepts and Techniques, Second Edition (The Morgan Kaufmann Series in Data Management Systems)

by Jiawei Han
3.7 out of 5 stars (30)  $51.58
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)

by Richard O. Duda
3.7 out of 5 stars (28)  $104.87
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

by Eibe Frank
4.0 out of 5 stars (31)  $44.51
Machine Learning (Mcgraw-Hill International Edit)

Machine Learning (Mcgraw-Hill International Edit)

by Tom M. Mitchell
4.3 out of 5 stars (38)  $74.43
Explore similar items

Editorial Reviews

Product Description

The growing interest in data mining is motivated by a common problem across disciplines: how does one store, access, model, and ultimately describe and understand very large data sets? Historically, different aspects of data mining have been addressed independently by different disciplines. This is the first truly interdisciplinary text on data mining, blending the contributions of information science, computer science, and statistics. The book consists of three sections. The first, foundations, provides a tutorial overview of the principles underlying data mining algorithms and their application. The presentation emphasizes intuition rather than rigor. The second section, data mining algorithms, shows how algorithms are constructed to solve specific problems in a principled manner. The algorithms covered include trees and rules for classification and regression, association rules, belief networks, classical statistical models, nonlinear models such as neural networks, and local "memory-based" models. The third section shows how all of the preceding analysis fits together when applied to real-world data mining problems. Topics include the role of metadata, how to handle missing data, and data preprocessing.


About the Author

David J. Hand is Professor of Statistics, Department of Mathematics, Imperial College, London. Heikki Mannila is Research Fellow at Nokia Research Center and Professor, Department of Computer Science and Engineering, Helsinki University of Technology. Padhraic Smyth is Associate Professor, Department of Information and Computer Science, the University of California, Irvine.

Product Details

  • Hardcover: 578 pages
  • Publisher: The MIT Press (August 1, 2001)
  • Language: English
  • ISBN-10: 026208290X
  • ISBN-13: 978-0262082907
  • Product Dimensions: 9 x 8 x 1.2 inches
  • Shipping Weight: 2.5 pounds (View shipping rates and policies)
  • Average Customer Review: 3.6 out of 5 stars  See all reviews (16 customer reviews)
  • Amazon.com Sales Rank: #594,609 in Books (See Bestsellers in Books)

    Popular in this category: (What's this?)

    #62 in  Books > Computers & Internet > Computer Science > Artificial Intelligence > Machine Learning

More About the Author

D. J. Hand
Discover books, learn about writers, read author blogs, and more.

Visit Amazon's D. J. Hand Page

Inside This Book (learn more)
Browse Sample Pages:
Front Cover | Table of Contents | First Pages | Index | Surprise Me!
Search Inside This Book:



Tags Customers Associate with This Product

 (What's this?)
Click on a tag to find related items, discussions, and people.
 
(2)
(2)

Your tags: Add your first tag
 

Sell a Digital Version of This Book in the Kindle Store

If you are a publisher or author and hold the digital rights to a book, you can sell a digital version of it in our Kindle Store. Learn more

 

Customer Reviews

16 Reviews
5 star:
 (7)
4 star:
 (4)
3 star:
 (1)
2 star:    (0)
1 star:
 (4)
 
 
 
 
 
Average Customer Review
3.6 out of 5 stars (16 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

 
29 of 29 people found the following review helpful:
5.0 out of 5 stars finally a good statistical and computer science perspective on data mining, January 23, 2008
This book is not an introductory text. Anyone interested in a particular topic should consult the preface of the text to find out what it is about. The negative reviewers were not fair to the authors on that score. Had they read the preface they would have found out (1) how the authors define data mining, (2) that they see it as a subject with an important mix of statistical methodology and computer science and (3) that it is intended as an advanced undergraduate or first year graduate text on the topic.
They also provide a very well organized structure for the text that is well described in the preface. It consists of three parts. Chapter 1 is an essential introduction that is informative to everyone. Chapters 2 through 4 go through basic statistical ideas that statisticians would be very familiar with and others could view as a refresher. The authors have experience teaching this course to engineering and science majors and have found that many of these students unfortunately do not have the prerequisite statistical inference ideas and need this material covered in the course.

Chapters 5 through 8 cover the components of data mining algorithms and the remaining chapters deal with the details of the tasks and algorithms.

The book features a further reading section at the end of each chapter that provides a very nice guide to the useful and most significant relevant literature. The author's have done a very good job at this. One mistake I found was a reference to Miller (1980). I think this was intended to be a reference to the seocnd edition fo Rupert Miller's text "Simultaneous Statistical Inference" which was published in 1981 by Springer-Verlag but the full citation is missing from the list of references in the back of the book.

This book deserves 5 stars because it does what it intends to do. It presents the field of data mining in a clear way covering topics on classfication and kernel methods expertly. David Hand has published a great deal on these techniques including many fine books.

Mannila and Smyth bring to the text the computer science perspective. There is much useful material on optimization methods and computational complexity.

Statistical modeling and issues of the "curse of dimensionality" and the "overfitting problem" are key issues that this text emphasizes and expertly addresses.

The only thing the text misses is details on specific algorithms. But I do not grade them down for that because it was not their intention. They emphasize methodology and issues and that is the most critical thing a practitioner needs to know first before embarking on his own attack at mining data.

The text does provide most of the current important methods. Although Vapnik's work is mentioned and his two books are referenced there is very little discussion of support vector machines and the use of Vapnik-Chervonenkis classes and dimension in data mining. The new book by Hastie, Tibshirani and Friedman goes into much greater detail on specific algorithms include some only briefly discussed in this text (e.g. support vector machines). The support vector approach is also nicely treated in "Learning with Kernels" by Scholkopf and Smola.

I highly recommend this book for anyone interested in data mining. It is a great reference source and an eloquent text to remind you of the pitfalls of thoughtless mining or "data-dredging". It also has many nice practical examples and some interesting success stories on the application of data mining to specific problems.
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
25 of 26 people found the following review helpful:
5.0 out of 5 stars A wonderful book but not a cookbook, November 10, 2003
By Robert Ehrlich (Salt Lake City, UT USA) - See all my reviews
(REAL NAME)   
I am a professional data miner (20 yrs. experience) and data mining can be a treacherous business compared to conventional statistical analysis. There are many software packages that offer the novice a seemingly plethora of "information-extracting" tools. There is a tendency in the field to regard one or another of these as the final and eternal answer to a particular objective. This is the best guide so far in assisting the novice data miner in avoiding dumb mistakes and selecting the strongest analytical tool suited to data structure and objectives.

This book can be read and understood by anyone who has had a decent basic course in statistics or or in pattern recognition. It alerts the reader to potential pitfalls in using a particular data mining procedure. It also clearly describes essential differences between procedures. Examples from real data are clear and integrated with the text.

This is not a "cookbook" that teaches you keystroke by keystroke how to implement an algorithm. Instead this book is a guide in understanding the fundamentals behind each procedure (as good as possible assuming low level math skills), and hints on interpetation of output, especially limits to interpretation. It is very well written and can stand alone as a guide or serve as a testbook in a data mining class.

Now if they would just write a book on bayesian decision-making in the same way.

Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
20 of 21 people found the following review helpful:
5.0 out of 5 stars Excellent introductory text on data mining, April 29, 2003
By A Customer
This is an excellent book for students in engineering and computer science who would like an introductory and statistical treatment of data mining. It has much more statistical content than other widely-used data mining texts such as those by Han and Kamber or Witten and Frank. And it is better suited to senior undergraduate or first-year graduate students in CS and EE than the text by Hastie and colleagues, since it has broader coverage of data mining topics and a more tutorial-style introduction to the basic principles of inference from data.

The coverage emphasizes breadth rather than depth and this works well for an introductory text. Numerous and extensive references are provided for further reading. The layout of the book is interesting, proceeding from data visualization (often ignored in many data mining books) through general principles of inference and algorithms, to more specific techniques in classification and regression. If you are interested in data mining and would like a statistically-motivated introduction, then this is the book to start with.

Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)


Share your thoughts with other customers: Create your own review
 
 
 
Most Recent Customer Reviews

3.0 out of 5 stars make sure you are right audience
It's not that this is a bad book, but you have to make sure you are right audience. The book offers very high-level overviews on various techniques of data mining, but it is... Read more
Published on December 1, 2005 by W

4.0 out of 5 stars It shows me many examples
Even if it is bad as all the gentlemen said, I think at least it gives me many examples which are not mentioned in other books before.
Published on April 7, 2005 by tjzzy

1.0 out of 5 stars Very, Bad Book !
I was very disappointed in this book. There are so many other books in the field of Data Mining that are so much better. This one has very little to offer. Read more
Published on December 22, 2004 by D. K. Wedding

4.0 out of 5 stars Good book for overall breadth of alogrithms..
Very good book for a general overview of data mining algorithms. Covers a wide variety of DM approaches.. however lacks concrete examples to clear concepts thoroughly. Read more
Published on August 15, 2004 by Manish C. Tayal

5.0 out of 5 stars Great book with a great layout!
I'd been struggling with the seemingly infinite ways to approach data mining and this book cleared it all up for me. Read more
Published on March 14, 2004 by Kevin Nasman

1.0 out of 5 stars This is NOT a Data Mining Book .. But a bad statistics book
Finally .. I recevie the book .. I read the list of content and I surprised about it .. and now I know why they dont write the contents here to read before bying the book... Read more
Published on October 13, 2003 by Mustafa

1.0 out of 5 stars This is a Bad Statistics Book Not a Data Mining Book
Finally .. I recevie the book .. I read the list of content and I surprised about it .. and now I know why they dont write the contents here to read before bying the book... Read more
Published on October 13, 2003 by Mustafa

5.0 out of 5 stars Very good introduction to the topic
"Principles of Data Mining" was my first book on the subject, and although I haven't read it all, I can state that this book has done its job in explaining the fundamentals of the... Read more
Published on June 25, 2003 by Dumitru Erhan

4.0 out of 5 stars Good text
This text is well written, but not very technical. It is not particularly useful as a reference, if your goal is to get a project or projects off the ground. Read more
Published on April 27, 2003

4.0 out of 5 stars good book to read
It's an amazing book to read for data mining, from statistics perspective. after reading this book, try to do a project by using ideas you learn from it. Read more
Published on February 8, 2002

Only search this product's reviews



Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 


Active discussions in related forums
Search Customer Discussions
Search all Amazon discussions
   




Product Information from the Amapedia Community

Beta (What's this?)


Look for Similar Items by Category


Look for Similar Items by Subject

 

Feedback

If you need help or have a question for Customer Service, contact us.
 Would you like to update product info or give feedback on images?
Is there any other feedback you would like to provide?

Your comments can help make our site better for everyone.


Your Recent History

 (What's this?)

After viewing product detail pages or search results, look here to find an easy way to navigate back to pages you are interested in.