Data Mining: and thousands of other textbooks are available for instant download on your Kindle Fire tablet or on the free Kindle apps for iPad, Android tablets, PC or Mac.

Sorry, this item is not available in
Image not available for
Color:
Image not available

To view this video download Flash Player

 


or
Sign in to turn on 1-Click ordering
Sell Us Your Item
For a $16.38 Gift Card
Trade in
Kindle Edition
Read instantly on your iPad, PC, Mac, Android tablet or Kindle Fire
Buy Price: $37.38
Rent From: $15.19
 
 
 
More Buying Choices
Have one to sell? Sell yours here

Data Mining: Practical Machine Learning Tools and Techniques, Third Edition (The Morgan Kaufmann Series in Data Management Systems) [Paperback]

Ian H. Witten , Eibe Frank , Mark A. Hall
3.9 out of 5 stars  See all reviews (38 customer reviews)

Buy New
$39.35 & FREE Shipping. Details
Rent
$16.98 - $19.75
Only 2 left in stock (more on the way).
Ships from and sold by Amazon.com. Gift-wrap available.
In Stock.
Want it Tuesday, July 15? Choose One-Day Shipping at checkout. Details
Free Two-Day Shipping for College Students with Amazon Student

Formats

Amazon Price New from Used from
 
Kindle Edition
Rent from
$37.38
$15.19
 
Paperback $39.35  
Shop the new tech.book(store)
New! Introducing the tech.book(store), a hub for Software Developers and Architects, Networking Administrators, TPMs, and other technology professionals to find highly-rated and highly-relevant career resources. Shop books on programming and big data, or read this week's blog posts by authors and thought-leaders in the tech industry. > Shop now

Book Description

January 20, 2011 0123748569 978-0123748560 3

Data Mining: Practical Machine Learning Tools and Techniques offers a thorough grounding in machine learning concepts as well as practical advice on applying machine learning tools and techniques in real-world data mining situations. This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.

Thorough updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including new material on Data Transformations, Ensemble Learning, Massive Data Sets, Multi-instance Learning, plus a new version of the popular Weka machine learning software developed by the authors. Witten, Frank, and Hall include both tried-and-true techniques of today as well as methods at the leading edge of contemporary research.

*Provides a thorough grounding in machine learning concepts as well as practical advice on applying the tools and techniques to your data mining projects *Offers concrete tips and techniques for performance improvement that work by transforming the input or output in machine learning methods *Includes downloadable Weka software toolkit, a collection of machine learning algorithms for data mining tasks-in an updated, interactive interface. Algorithms in toolkit cover: data pre-processing, classification, regression, clustering, association rules, visualization


Frequently Bought Together

Data Mining: Practical Machine Learning Tools and Techniques, Third Edition (The Morgan Kaufmann Series in Data Management Systems) + Data Mining: Concepts and Techniques, Third Edition (The Morgan Kaufmann Series in Data Management Systems) + Introduction to Data Mining
Price for all three: $204.34

Some of these items ship sooner than the others.

Buy the selected items together


Editorial Reviews

Review

"Co-author Witten is the author of other well-known books on data mining, and he and his co-authors of this book excel in statistics, computer science, and mathematics. Their in- depth backgrounds and insights are the strengths that have permitted them to avoid heavy mathematical derivations in explaining machine learning algorithms so they can help readers from different fields understand algorithms. I strongly recommend this book to all newcomers to data mining, especially to those who wish to understand the fundamentals of machine learning algorithms."--INFORMS Journal of Computing

"The third edition of this practical guide to machine learning and data mining is fully updated to account for technological advances since its previous printing in 2005 and is now even more closely aligned with the use of the Weka open source machine learning, data mining and data modeling application. Beginning with an introduction to data mining, the volume explores basic inputs, outputs and algorithms, the implementation of machine learning schemes and in-depth exploration of the many uses of the Weka data analysis software. Numerous illustration, tables and equations are included throughout and additional resources are available through a companion website. Witten, Frank and Hall are academics with the department of computer science at the University of Waikato, New Zealand, the home of the Weka software project."--Book News, Reference & Research

"I would recommend this book to anyone who is getting started in either data mining or machine learning and wants to learn how the fundamental algorithms work. I liked that the book slowly teaches you the different algorithms piece by piece and that there are also a lot of examples. I plan on taking a machine learning course this upcoming fall semester and feel that the book gave me great insight that the course will be based on mathematics more than I had originally expected. My favorite part of the book was the last chapter where it explains how you can solve different practical data mining scenarios using the different algorithms. If there were more chapters like the last one, the book would have been perfect. This book might not be that useful if you do not plan on using the Weka software or if you are already familiar with the various machine learning algorithms. Overall, Data Mining: Practical Machine Learning Tools and Techniques is a great book to learn about the core concepts of data mining and the Weka software suite."-- ACM SIGSOFT Software Engineering Notes

"This book is a must-read for every aspiring data mining analyst. Its many examples and the technical background it imparts would be a unique and welcome addition to the bookshelf of any graduate or advanced undergraduate student. The book is written for both academic and application-oriented readers, and I strongly recommend it to any reader working in the area of machine learning and data mining."--Computing Reviews.com

From the Back Cover

Data Mining: Practical Machine Learning Tools and Techniques offers a thorough grounding in machine learning concepts as well as practical advice on applying machine learning tools and techniques in real-world data mining situations. This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.

Thorough updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including new material on Data Transformations, Ensemble Learning, Massive Data Sets, Multi-instance Learning, plus a new version of the popular Weka machine learning software developed by the authors. Witten, Frank, and Hall include both tried-and-true techniques of today as well as methods at the leading edge of contemporary research.


Product Details

  • Series: The Morgan Kaufmann Series in Data Management Systems
  • Paperback: 664 pages
  • Publisher: Morgan Kaufmann; 3 edition (January 20, 2011)
  • Language: English
  • ISBN-10: 0123748569
  • ISBN-13: 978-0123748560
  • Product Dimensions: 1.7 x 7.4 x 9.1 inches
  • Shipping Weight: 3 pounds (View shipping rates and policies)
  • Average Customer Review: 3.9 out of 5 stars  See all reviews (38 customer reviews)
  • Amazon Best Sellers Rank: #98,815 in Books (See Top 100 in Books)

More About the Authors

Discover books, learn about writers, read author blogs, and more.

Customer Reviews

3.9 out of 5 stars
(38)
3.9 out of 5 stars
Share your thoughts with other customers
Most Helpful Customer Reviews
87 of 89 people found the following review helpful
5.0 out of 5 stars Worthwhile Update to an Excellent Text March 6, 2011
Format:Paperback|Vine Customer Review of Free Product (What's this?)
Context for this review: I am a data miner with 20 years experience, and own the first edition of this book.

Good:
- Accessible writing style
- Broad coverage of algorithms and data mining issues, with an eye toward practical issues
- Needless technical trivia (derivations and the like) are avoided
- Algorithms are completely spelled out: A competent programmer should be able to turn these descriptions into functioning code.
- Third edition makes meaningful improvements on previous editions

Bad(ish):
- Approximately one-third of this book is now devoted to the WEKA data mining software. I have nothing against WEKA, and it is a good choice for a text such as this, since WEKA is free. In my opinion, though, this coverage consumes too many pages of this book.
- Data mining draws from a number of fields with separate roots (statistics, machine learning, pattern recognition, engineering, etc.), and many techniques go by multiple names. As with many other data mining books, this one does not always point out the aliases by which data mining methods are known.

The bottom line: This is still the best data mining text on the market.
Was this review helpful to you?
28 of 29 people found the following review helpful
4.0 out of 5 stars Applying Machine Learning to Data Mining problems April 1, 2011
Format:Paperback|Vine Customer Review of Free Product (What's this?)
The subtitle of the book should really be emphasized more: Practical Machine Learning Tools and Techniques. This isn't a book about adhoc SQL queries and database statistics, it is about tools to discover relationships you didn't know you were looking for. Much of the book shows how to handle knowledge formation and representation, statistical modeling and projections. The one critique I have in regard is that much of the algorithm breakdowns are done in prose rather than true pseudocode.

I would like to echo other reviews that point out the text focuses on WEKA, and the authors indicate this is by intent. Though they do give much generic information, at some point you have to pick a horse to hitch your carriage to, and an established open-source project in Java is probably most widely accessible. Their coverage of WEKA claims 50% more features than the 2nd ed. and indeed it consumes half the book. I feel this is a good thing, as it lends great practicality to the book, allowing you to dig right in and get something actually done.

There are some additions to the 3rd ed. that modernize the book a bit. Showing how data can be reidentified (and the ethical implications) is pertinent to today's HIPAA-regulated medical environments. They also touch on web and ubiquitous mining, reflecting our growing foray into non-traditional cloud sources of information.
Comment | 
Was this review helpful to you?
20 of 20 people found the following review helpful
5.0 out of 5 stars My favorite practical machine learning book September 3, 2011
Format:Paperback|Vine Customer Review of Free Product (What's this?)
There exists a couple of classics of Machine learning, with various strengths and weaknesses. "The elements of statistical learning" by Hastie and company. Bishop's book, "Pattern Recognition and Machine Learning." And now, this book, "Data Mining." I'd say this is the most practical of the three books. The other two I mentioned are oriented towards theoretical underpinnings, and cataloging the rich zoology of machine learning techniques. This one tells you how to get stuff done. Lots of practical ideas on discretization, denoising, data preparation and performance characterization. It even has practical advice on things you really need an expert opinion on: for example, when using data folding techniques for cross validation ... what is a good number of folds to use? This book will tell you. It's like having a couple of seasoned experts looking over your shoulder when you're trying to get things done. It had a detailed recipe in it for something I really needed to solve... and their recipe worked!
While the subject matter is similar to the Bishop and Hastie books: what this most reminded me of was the classic physics text, "Numerical recipes." It's all very well having a good theoretical understanding of the techniques you're using. It's vastly more important to have advice on using them properly. This is that book; uniquely so, thus far, in my experience.
It's also a brilliant manual for their Weka machine learning environment, which is incredibly useful. I don't use the Weka UI, but I have called upon Weka as a library extension to the R programming environment. Mostly because of this book: it's both a recipe book and a map to a large collection of recipes you can use to solve your machine learning problems.

There isn't so much on time series applications, sadly, which is something I end up working with a lot. I'd love to see an extended chapter on the particular difficulties in using machine learning techniques to mine and forecast time series.
Comment | 
Was this review helpful to you?
28 of 30 people found the following review helpful
4.0 out of 5 stars Mixed Opinion April 27, 2011
Format:Paperback|Vine Customer Review of Free Product (What's this?)
Fantastic book if you need to use WEKA; probably the best recommendation available.

If, however, you're not going to be using WEKA then the book is still valuable, but I challenge the true 'practicality' of it. The content is thorough but perhaps more academically oriented than as industry focused as I would have liked. The author keeps it very accessible, particularly as far as mathematics and statistics go. While this might make the book a little more long winded - in my view it makes it a far easier to get into the groove and allows you to read it like a book.

* Highly recommended for WEKA users
* For others users I suggest you look through to see if it will really be helpful before plunking down the cash
Was this review helpful to you?
Most Recent Customer Reviews
4.0 out of 5 stars Good for overview and intuition
First of all, I would advise to think of this as a 400-page book with a WEKA appendix. Its price is about right for a 400-page machine learning textbook, and you don't even need... Read more
Published 21 days ago by Andrew
3.0 out of 5 stars Good
It's fine, but poor on kind of examples and applications. It's a good product to data mining and weka software
Published 2 months ago by Daniel Montaner
3.0 out of 5 stars More like a manual of Weka software
The book is more like a manual for the Weka software. No clear discussion of the data mining techniques is available. Explanations given are very vague and not helpful. Read more
Published 3 months ago by AAA
4.0 out of 5 stars good book
i like this book my professor suggested this one. And, it is Easy to understand weka tool is good one.
Published 4 months ago by Nuthan Nomula
5.0 out of 5 stars Excellent Introduction
I especially like the "Simple Techniques" chapter. It gets right to the important ideas, without distracting advanced details. Read more
Published 4 months ago by opus111
5.0 out of 5 stars Very comprehensive and even better with Weka
I've had (the first edition of) this book for a while now, and when I first read it some years ago, I thought that it was very comprehensive but written in a mind-numbing dry... Read more
Published 8 months ago by Sujit Pal
5.0 out of 5 stars Useful and well-performed
The book has nothing bad with its printing quality.
The content is useful.
The price is not that high.
The delivering speed is acceptable.
I like it.
Published 10 months ago by Xiaoyu Zhang
1.0 out of 5 stars Avoid
Reading some of these reviews I feel like I must have gotten another book. I really didn't think the book was worth the time or money investment. Read more
Published 12 months ago by Charles
5.0 out of 5 stars Must have
The book is a must have in case you'd like to know how things works under the hood. It describes in details neural nets, decision trees, associative rules and others. Read more
Published 13 months ago by Eugene Morozov
2.0 out of 5 stars Difficult to read with lots of references
I wish the chapters are easy to read. The earlier chapters starts by referencing a section in the future and later chapters refer to earlier stuff. Read more
Published 14 months ago by Database reader
Search Customer Reviews
Search these reviews only


Forums

There are no discussions about this product yet.
Be the first to discuss this product with the community.
Start a new discussion
Topic:
First post:
Prompts for sign-in
 



Look for Similar Items by Category