Customer Reviews


13 Reviews
5 star:
 (11)
4 star:    (0)
3 star:    (0)
2 star:
 (1)
1 star:
 (1)
 
 
 
 
 
Average Customer Review
Share your thoughts with other customers
Create your own review
 
 
Only search this product's reviews

The most helpful favorable review
The most helpful critical review


28 of 30 people found the following review helpful:
5.0 out of 5 stars A must have...
I've had the pleasure of listening to Dorian speak at seminars and even sharing a few brief words with him in person. When he mentioned to me last year that he was working on this book I had no idea how thorough and complete it would be. In fact, I remember wondering to myself how anyone could get their hands around this difficult, yet important aspect of data mining...
Published on April 27, 1999 by Greg James (gjames@netguild.com)

versus
19 of 28 people found the following review helpful:
2.0 out of 5 stars Content is good, but source code contains errors!
This book covers the topic of data preparation very thoroughly.However, it includes source code that won't compile. The book comes up short in referencing the source code that is included on the CD to the various methods that are described in the book. If you don't care about the source code, the book is good. But, if you're a programmer as I am, you expect the source...
Published on September 27, 1999


‹ Previous | 1 2 | Next ›
Most Helpful First | Newest First

28 of 30 people found the following review helpful:
5.0 out of 5 stars A must have..., April 27, 1999
Amazon Verified Purchase(What's this?)
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
I've had the pleasure of listening to Dorian speak at seminars and even sharing a few brief words with him in person. When he mentioned to me last year that he was working on this book I had no idea how thorough and complete it would be. In fact, I remember wondering to myself how anyone could get their hands around this difficult, yet important aspect of data mining. I'm in awe! Anyone in the trenches will immediately understand the value of this book. Those just getting started in data mining will probably have no idea how much simpler their job just became. My only criticism of this book is that its title obscures that fact that there is a wealth of general data mining information contained within it - practical well beyond the data preparation phase. To understand why and how certain data preparation techniques work is to go a long way towards appreciating subtleties throughout the rest of the data mining process. Thanks Dorian!
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


18 of 20 people found the following review helpful:
5.0 out of 5 stars The true data miner toolkit, May 21, 1999
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
This book is simply great, it provides the best practices about the essence of data mining : data preparation

Any analyst knows that 80% of data mining time is spent in data preparation, nevertheless most authors focus on the remaining 20% : techniques and tools, where the value is more "visible"

The truth is that this value is unavaillable, unless you get data prepared, and that's the difficult job

Dorian book's allows you to understand this obscure process through a great analytical process. Strongly recommended

Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


9 of 9 people found the following review helpful:
5.0 out of 5 stars Great book about data and Data Mining, October 22, 2003
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
The book "Data Preparation for Data Mining" is not a common Data Mining book. Nowdays a classical Data Mining book contains a metodology how to solve class of standard problems. This means more a set of prescriptions and receipts for elaborating various solutions to customer loyality, retention etc.
The book by Dorian Pyle is different.It is not a Data Mining book, because as the authors claims, Data Mining is only a part of wider subject, which he calls Data Exploration. He shows us wider spectrum of various subjects considering data than you can find in other books. He gives us a good background that helps to recognize the source of problems with data. Some subjects are not to find in other sources. They come directly from author's reach experience. To summarise - the autor of the book managed to describe the whole subject area considering data, that is not to find in other books on this topics. To achive that knowledge we should read many other publications from different areas.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


6 of 6 people found the following review helpful:
5.0 out of 5 stars A must have book, December 26, 2002
By 
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
Anyone who practices data mining lives the issues discussed in this book. The book dissects and explains important data challenges.

Dorian communicates far more than knowledge about the nomenclature of the problems. His focus is instead on the trade-offs one makes while wrestling with data issues. He provides sage counsel on how the practioner can address data for the specific types of analytical problems one faces.

I own about 15 data mining books, this is the one that I use the most.

Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


8 of 9 people found the following review helpful:
5.0 out of 5 stars This book saved me when all else had failed., September 10, 2001
By 
Megan Squire (Gibsonville, NC USA) - See all my reviews
(REAL NAME)   
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
I was in the market for some information on how to scale data before using it with a neural network. After trying to wade through material that was somewhat inaccessible to my feeble brain, this book saved me. I was able to implement a simple scaling system in less than 2 hours. I later asked one of the econometricians at my company if I had done the scaling properly, and he said I did! This book is simple to understand, and best of all, it was correct!!
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


3 of 3 people found the following review helpful:
5.0 out of 5 stars The best book on this subject, December 18, 2005
By 
Vallaud (Paris, France) - See all my reviews
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
There are more books available now on data mining preparation but Pyle's book is the best one :
- very clear
- covering all the important topics to need to learn about
- based on many examples and advices coming from real life
To buy absolutely
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


2 of 2 people found the following review helpful:
5.0 out of 5 stars Excellent book, May 8, 2004
By 
M Ferreyra (Trenque Lauquen, Argentina) - See all my reviews
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
I started to work in Data Mining about 7 years ago. I have read many books about this subject and nearly all of them have similar approaches and stress the importance of the data preparation but they all, except one, just discuss lightly this subject. I don't know any book that deal with data preparation as Pyle's book does. Each topic is discussed in-depth. This is a very clear and enjoyable book. I have tested all concepts and the results are amazing.

Chapter 11 introduces the Data Survey topic in which techniques based in Shannon's Information Theory are showed. These information theory approaches are simply wonderful and gives to the Data Mining subject the bases to get models that extract complete relevant information from the data.

I recommend this book to anyone who wants to work seriously with Data Mining.

Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


4 of 5 people found the following review helpful:
5.0 out of 5 stars Still very much worth reading in 2007, August 4, 2007
By 
Keith McCormick (North Carolina, USA) - See all my reviews
(REAL NAME)   
Amazon Verified Purchase(What's this?)
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
I have been helping folks learn Clementine - a data mining package - for several years. I have read a number of related books, but never got to this one until recently. That was a mistake. This may be an important book for you if you are new to Data Mining, even if, especially if, you already have expertise in statistics and/or data base technology.

Although I still believe if someone is brand new to the field that they begin with Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management, this should be the second book that they read. Far too many books in this area read like statistics books (notably Data Mining Methods and Models).

Statistics training can be of enormous benefit to data miners, but leads to certain predictable errors. Not only that, many data miners already have statistics training and that just compounds the likelihood that they will make these mistakes when the book author fails to show the difference clearly. Pyle performs consistently well in this regard. He consistently focuses on the kinds of problems data miners are likely to see in their work.

To give just a couple of examples: Few variables will be already stored as continuous, normally distributed variables; principle components analysis might sometimes be a problematic way to eliminate predictors and even be dangerous; missing versus "empty" data; constantly present non-linearity.

His practice data set has a real variety of variable types, and dozens of predictors. If you are figuring out if Data Mining can help you, start with the Berry/Linoff book. But .. if you are about to begin in earnest read this book. Then, time permitting; start reading specific books on modeling or software. For instance, another Larose book has good, detailed coverage of algorithms, and some information on Clementine. Discovering Knowledge in Data: An Introduction to Data Mining
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


19 of 28 people found the following review helpful:
2.0 out of 5 stars Content is good, but source code contains errors!, September 27, 1999
By A Customer
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
This book covers the topic of data preparation very thoroughly.However, it includes source code that won't compile. The book comes up short in referencing the source code that is included on the CD to the various methods that are described in the book. If you don't care about the source code, the book is good. But, if you're a programmer as I am, you expect the source code to work!
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


5.0 out of 5 stars Great Book, October 24, 2009
Amazon Verified Purchase(What's this?)
This review is from: Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
As the old saying goes, Garbage In, Garbage Out (GIGO). "Data Preparation for Data Mining" is the remedy for this pervasive, age-old problem. There are so many different aspects to data quality, it boggles the mind. Mr. Pyle addresses each one in detail, with clear examples and explanations.

The book is well-written and more importantly, understandable. It should be required reading for every researcher and modeler BEFORE they begin their careers. The way data is prepared and aggregated determines the picture one gets from the data. It must be done correctly from the start, or all downstream processing and conclusions are suspect.

The CD that comes with the book is pretty much useless, but aside from that caveat, this is a great book. Buy it - you won't be disappointed.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


‹ Previous | 1 2 | Next ›
Most Helpful First | Newest First

This product

Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems)
Used & New from: $57.53
Add to wishlist See buying options