or
Sign in to turn on 1-Click ordering.
 
 
Express Checkout with PayPhrase
What's this? | Create PayPhrase
Sorry!
More Buying Choices
33 used & new from $17.50

Have one to sell? Sell yours here
 
   
Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems)
 
 
Tell the Publisher!
I’d like to read this book on Kindle

Don’t have a Kindle? Get your Kindle here.
 
  

Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) (Paperback)

~ (Author) "Data exploration starts with data, right?..." (more)
Key Phrases: unprepared data, using prepared data, data preparation techniques, Complete Content, Montreal Canadiens, Paul Revere (more...)
4.8 out of 5 stars  See all reviews (12 customer reviews)

List Price: $75.95
Price: $45.45 & this item ships for FREE with Super Saver Shipping. Details
You Save: $30.50 (40%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
Upgrade this book for $13.79 more, and you can read, search, and annotate every page online. See details
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.

Only 5 left in stock--order soon (more on the way).

Want it delivered Thursday, November 12? Choose One-Day Shipping at checkout. Details
17 new from $41.36 16 used from $17.50

Frequently Bought Together

Customers buy this book with Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management by Michael J. A. Berry

Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) + Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management

Customers Who Bought This Item Also Bought

Business Modeling and Data Mining (The Morgan Kaufmann Series in Data Management Systems)

Business Modeling and Data Mining (The Morgan Kaufmann Series in Data Management Systems)

by Dorian Pyle
4.2 out of 5 stars (4)  $58.80
Handbook of Statistical Analysis and Data Mining Applications

Handbook of Statistical Analysis and Data Mining Applications

by Robert Nisbet
4.6 out of 5 stars (9)  $71.96
Data Mining: Concepts and Techniques, Second Edition (The Morgan Kaufmann Series in Data Management Systems)

Data Mining: Concepts and Techniques, Second Edition (The Morgan Kaufmann Series in Data Management Systems)

by Jiawei Han
3.7 out of 5 stars (30)  $55.16
Data Analysis Using SQL and Excel

Data Analysis Using SQL and Excel

by Gordon S. Linoff
5.0 out of 5 stars (10)  $38.99
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

by Ian H. Witten
4.0 out of 5 stars (31)  $41.55
Explore similar items

Editorial Reviews

Product Description

Data Preparation for Data Mining addresses an issue unfortunately ignored by most authorities on data mining: data preparation. Thanks largely to its perceived difficulty, data preparation has traditionally taken a backseat to the more alluring question of how best to extract meaningful knowledge. But without adequate preparation of your data, the return on the resources invested in mining is certain to be disappointing.

Dorian Pyle corrects this imbalance. A twenty-five-year veteran of what has become the data mining industry, Pyle shares his own successful data preparation methodology, offering both a conceptual overview for managers and complete technical details for IT professionals. Apply his techniques and watch your mining efforts pay off-in the form of improved performance, reduced distortion, and more valuable results.

On the enclosed CD-ROM, you'll find a suite of programs as C source code and compiled into a command-line-driven toolkit. This code illustrates how the author's techniques can be applied to arrive at an automated preparation solution that works for you. Also included are demonstration versions of three commercial products that help with data preparation, along with sample data with which you can practice and experiment.

* Offers in-depth coverage of an essential but largely ignored subject.
* Goes far beyond theory, leading you-step by step-through the author's own data preparation techniques.
* Provides practical illustrations of the author's methodology using realistic sample data sets.
* Includes algorithms you can apply directly to your own project, along with instructions for understanding when automation is possible and when greater intervention is required.
* Explains how to identify and correct data problems that may be present in your application.
* Prepares miners, helping them head into preparation with a better understanding of data sets and their limitations.



From the Back Cover

Data Preparation for Data Mining addresses an issue unfortunately ignored by most authorities on data mining: data preparation. Thanks largely to its perceived difficulty, data preparation has traditionally taken a backseat to the more alluring question of how best to extract meaningful knowledge. But without adequate preparation of your data, the return on the resources invested in mining is certain to be disappointing.

Dorian Pyle corrects this imbalance. A twenty-five-year veteran of what has become the data mining industry, Pyle shares his own successful data preparation methodology, offering both a conceptual overview for managers and complete technical details for IT professionals. Apply his techniques and watch your mining efforts pay off-in the form of improved performance, reduced distortion, and more valuable results.

Features

  • Offers in-depth coverage of an essential but largely ignored subject.
  • Goes far beyond theory, leading you-step by step-through the author's own data preparation techniques.
  • Provides practical illustrations of the author's methodology using realistic sample data sets.
  • Includes algorithms you can apply directly to your own project, along with instructions for understanding when automation is possible and when greater intervention is required.
  • Explains how to identify and correct data problems that may be present in your application.
  • Prepares miners, helping them head into preparation with a better understanding of data sets and their limitations.


On the enclosed CD-ROM, you'll find a suite of programs as C source code and compiled into a command-line-driven toolkit. This code illustrates how the author's techniques can be applied to arrive at an automated preparation solution that works for you. Also included are demonstration versions of three commercial products that help with data preparation, along with sample data with which you can practice and experiment.


Product Details

  • Paperback: 560 pages
  • Publisher: Morgan Kaufmann; Book & CD-ROM 1st edition (April 5, 1999)
  • Language: English
  • ISBN-10: 1558605290
  • ISBN-13: 978-1558605299
  • Product Dimensions: 9.1 x 7.3 x 1.2 inches
  • Shipping Weight: 2.2 pounds (View shipping rates and policies)
  • Average Customer Review: 4.8 out of 5 stars  See all reviews (12 customer reviews)
  • Amazon.com Sales Rank: #129,340 in Books (See Bestsellers in Books)

    Popular in these categories: (What's this?)

    #64 in  Books > Computers & Internet > Databases > Data Mining
    #87 in  Books > Computers & Internet > Web Development > Security & Encryption > Encryption

More About the Author

Dorian Pyle
Discover books, learn about writers, read author blogs, and more.

Visit Amazon's Dorian Pyle Page

Inside This Book (learn more)




What Do Customers Ultimately Buy After Viewing This Item?


Tags Customers Associate with This Product

 (What's this?)
Click on a tag to find related items, discussions, and people.
 

Your tags: Add your first tag
 

Sell a Digital Version of This Book in the Kindle Store

If you are a publisher or author and hold the digital rights to a book, you can sell a digital version of it in our Kindle Store. Learn more

 

Customer Reviews

12 Reviews
5 star:
 (11)
4 star:    (0)
3 star:    (0)
2 star:
 (1)
1 star:    (0)
 
 
 
 
 
Average Customer Review
4.8 out of 5 stars (12 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

 
28 of 30 people found the following review helpful:
5.0 out of 5 stars A must have..., April 27, 1999
I've had the pleasure of listening to Dorian speak at seminars and even sharing a few brief words with him in person. When he mentioned to me last year that he was working on this book I had no idea how thorough and complete it would be. In fact, I remember wondering to myself how anyone could get their hands around this difficult, yet important aspect of data mining. I'm in awe! Anyone in the trenches will immediately understand the value of this book. Those just getting started in data mining will probably have no idea how much simpler their job just became. My only criticism of this book is that its title obscures that fact that there is a wealth of general data mining information contained within it - practical well beyond the data preparation phase. To understand why and how certain data preparation techniques work is to go a long way towards appreciating subtleties throughout the rest of the data mining process. Thanks Dorian!
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
18 of 20 people found the following review helpful:
5.0 out of 5 stars The true data miner toolkit, May 21, 1999
By patrick darmon (Paris France) - See all my reviews
This book is simply great, it provides the best practices about the essence of data mining : data preparation

Any analyst knows that 80% of data mining time is spent in data preparation, nevertheless most authors focus on the remaining 20% : techniques and tools, where the value is more "visible"

The truth is that this value is unavaillable, unless you get data prepared, and that's the difficult job

Dorian book's allows you to understand this obscure process through a great analytical process. Strongly recommended

Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
9 of 9 people found the following review helpful:
5.0 out of 5 stars Great book about data and Data Mining, October 22, 2003
The book "Data Preparation for Data Mining" is not a common Data Mining book. Nowdays a classical Data Mining book contains a metodology how to solve class of standard problems. This means more a set of prescriptions and receipts for elaborating various solutions to customer loyality, retention etc.
The book by Dorian Pyle is different.It is not a Data Mining book, because as the authors claims, Data Mining is only a part of wider subject, which he calls Data Exploration. He shows us wider spectrum of various subjects considering data than you can find in other books. He gives us a good background that helps to recognize the source of problems with data. Some subjects are not to find in other sources. They come directly from author's reach experience. To summarise - the autor of the book managed to describe the whole subject area considering data, that is not to find in other books on this topics. To achive that knowledge we should read many other publications from different areas.
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)


Share your thoughts with other customers: Create your own review
 
 
 
Most Recent Customer Reviews

5.0 out of 5 stars Great Book
As the old saying goes, Garbage In, Garbage Out (GIGO). "Data Preparation for Data Mining" is the remedy for this pervasive, age-old problem. Read more
Published 18 days ago by R. Watling

5.0 out of 5 stars Still very much worth reading in 2007
I have been helping folks learn Clementine - a data mining package - for several years. I have read a number of related books, but never got to this one until recently. Read more
Published on August 4, 2007 by Keith McCormick

5.0 out of 5 stars The best book on this subject
There are more books available now on data mining preparation but Pyle's book is the best one :
- very clear
- covering all the important topics to need to learn about... Read more
Published on December 18, 2005 by Vallaud

5.0 out of 5 stars Excellent book
I started to work in Data Mining about 7 years ago. I have read many books about this subject and nearly all of them have similar approaches and stress the importance of the data... Read more
Published on May 8, 2004 by M Ferreyra

5.0 out of 5 stars Must read book for data mining
An excellent book. This book helped me to understand what data preparation is really about. Read this before start any data mining project.
Published on November 26, 2003 by al-dallas-tx

5.0 out of 5 stars Excellent book; a must for those who do data mining.
This book by Dorian Pyle is excellent! It really helps explain the details of how you prepare your data so that data mining can be most effective. I highly recommend it. Read more
Published on November 7, 2003 by Randall S. Collica

5.0 out of 5 stars A must have book
Anyone who practices data mining lives the issues discussed in this book. The book dissects and explains important data challenges. Read more
Published on December 26, 2002 by David B Fiery

5.0 out of 5 stars This book saved me when all else had failed.
I was in the market for some information on how to scale data before using it with a neural network. Read more
Published on September 10, 2001 by Megan Squire

2.0 out of 5 stars Content is good, but source code contains errors!
This book covers the topic of data preparation very thoroughly.However, it includes source code that won't compile. Read more
Published on September 27, 1999

Only search this product's reviews



Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 


Active discussions in related forums
Search Customer Discussions
Search all Amazon discussions
   




Product Information from the Amapedia Community

Beta (What's this?)


Look for Similar Items by Category


Look for Similar Items by Subject

 

Feedback

If you need help or have a question for Customer Service, contact us.
 Would you like to update product info or give feedback on images?
Is there any other feedback you would like to provide?

Your comments can help make our site better for everyone.


Your Recent History

 (What's this?)

After viewing product detail pages or search results, look here to find an easy way to navigate back to pages you are interested in.