Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems) [Paperback]

Dorian Pyle (Author)
4.5 out of 5 stars (13 customer reviews)


Book Description

The Morgan Kaufmann Series in Data Management Systems April 5, 1999

Data Preparation for Data Mining addresses an issue unfortunately ignored by most authorities on data mining: data preparation. Thanks largely to its perceived difficulty, data preparation has traditionally taken a backseat to the more alluring question of how best to extract meaningful knowledge. But without adequate preparation of your data, the return on the resources invested in mining is certain to be disappointing.

Dorian Pyle corrects this imbalance. A twenty-five-year veteran of what has become the data mining industry, Pyle shares his own successful data preparation methodology, offering both a conceptual overview for managers and complete technical details for IT professionals. Apply his techniques and watch your mining efforts pay off in the form of improved performance, reduced distortion, and more valuable results.

On the enclosed CD-ROM, you'll find a suite of programs as C source code and compiled into a command-line-driven toolkit. This code illustrates how the author's techniques can be applied to arrive at an automated preparation solution that works for you. Also included are demonstration versions of three commercial products that help with data preparation, along with sample data with which you can practice and experiment.

* Offers in-depth coverage of an essential but largely ignored subject.
* Goes far beyond theory, leading you, step by step, through the author's own data preparation techniques.
* Provides practical illustrations of the author's methodology using realistic sample data sets.
* Includes algorithms you can apply directly to your own project, along with instructions for understanding when automation is possible and when greater intervention is required.
* Explains how to identify and correct data problems that may be present in your application.
* Prepares miners, helping them head into preparation with a better understanding of data sets and their limitations.



Editorial Reviews

About the Author

Dorian Pyle is Chief Scientist and Founder of PTI (www.pti.com), which develops and markets Powerhouse™ predictive and explanatory analytics software. Dorian has over 20 years' experience in artificial intelligence and machine learning techniques, which are used in what is known today as "data mining" or "predictive analytics." He has applied this knowledge as a consultant with Knowledge Stream Partners, Xchange, Naviant, Thinking Machines, and Data Miners, with various companies directly involved in credit card marketing for banks, and with manufacturing companies using industrial automation. In 1976 he was involved in building artificially intelligent machine learning systems utilizing the pioneering technologies that are currently known as neural computing and associative memories. He is current in and familiar with the most advanced technologies in data mining, including entropic analysis (information theory), chaotic and fractal decomposition, neural technologies, evolution and genetic optimization, algebra evolvers, case-based reasoning, concept induction, and other advanced statistical techniques.


Product Details

  • Paperback: 560 pages
  • Publisher: Morgan Kaufmann; 1 edition (April 5, 1999)
  • Language: English
  • ISBN-10: 1558605290
  • ISBN-13: 978-1558605299
  • Product Dimensions: 9.1 x 7.3 x 1.2 inches
  • Shipping Weight: 2.2 pounds
  • Amazon Best Sellers Rank: #656,038 in Books


Customer Reviews

13 Reviews
5 star: (11)
4 star: (0)
3 star: (0)
2 star: (1)
1 star: (1)
 
 
 
 
 
Most Helpful Customer Reviews

28 of 30 people found the following review helpful:
5.0 out of 5 stars A must have..., April 27, 1999
Amazon Verified Purchase
I've had the pleasure of listening to Dorian speak at seminars and even sharing a few brief words with him in person. When he mentioned to me last year that he was working on this book I had no idea how thorough and complete it would be. In fact, I remember wondering to myself how anyone could get their hands around this difficult yet important aspect of data mining. I'm in awe! Anyone in the trenches will immediately understand the value of this book. Those just getting started in data mining will probably have no idea how much simpler their job just became. My only criticism of this book is that its title obscures the fact that there is a wealth of general data mining information contained within it, practical well beyond the data preparation phase. To understand why and how certain data preparation techniques work is to go a long way towards appreciating subtleties throughout the rest of the data mining process. Thanks Dorian!


18 of 20 people found the following review helpful:
5.0 out of 5 stars The true data miner toolkit, May 21, 1999
This book is simply great. It provides best practices for the essence of data mining: data preparation.

Any analyst knows that 80% of data mining time is spent on data preparation; nevertheless, most authors focus on the remaining 20%: techniques and tools, where the value is more "visible."

The truth is that this value is unavailable unless you get the data prepared, and that's the difficult job.

Dorian's book allows you to understand this obscure process through a great analytical process. Strongly recommended.



9 of 9 people found the following review helpful:
5.0 out of 5 stars Great book about data and Data Mining, October 22, 2003
The book "Data Preparation for Data Mining" is not a typical Data Mining book. Nowadays a classic Data Mining book contains a methodology for solving a class of standard problems. This amounts to a set of prescriptions and recipes for developing solutions to customer loyalty, retention, etc.
The book by Dorian Pyle is different. It is not a Data Mining book because, as the author claims, Data Mining is only part of a wider subject, which he calls Data Exploration. He shows us a wider spectrum of data-related subjects than you can find in other books. He gives us a good background that helps to recognize the source of problems with data. Some subjects cannot be found in other sources; they come directly from the author's rich experience. To summarize, the author has managed to describe the whole subject area concerning data, which is not found in other books on this topic. To gain that knowledge otherwise, we would have to read many other publications from different areas.
Inside This Book
First Sentence:
"Data exploration starts with data, right?"
Key Phrases - Statistically Improbable Phrases (SIPs):
unprepared data, using prepared data, data preparation techniques, alpha labels, populated variables, data exploration process, appropriate numeration, unit state space, data preparation process, softmax scaling, information enfolded, entropic analysis, monotonic variable, predicting origin, alpha variables, entropy map, demonstration code, joint variability, variability capture, replacing missing values, capturing variability, composite waveform, complexity map, novelty detector, bin values
Key Phrases - Capitalized Phrases (CAPs):
Complete Content, Montreal Canadiens, Paul Revere, Data Issue, Frequency Figure, Handling Nonnumerical Variables, Continua of Attributes of Variables, Getting the Data, Identifying Problems, Measuring the World, Redistributing Variable Values, First Catch Your Hare, Old North Church, Prepared Information Environment Output, The Bennigans