or
Sign in to turn on 1-Click ordering.
or
Amazon Prime Free Trial required. Sign up when you check out. Learn More
Kindle Edition
Read instantly on your iPad, PC or Mac, no Kindle required
Buy Price: $54.59
Rent From: $21.84
 
 
 
Sell Back Your Copy
For a $1.77 Gift Card
Trade in
More Buying Choices
Have one to sell? Sell yours here
Data Preparation for Data Mining Using SAS (The Morgan Kaufmann Series in Data Management Systems)
 
 

Data Preparation for Data Mining Using SAS (The Morgan Kaufmann Series in Data Management Systems) [Paperback]

Mamdouh Refaat (Author)
3.7 out of 5 stars  See all reviews (6 customer reviews)

List Price: $80.95
Price: $79.01 & this item ships for FREE with Super Saver Shipping. Details
You Save: $1.94 (2%)
  Special Offers Available
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.
Only 7 left in stock--order soon (more on the way).
Want it delivered Friday, February 3? Choose One-Day Shipping at checkout. Details
Textbook Student FREE Two-Day Shipping for students on millions of items. Learn more

Formats

Amazon Price New from Used from
Kindle Edition
Rent from
$54.59
$21.84
 
Paperback $79.01  

Book Description

0123735777 978-0123735775 October 13, 2006
Are you a data mining analyst, who spends up to 80% of your time assuring data quality, then preparing that data for developing and deploying predictive models? And do you find lots of literature on data mining theory and concepts, but when it comes to practical advice on developing good mining views find little "how to" information? And are you, like most analysts, preparing the data in SAS?

This book is intended to fill this gap as your source of practical recipes. It introduces a framework for the process of data preparation for data mining, and presents the detailed implementation of each step in SAS. In addition, business applications of data mining modeling require you to deal with a large number of variables, typically hundreds if not thousands. Therefore, the book devotes several chapters to the methods of data transformation and variable selection.

FEATURES
* A complete framework for the data preparation process, including implementation details for each step.
* The complete SAS implementation code, which is readily usable by professional analysts and data miners.
* A unique and comprehensive approach for the treatment of missing values, optimal binning, and cardinality reduction.
* Assumes minimal proficiency in SAS and includes a quick-start chapter on writing SAS macros.
* CD includes dozens of SAS macros plus the sample data and the program for the book's case study.

Special Offers and Product Promotions

  • Buy $50 in qualifying physical textbooks, get $5 in Amazon MP3 Credit. Here's how (restrictions apply)

Frequently Bought Together

Data Preparation for Data Mining Using SAS (The Morgan Kaufmann Series in Data Management Systems) + Decision Trees for Business Intelligence and Data Mining: Using SAS Enterprise Miner + Predictive Modeling With SAS Enterprise Miner: Practical Solutions for Business Applications
Price For All Three: $171.58

Show availability and shipping details

Buy the selected items together


Editorial Reviews

Review

It is easy to write books that address broad topics and ideas leaving the reader with the question "Yes, but how?" By combining a comprehensive guide to data preparation for data mining along with specific examples in SAS, Mamdouh's book is a rare find-a blend of theory and the practical at the same time. As anyone who has mined data will confess, 80% of the problem is in data preparation; Mamdouh addresses this difficult subject with strong practical techniques and methods.

If you are working on an SAS data mining project, this book is a must! If you are working on any data mining project, the techniques and methods will be a guiding light! --Frank Byrum, Chief Scientist, CorMine Intelligent Data, LLC

From the Back Cover

"It is easy to write books that address broad topics and ideas leaving the reader with the question "Yes, but how?" By combining a comprehensive guide to data preparation for data mining along with specific examples in SAS, Mamdouh's book is a rare find-a blend of theory and the practical at the same time. As anyone who has mined data will confess, 80% of the problem is in data preparation; Mamdouh addresses this difficult subject with strong practical techniques and methods.

If you are working on an SAS data mining project, this book is a must! If you are working on any data mining project, the techniques and methods will be a guiding light!" --Frank Byrum, Chief Scientist, CorMine Intelligent Data, LLC

Are you a data mining analyst, like many, that spends up to 80% of your time on assuring data quality and preparing that data for developing and deploying predictive models? And do you find lots of literature on data mining theory and concepts, but when it comes to practical advice on developing good mining views find little "how to" information? And are you, like most analysts, preparing the data in SAS?

This book is intended to fill this gap as your source of practical recipes. It introduces a framework for the process of data preparation for data mining, and presents the detailed implementation of each step in SAS. In addition, business applications of data mining modeling require you to deal with a large number of variables, typically hundreds if not thousands. Therefore, the book devotes several chapters to the methods of data transformation and variable selection.

FEATURES
. A complete framework for the data preparation process, including implementation details for each step.
. The complete SAS implementation code, which is readily usable by professional analysts and data miners;
. A unique and comprehensive approach for the treatment of missing values, optimal binning, and cardinality reduction;
. Assumes minimal proficiency in SAS and includes a quick-start chapter on writing SAS macros.
. CD includes dozens of SAS macros plus the sample data and the program for the book's case study.

Mamdouh Refaat is a data mining and business analytics consultant advising major organizations in North America and Europe. He has held several positions in consulting organizations and software vendors, including the director of consulting services at ANGOSS Software Corporation, a global data mining software and service provider.
During his career, Mamdouh has managed numerous data mining consulting projects in marketing, CRM, and credit risk for Fortune 500 organizations in North America and Europe. In addition, he has delivered over 50 professional training courses in data mining and business analytics.
Mamdouh holds a Ph.D. in Engineering from the University of Toronto, and an MBA from the University of Leeds.

Product Details

  • Paperback: 424 pages
  • Publisher: Morgan Kaufmann (October 13, 2006)
  • Language: English
  • ISBN-10: 0123735777
  • ISBN-13: 978-0123735775
  • Product Dimensions: 9.3 x 7.5 x 1.1 inches
  • Shipping Weight: 2 pounds (View shipping rates and policies)
  • Average Customer Review: 3.7 out of 5 stars  See all reviews (6 customer reviews)
  • Amazon Best Sellers Rank: #1,380,212 in Books (See Top 100 in Books)

More About the Author

Discover books, learn about writers, read author blogs, and more.

 

Customer Reviews

6 Reviews
5 star:
 (3)
4 star:
 (1)
3 star:    (0)
2 star:
 (1)
1 star:
 (1)
 
 
 
 
 
Average Customer Review
3.7 out of 5 stars (6 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

4 of 5 people found the following review helpful:
5.0 out of 5 stars Numerical recipes for Data Preparation, February 26, 2007
This review is from: Data Preparation for Data Mining Using SAS (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
It is well known that the majority of time spent doing data mining is in the data preparation. With this book, Dr. Refaat has given analysts the tools they need to perform the often laborious task of data preparation. If you are familiar with the Numerical Recipes series of books (and if you aren't, you should be), you will recognize a similar theme. The author has:
* itemized all the elements the data miner should be aware of when they are doing data preparation,
* presented enough technical description for the reader to understand why they would be performing that particular task,
* provided all the SAS code that can be used to actually perform the data preparation step

This book is what the practicing data miner needs.

Here's a brief on the table of contents:

1. Introduction
* setting the context of data mining

2. Tasks and Data Flow
* describes what data mining can do and where data preparation fits in

3. Review of Data Mining Modeling Techniques
* an overview of data mining techniques

4. SAS Macros: A Quick Start
* just in case you haven't worked with SAS macros

5. Data Acquisition and Integration
* where you get your data from and how it's pulled together

6. Integrity Checks
* how to make sure the data is correct and even what "correct" means

7. Exploratory Data Analysis
* get to know your data

8. Sampling and Partition
* dealing with large data sets as well as getting ready to validate the models you build

9. Data Transformations
* rarely is your source data in the form most effective for modeling - this chapter describes what can be done to produce the most effective models

10. Binning and Reduction of Cardinality
* make your variables less complex and often times, more presentable and understandable

11. Treatment of Missing Values
* you will have missing values in your data - here are several approaches for dealing with them

12. Predictive Power and Variable Reduction I
* introduces the concept of identifying usefulness of input variables and reducing the required number of variables

13. Analysis of Nominal and Ordinal Variables
* how to evaluate relationships with discrete variables

14. Analysis of Continuous Variables
* how to evaluate relationships with continuous variables

15. Principal Component Analysis
* how to use PCA for variable reduction during data preparation

16. Factor Analysis
* how to use Factor Analysis for variable reduction during data preparation

17. Predictive Power and Variable Reduction II
* defines methods of simplifying and reducing input variables with respect to the target variable

18. Putting It All Together
* a case study showing the application of all these techniques for data preparation in a realistic example

Appendix. Listing of SAS Macros
* complete listing of all the SAS code referenced in the book - also included on the CD
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


1 of 1 people found the following review helpful:
2.0 out of 5 stars SAS Codes Do Not Run, February 12, 2009
By 
L. Tian (Chicago, IL) - See all my reviews
(REAL NAME)   
This review is from: Data Preparation for Data Mining Using SAS (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
The book has a lot of useful SAS macros, but some of them do not even run at all. One macro (Decompose) is even missing on the DVD and somehow it is called in another macro in an inappropriate way. Should have debugged the codes before publishing it. So buyer be ware.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


4 of 6 people found the following review helpful:
5.0 out of 5 stars The best data prep book so far, February 13, 2007
This review is from: Data Preparation for Data Mining Using SAS (The Morgan Kaufmann Series in Data Management Systems) (Paperback)
I have been working in data mining and with SAS for the last 10 years. This is the best book without doubt. It is consice, to the point, not a lot of fluf and useless theory. It teaches you how to actually do it! The book took me step by step through the process of data preparation using SAS and let me write fantastic macros. All the macros are included in the CD and are ready to run. I strongly recommend this book to anyone who is using SAS to work the data either for reporting or for modeling. I attended many training courses on data mining, and even data preparation, but nothing is like this book. It reveals all the secrets. For example, how to bin variables using Gini, how to select the best modeling variables using Entropy, Ginin, Chi2, how to reduce the variables using principal component analysis, treatment of missing values occupies and huge chapter that in my opinion has no competitors, mapping categorical variables into dummy variables, reduction of cardinality using Gini (best grouping). All these things until now were the secrets of the 'gurus', not any more thanks to Dr. Refaat and his book! For example, I used to use a decision tree software to select the best variables, then use logistic regression to build the models. Not any more. With the SAS programs in the book, I can now select the best variables and build the model within one SAs script.... I only wish if the author would also write a similar book on modeling... This book is a life saver ...
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No

Share your thoughts with other customers: Create your own review
 
 
 
Most Recent Customer Reviews




Only search this product's reviews



Inside This Book (learn more)
Key Phrases - Statistically Improbable Phrases (SIPs): (learn more)
proc sql noprint, scoring view, call symput, mining view, monotone missing pattern, temp freqs, temp chi, cardinality reduction, data preparation procedures, temp dataset, work nolist, temp cats, proc score, using proc freq, current node list, data preparation process, output dataset, temporary dataset, new indicator variables, mapping dataset, validation partitions, missingness pattern, data temp, bin limits, data mining modeling
Key Phrases - Capitalized Phrases (CAPs): (learn more)
Parameter Description, Pearson Chi-squared, Total Records, Age Status, Balance Status, Temp Vars, Var Variable, Avg Imp, Bank Factors, Date Today
New!
Books on Related Topics | Concordance | Text Stats
Browse Sample Pages:
Front Cover | Table of Contents | First Pages | Index | Surprise Me!
Search Inside This Book:



What Other Items Do Customers Buy After Viewing This Item?


Tags Customers Associate with This Product

 (What's this?)
Click on a tag to find related items, discussions, and people.
 

Your tags: Add your first tag
 

Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 


Active discussions in related forums
Search Customer Discussions
Search all Amazon discussions
   
Related forums



So You'd Like to...


Create a guide


Look for Similar Items by Category


Look for Similar Items by Subject