The Data WarehouseETL Toolkit and over one million other books are available for Amazon Kindle. Learn more


or
Sign in to turn on 1-Click ordering.
or
Amazon Prime Free Trial required. Sign up when you check out. Learn More
Kindle Edition
 
   
Sell Back Your Copy
For a $18.74 Gift Card
Trade in
More Buying Choices
Have one to sell? Sell yours here
The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleanin
 
 
Start reading The Data WarehouseETL Toolkit on your Kindle in under a minute.

Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.

The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleanin [Paperback]

Ralph Kimball (Author), Joe Caserta (Author)
4.9 out of 5 stars  See all reviews (15 customer reviews)

List Price: $45.00
Price: $33.68 & this item ships for FREE with Super Saver Shipping. Details
You Save: $11.32 (25%)
  Special Offers Available
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.
Only 9 left in stock--order soon (more on the way).
Want it delivered Monday, January 30? Choose One-Day Shipping at checkout. Details
Textbook Student FREE Two-Day Shipping for Students. Learn more

Formats

Amazon Price New from Used from
Kindle Edition $30.31  
Paperback $33.68  
Sell Back Your Copy for $18.74
Whether you buy it used on Amazon for $21.65 or somewhere else, you can sell it back through our Book Trade-In Program at the current price of $18.74.
Used Price$21.65
Trade-in Price$18.74
Price after
Trade-in
$2.91

Book Description

0764567578 978-0764567575 September 13, 2004 1
  • Cowritten by Ralph Kimball, the world's leading data warehousing authority, whose previous books have sold more than 150,000 copies
  • Delivers real-world solutions for the most time- and labor-intensive portion of data warehousing-data staging, or the extract, transform, load (ETL) process
  • Delineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouse
  • Offers proven time-saving ETL techniques, comprehensive guidance on building dimensional structures, and crucial advice on ensuring data quality

Special Offers and Product Promotions

  • Buy $50 in qualifying physical textbooks, get $5 in Amazon MP3 Credit. Here's how (restrictions apply)

Frequently Bought Together

The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleanin + The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling (Second Edition) + The Data Warehouse Lifecycle Toolkit
Price For All Three: $109.30

Show availability and shipping details

Buy the selected items together
  • In Stock.
    Ships from and sold by Amazon.com.
    This item ships for FREE with Super Saver Shipping. Details

  • The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling (Second Edition) $43.45

    In Stock.
    Ships from and sold by Amazon.com.
    This item ships for FREE with Super Saver Shipping. Details

  • The Data Warehouse Lifecycle Toolkit $32.17

    In Stock.
    Ships from and sold by Amazon.com.
    This item ships for FREE with Super Saver Shipping. Details



Editorial Reviews

From the Back Cover

The single most authoritative guide on the most difficult phase of building a data warehouse

The extract, transform, and load (ETL) phase of the data warehouse development life cycle is far and away the most difficult, time-consuming, and labor-intensive phase of building a data warehouse. Done right, companies can maximize their use of data storage; if not, they can end up wasting millions of dollars storing obsolete and rarely used data. Bestselling author Ralph Kimball, along with Joe Caserta, shows you how a properly designed ETL system extracts the data from the source systems, enforces data quality and consistency standards, conforms the data so that separate sources can be used together, and finally delivers the data in a presentation-ready format.

Serving as a road map for planning, designing, building, and running the back-room of a data warehouse, this book provides complete coverage of proven, timesaving ETL techniques. Beginning with a quick overview of ETL fundamentals, it then looks at ETL data structures, both relational and dimensional. The authors show how to build useful dimensional structures, providing practical examples of techniques.

Along the way you’ll learn how to:

  • Plan and design your ETL system
  • Choose the appropriate architecture from the many possible options
  • Build the development/test/production suite of ETL processes
  • Build a comprehensive data cleaning subsystem
  • Tune the overall ETL process for optimum performance

About the Author

RALPH KIMBALL, PhD, founder of the Kimball Group, has been a leading visionary in the data warehousing industry since 1982 and is one of today’s best-known speakers and educators. He is the author of several bestselling titles published on data warehousing, including The Data Warehouse Toolkit (Wiley).

JOE CASERTA is the founder of Caserta Concepts, LLC, a data warehousing consulting firm. He writes frequently for print and online magazines, and is an active contributor to DWList, the major online community for data warehousing professionals.


Product Details

  • Paperback: 528 pages
  • Publisher: Wiley; 1 edition (September 13, 2004)
  • Language: English
  • ISBN-10: 0764567578
  • ISBN-13: 978-0764567575
  • Product Dimensions: 9.2 x 7.4 x 1.1 inches
  • Shipping Weight: 1.7 pounds (View shipping rates and policies)
  • Average Customer Review: 4.9 out of 5 stars  See all reviews (15 customer reviews)
  • Amazon Best Sellers Rank: #101,811 in Books (See Top 100 in Books)

More About the Authors

Discover books, learn about writers, read author blogs, and more.

 

Customer Reviews

15 Reviews
5 star:
 (13)
4 star:
 (2)
3 star:    (0)
2 star:    (0)
1 star:    (0)
 
 
 
 
 
Average Customer Review
4.9 out of 5 stars (15 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

20 of 20 people found the following review helpful:
5.0 out of 5 stars Another strong Data Warehousing book from Ralph Kimball, November 23, 2004
By 
D. Mathews (Mountain View, CA United States) - See all my reviews
(REAL NAME)   
This review is from: The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleanin (Paperback)
In this book Ralph lays down a framework for constructing the DW ETL. This is useful not just in constructing quality ETL processes, but also because Ralph's works tend to 'set' standards in data warehousing. The format of this book is similar to the Lifecycle Toolkit. Ralph takes a very staged, logical approach to the material. Some sections are just great e.g. the chapters on Extraction and Development. A small amount of the material is repeated from the Lifecycle Toolkit and Dimensional Modeling books, but no more than is needed to make this book stand on its own.

Also like the other books, this one takes a vendor agnostic approach. While this may increase the shelf-life of the book, I would have appreciated some comparisons between the major vendors out there today.

Overall: I recommend this one as a buy, even if you have Ralph's other books.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


8 of 8 people found the following review helpful:
5.0 out of 5 stars Great coverage of the ETL building blocks, December 18, 2005
By 
Vincent Mcburney (Melbourne, Australia) - See all my reviews
(REAL NAME)   
This review is from: The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleanin (Paperback)
This is one of the few references out there providing the building blocks of good ETL design. There is plenty of technical documentation and forums out there that are specific to one ETL tool or DBMS but this is a better starting place for ETL developers. It is required reading as ETL projects often take short cuts in design, data quality and metadata management and reporting. This leads to very expensive Data Warehouse administration costs and often a complete rebuild of load jobs.

The book is relevent for people using most ETL or ELT tools and it will remain relevent for years even as the ETL products continue to advance and mature. It is targeted at DW but the basic flow of Extract, Clean, Conform and Deliver is suitable for most types of data loads.

Good coverage of the alternatives to traditional overnight bulk loads in the section on real-time ETL systems (also describes Microbatch) as the businesses and the major ETL vendors shift to SOA.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


12 of 14 people found the following review helpful:
5.0 out of 5 stars An almost complete dwh design with ETL orientation, March 21, 2005
By 
Massimiliano Celaschi (Graffignano, Viterbo Italy) - See all my reviews
This review is from: The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleanin (Paperback)
This book takes almost all issues in a data warehouse design and represents them oriented to ETL features. Actually, ETLing matches the whole of the data warehouse (more or less), so the need to describe them makes this book an autonomous work you can read without referring to previous books by Kimball. Besides, I think that some technical descriptions have been better performed here: in my experience it is impossible to undertake dwh activities without (at least) a sound knowledge about general features (indexes, use of a bulk loader vs. INSERT, etc.) of RDBMS, and this paper addresses them conveniently. On the other hand, the flat style used lacks to give evidence to the very significant issues, which happen so to be mixed up with less important statements; that demands to pay high attention while reading, but a blurring boundary between subtleties and trivialities seems to be a common shortcoming in dwh literature. Even with that flaw, the ETL Toolkit turn out as an outstanding reference to state of the art of dwh technology.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No

Share your thoughts with other customers: Create your own review
 
 
 
Most Recent Customer Reviews











Only search this product's reviews



Inside This Book (learn more)
First Sentence:
Ideally, you must start the design of your ETL system with one of the toughest challenges: surrounding the requirements. Read the first page
Key Phrases - Statistically Improbable Phrases (SIPs): (learn more)
event fact table, snapshot fact table, room metadata, surrogate key pipeline, data flow thread, current load table, data warehouse architect, data warehouse bus architecture, source system team, data warehouse team, target data warehouse, audit dimension, conformed dimensions, fact table records, error event table, graceful modifications, dimensional data warehouse, conforming steps, aggregate fact tables, new surrogate key, nested batches, separate fact tables, conformed data, sentinel file, aggregate navigator
Key Phrases - Capitalized Phrases (CAPs): (learn more)
Release Data Flow, Delivering Dimension Tables, Delivering Fact Tables, Reload Truncate, Zippy Cola, Jane Doe, Table Key, Data Quality Specialist, Data Warehouse Lifecycle Toolkit, New Hope, Batch Key, Expected Monthly Rows, Performance Monitor, Product Key, Source Key, Data Warehouse Toolkit, Expected Monthly Bytes, Len Positions, Second Edition, Ship Via, The Accuracy Dimension
New!
Books on Related Topics | Concordance | Text Stats
Browse Sample Pages:
Front Cover | Table of Contents | First Pages | Index | Back Cover | Surprise Me!
Search Inside This Book:





Tags Customers Associate with This Product

 (What's this?)
Click on a tag to find related items, discussions, and people.
 
(7)

Your tags: Add your first tag
 

Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 


Active discussions in related forums
Search Customer Discussions
Search all Amazon discussions
   
Related forums





Look for Similar Items by Category


Look for Similar Items by Subject