The Data WarehouseETL Toolkit and over one million other books are available for Amazon Kindle. Learn more
Qty:1
  • List Price: $45.00
  • Save: $11.87 (26%)
FREE Shipping on orders over $35.
In Stock.
Ships from and sold by Amazon.com.
Gift-wrap available.
The Data WarehouseETL Too... has been added to your Cart
+ $3.99 shipping
Used: Good | Details
Condition: Used: Good
Comment: Shows some signs of wear, and may have some markings on the inside. 100% Money Back Guarantee. Shipped to over one million happy customers. Your purchase benefits world literacy!
Access codes and supplements are not guaranteed with used items.
Sell yours for a Gift Card
We'll buy it for $4.90
Learn More
Trade in now
Have one to sell? Sell on Amazon
Flip to back Flip to front
Listen Playing... Paused   You're listening to a sample of the Audible audio edition.
Learn more
See all 2 images

The Data WarehouseETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data Paperback – October 1, 2004

ISBN-13: 978-0764567575 ISBN-10: 0764567578 Edition: 1st

Buy New
Price: $33.13
53 New from $19.84 38 Used from $11.43
Amazon Price New from Used from
Kindle
"Please retry"
Paperback
"Please retry"
$33.13
$19.84 $11.43
Free Two-Day Shipping for College Students with Amazon Student Free%20Two-Day%20Shipping%20for%20College%20Students%20with%20Amazon%20Student


Get Up to 80% Back When You Sell Us Your Books
$33.13 FREE Shipping on orders over $35. In Stock. Ships from and sold by Amazon.com. Gift-wrap available.

Frequently Bought Together

The Data WarehouseETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data + The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling + Agile Data Warehouse Design: Collaborative Dimensional Modeling, from Whiteboard to Star Schema
Price for all three: $113.62

Buy the selected items together
NO_CONTENT_IN_FEATURE

Shop the New Digital Design Bookstore
Check out the Digital Design Bookstore, a new hub for photographers, art directors, illustrators, web developers, and other creative individuals to find highly rated and highly relevant career resources. Shop books on web development and graphic design, or check out blog posts by authors and thought-leaders in the design industry. Shop now

Product Details

  • Paperback: 528 pages
  • Publisher: Wiley; 1 edition (October 1, 2004)
  • Language: English
  • ISBN-10: 0764567578
  • ISBN-13: 978-0764567575
  • Product Dimensions: 7.4 x 1.1 x 9.2 inches
  • Shipping Weight: 1.7 pounds (View shipping rates and policies)
  • Average Customer Review: 4.5 out of 5 stars  See all reviews (30 customer reviews)
  • Amazon Best Sellers Rank: #37,857 in Books (See Top 100 in Books)

Editorial Reviews

From the Back Cover

The single most authoritative guide on the most difficult phase of building a data warehouse

The extract, transform, and load (ETL) phase of the data warehouse development life cycle is far and away the most difficult, time-consuming, and labor-intensive phase of building a data warehouse. Done right, companies can maximize their use of data storage; if not, they can end up wasting millions of dollars storing obsolete and rarely used data. Bestselling author Ralph Kimball, along with Joe Caserta, shows you how a properly designed ETL system extracts the data from the source systems, enforces data quality and consistency standards, conforms the data so that separate sources can be used together, and finally delivers the data in a presentation-ready format.

Serving as a road map for planning, designing, building, and running the back-room of a data warehouse, this book provides complete coverage of proven, timesaving ETL techniques. Beginning with a quick overview of ETL fundamentals, it then looks at ETL data structures, both relational and dimensional. The authors show how to build useful dimensional structures, providing practical examples of techniques.

Along the way you’ll learn how to:

  • Plan and design your ETL system
  • Choose the appropriate architecture from the many possible options
  • Build the development/test/production suite of ETL processes
  • Build a comprehensive data cleaning subsystem
  • Tune the overall ETL process for optimum performance

About the Author

RALPH KIMBALL, PhD, founder of the Kimball Group, has been a leading visionary in the data warehousing industry since 1982 and is one of today’s best-known speakers and educators. He is the author of several bestselling titles published on data warehousing, including The Data Warehouse Toolkit (Wiley).

JOE CASERTA is the founder of Caserta Concepts, LLC, a data warehousing consulting firm. He writes frequently for print and online magazines, and is an active contributor to DWList, the major online community for data warehousing professionals.

Customer Reviews

A great basic tool book for datawarehousing and ETL.
Judy S. Hallstrom
This is a very good book from the Data Warehouse toolkit series by Ralph Kimball et al.
Dmitry Dvoinikov
It was really a very good book for those need to know the concepts..
HajaM

Most Helpful Customer Reviews

23 of 24 people found the following review helpful By D. Mathews on November 23, 2004
Format: Paperback
In this book Ralph lays down a framework for constructing the DW ETL. This is useful not just in constructing quality ETL processes, but also because Ralph's works tend to 'set' standards in data warehousing. The format of this book is similar to the Lifecycle Toolkit. Ralph takes a very staged, logical approach to the material. Some sections are just great e.g. the chapters on Extraction and Development. A small amount of the material is repeated from the Lifecycle Toolkit and Dimensional Modeling books, but no more than is needed to make this book stand on its own.

Also like the other books, this one takes a vendor agnostic approach. While this may increase the shelf-life of the book, I would have appreciated some comparisons between the major vendors out there today.

Overall: I recommend this one as a buy, even if you have Ralph's other books.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
10 of 10 people found the following review helpful By Vincent Mcburney on December 18, 2005
Format: Paperback
This is one of the few references out there providing the building blocks of good ETL design. There is plenty of technical documentation and forums out there that are specific to one ETL tool or DBMS but this is a better starting place for ETL developers. It is required reading as ETL projects often take short cuts in design, data quality and metadata management and reporting. This leads to very expensive Data Warehouse administration costs and often a complete rebuild of load jobs.

The book is relevent for people using most ETL or ELT tools and it will remain relevent for years even as the ETL products continue to advance and mature. It is targeted at DW but the basic flow of Extract, Clean, Conform and Deliver is suitable for most types of data loads.

Good coverage of the alternatives to traditional overnight bulk loads in the section on real-time ETL systems (also describes Microbatch) as the businesses and the major ETL vendors shift to SOA.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
13 of 15 people found the following review helpful By Massimiliano Celaschi on March 21, 2005
Format: Paperback
This book takes almost all issues in a data warehouse design and represents them oriented to ETL features. Actually, ETLing matches the whole of the data warehouse (more or less), so the need to describe them makes this book an autonomous work you can read without referring to previous books by Kimball. Besides, I think that some technical descriptions have been better performed here: in my experience it is impossible to undertake dwh activities without (at least) a sound knowledge about general features (indexes, use of a bulk loader vs. INSERT, etc.) of RDBMS, and this paper addresses them conveniently. On the other hand, the flat style used lacks to give evidence to the very significant issues, which happen so to be mixed up with less important statements; that demands to pay high attention while reading, but a blurring boundary between subtleties and trivialities seems to be a common shortcoming in dwh literature. Even with that flaw, the ETL Toolkit turn out as an outstanding reference to state of the art of dwh technology.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
6 of 6 people found the following review helpful By Andre Ackermann on June 27, 2006
Format: Paperback
I am currently working as an ETL Developer at a company

Fourier Approach, Centurion, South Africa.

Most of the time this is a fairly hot seat -

because so many business requirements are dependant on the

Quality of Information produced by the ETL process.

I always asked myself,

* Am I doing the right thing?

* Is this the best solution?

* How would other developers do this?

A while ago I attended the course

"ETL Architecture and Design Workshop"

presented by Joe Caserta, and hosted by Alicornio Africa in Johannesburg, South Africa.

Before the presentation we received a copy of the book

"The Data Warehouse ETL Toolkit".

This changed my whole perspective.

The book adressed all my ETL questions,

with examples from real-world situations.

It covers the whole ETL process and gives answers

to almost every question you will ever think of asking.

I must say this is a very handy tool on the desk of any serious ETL Developer.

Regards,

André Ackermann

ETL Developer
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
10 of 12 people found the following review helpful By Franklin Hu on June 8, 2009
Format: Paperback
For a book which has "ETL Toolkit" in the name, I would think this would get right down to the nuts and bolts of how to write an ETL system. Yet, after reading this book, I still have the feeling that I still wouldn't know how to build one. From the book, I know what the issues are about inserting, deleting, and organizing the data, but how does that translate into SQL which actually does the work? There are hardly any SQL or sample code to go by. Issues like dealing with multiple database sources are mentioned, but just what do you do when cities are mentioned in both databases and they don't quite match up? If you decide to roll your own, the book really doesn't give you any basic framework to follow. This book does provide important insights on ETL systems, but don't expect to be able to write one based on this book. For example, you are told to bulk insert data, but you are also expected to exchange all of the natural keys with surrogate keys. If you have to examine each line and exchange keys, you cannot bulk load the data and this ends up being a very CPU intensive process where each row is laboriously examined, exchanged and then inserted one row at a time. This is a very practical ETL difficulty but yet, the book doesn't deal with this problem. It just assumes this won't be a problem for you. In some ways, the book is more about selling you the 'Kimbal' way of data warehouses as other more intuitive ways of constructing a data warehouse are not even considered. Like a sales pitch and a fancy powerpoint presentation, it lacks the substance to actually create a well performing ETL process. That is left as an exercise for the reader.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again

Most Recent Customer Reviews


More About the Authors

Discover books, learn about writers, read author blogs, and more.
Want to discover more products? Check out this page to see more: crm vendors

Set up an Amazon Giveaway

Amazon Giveaway allows you to run promotional giveaways in order to create buzz, reward your audience, and attract new followers and customers. Learn more
The Data WarehouseETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data
This item: The Data WarehouseETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data
Price: $33.13
Ships from and sold by Amazon.com