Pentaho Kettle Solutions and over one million other books are available for Amazon Kindle. Learn more
Buy New
$34.33
Qty:1
  • List Price: $50.00
  • Save: $15.67 (31%)
FREE Shipping on orders over $35.
Only 18 left in stock (more on the way).
Ships from and sold by Amazon.com.
Gift-wrap available.
Add to Cart
Trade in your item
Get a $10.35
Gift Card.
Have one to sell? Sell on Amazon
Flip to back Flip to front
Listen Playing... Paused   You're listening to a sample of the Audible audio edition.
Learn more
See this image

Pentaho Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration Paperback – September 28, 2010

ISBN-13: 978-0470635179 ISBN-10: 0470635177 Edition: 1st

Buy New
Price: $34.33
31 New from $26.67 18 Used from $25.00
Amazon Price New from Used from
Kindle
"Please retry"
Paperback
"Please retry"
$34.33
$26.67 $25.00
Free%20Two-Day%20Shipping%20for%20College%20Students%20with%20Amazon%20Student


Frequently Bought Together

Pentaho Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration + Pentaho Solutions: Business Intelligence and Data Warehousing with Pentaho and MySQL + Pentaho Data Integration 4 Cookbook
Price for all three: $108.63

Buy the selected items together

NO_CONTENT_IN_FEATURE

Save up to 90% on Textbooks
Rent textbooks, buy textbooks, or get up to 80% back when you sell us your books. Shop Now

Product Details

  • Paperback: 720 pages
  • Publisher: Wiley; 1 edition (September 28, 2010)
  • Language: English
  • ISBN-10: 0470635177
  • ISBN-13: 978-0470635179
  • Product Dimensions: 9.1 x 7.5 x 1.6 inches
  • Shipping Weight: 2.4 pounds (View shipping rates and policies)
  • Average Customer Review: 4.5 out of 5 stars  See all reviews (11 customer reviews)
  • Amazon Best Sellers Rank: #713,225 in Books (See Top 100 in Books)

Editorial Reviews

From the Back Cover

The ultimate resource on building and deploying data integration solutions with Kettle

Kettle is a scaleable and extensible open source ETL and data integration tool that lets you extract data from databases, flat and XML files, web services, ERP systems, and OLAP cubes. It provides over 120 built-in transformation steps to validate, cleanse, and conform data, as well as numerous options to load data into data warehouses and many other targets. Kettle is a comprehensive, low-cost alternative to traditional data integration tools like Informatica PowerCenter, IBM InfoSphere DataStage, and BusinessObjects Data Integrator.

This book explains in detail how to use Kettle to create, test, and deploy your own ETL and data integration solutions. You'll learn to use Kettle's programs to create transformations and jobs, use version control, audit data, and schedule your ETL solution. Then you'll progress to more advanced concepts such as clustering and cloud computing, real-time data integration, loading a Data Vault model, and extending Kettle by building your own plugins. In addition, you'll find hands-on examples and case studies that show exactly how to put Kettle's features into practice.

  • Explore the components of the Kettle ETL toolset

  • Discover how to install and configure Kettle and connect it to various data sources and targets

  • Design and build every aspect of an ETL solution using Kettle

  • Learn how to load a data warehouse with Kettle

  • Understand the steps for deploying and scheduling ETL solutions

  • Gain the skills to integrate Kettle with third-party products

  • Learn to extend Kettle and build your own plugins

  • Use clustering and cloud computing to scale and improve the performance of your Kettle ETL solutions

  • Find out how to use Kettle for real-time data integration

About the Author

Matt Casters is Founder of Kettle and works as Chief Data Integration at Pentaho, where he leads Kettle software development. Roland Bouman is an application developer focusing on open source web technology, databases, and business intelligence. Jos van Dongen is an independent business intelligence consultant and well-known author, analyst, and presenter.

Customer Reviews

4.5 out of 5 stars
5 star
7
4 star
3
3 star
1
2 star
0
1 star
0
See all 11 customer reviews
If you're using PDI, you need this book.
Brian C Hagan
The guys who developed the Pentaho Data Integration, aka PDI or Kettle, teamed to write a definitive book on the software.
Fãbio de Salles
I can honestly say, having this book sitting on your desk is better than having Matt sitting on your desk.
Doug Moran

Most Helpful Customer Reviews

7 of 8 people found the following review helpful By Doug Moran on November 1, 2010
Format: Paperback Verified Purchase
I wanted to do this review much sooner but I've been too busy using the book.

Jos and Roland have taken the proven formula they used in Pentaho Solutions and focused it on ETL and Kettle, AKA Pentaho Data Integration. Their magic formula is to seamlessly mix a product users guide with equal parts of real world examples and best practices training. With the addition of Matt Casters, Mr Kettle himself, the depth of knowledge in the book is now equal to it's breadth. The result is a book that you can read cover to cover and learn about all aspects of building and deploying ETL solutions, and is equally useful as a day to day reference.

The book is divided into five parts starting with an obligatory Getting Started. Getting Started, however, goes beyond the traditional "here's how to install it guide" and presents a nice tutorial on the sometimes confusing terminology and practices used in the data world. It explains how Kettle fits into this world and talks about the key concepts in Kettle. The first part ends with an excellent example ETL solution to populate a non trivial yet easily understood star schema. The example covers fact and dimension tables, change data capture, generating date dimensions and the ETL jobs and transforms required to populate the data.

The organization of the second part of the book is based on the 34 subsystems of ETL as defined by Ralph Kimball in "The Data Warehouse Lifecycle Toolkit", considered by many (including me) as the bible of data warehousing. For each subsystem, Kettle Solutions refers to the original chapters that describe the topic and provides examples on how to solve those issues using Kettle. It is a must have for anyone struggling with the concepts presented in the Kimble book.
Read more ›
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
1 of 1 people found the following review helpful By Stephanie Dozier on February 7, 2013
Format: Paperback Verified Purchase
The book is great the problem is that Pentaho has placed updates out for their product too frequently so now this book is no longer current.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
1 of 1 people found the following review helpful By VL on June 17, 2011
Format: Paperback Verified Purchase
Excellent book, very much hands on, if you are sceptical about using this amazing open source ETL solution (or any sustainable open source for that matter) this book will surely put you at ease...
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
By Brian C Hagan on December 30, 2013
Format: Paperback Verified Purchase
If you're using PDI, you need this book. It is the best reference available. The samples alone are worth the price of the book.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
By Fãbio de Salles on April 11, 2013
Format: Kindle Edition Verified Purchase
The guys who developed the Pentaho Data Integration, aka PDI or Kettle, teamed to write a definitive book on the software. Everything you always wanted to know about PDI but didn't know you needed! Plus a Dimensional Modeling chapter written by Kimball himself and an appendix teaching the basics of Data Vault, how to create one and use it to populate a dimensional model. Buy it! It is worth much more than they are asking for!
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
By John W Ballment on January 16, 2013
Format: Paperback Verified Purchase
While I haven't read this book end-to-end (and never planned to), it is my main reference for everything to do with PDI.

Everyone who is developing in PDI should have this book sitting beside them.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again

Customer Images

Search

What Other Items Do Customers Buy After Viewing This Item?