Pentaho Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration 1st Edition

4.6 out of 5 stars 13 customer reviews
ISBN-13: 978-0470635179
ISBN-10: 0470635177

Frequently Bought Together

  • Pentaho Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration
  • Pentaho Data Integration Cookbook Second Edition
  • Pentaho Data Integration Beginner's Guide, Second Edition

Editorial Reviews

From the Back Cover

The ultimate resource on building and deploying data integration solutions with Kettle

Kettle is a scalable and extensible open source ETL and data integration tool that lets you extract data from databases, flat and XML files, web services, ERP systems, and OLAP cubes. It provides over 120 built-in transformation steps to validate, cleanse, and conform data, as well as numerous options to load data into data warehouses and many other targets. Kettle is a comprehensive, low-cost alternative to traditional data integration tools like Informatica PowerCenter, IBM InfoSphere DataStage, and BusinessObjects Data Integrator.

This book explains in detail how to use Kettle to create, test, and deploy your own ETL and data integration solutions. You'll learn to use Kettle's programs to create transformations and jobs, use version control, audit data, and schedule your ETL solution. Then you'll progress to more advanced concepts such as clustering and cloud computing, real-time data integration, loading a Data Vault model, and extending Kettle by building your own plugins. In addition, you'll find hands-on examples and case studies that show exactly how to put Kettle's features into practice.

  • Explore the components of the Kettle ETL toolset

  • Discover how to install and configure Kettle and connect it to various data sources and targets

  • Design and build every aspect of an ETL solution using Kettle

  • Learn how to load a data warehouse with Kettle

  • Understand the steps for deploying and scheduling ETL solutions

  • Gain the skills to integrate Kettle with third-party products

  • Learn to extend Kettle and build your own plugins

  • Use clustering and cloud computing to scale and improve the performance of your Kettle ETL solutions

  • Find out how to use Kettle for real-time data integration

About the Author

Matt Casters is Founder of Kettle and works as Chief Data Integration at Pentaho, where he leads Kettle software development. Roland Bouman is an application developer focusing on open source web technology, databases, and business intelligence. Jos van Dongen is an independent business intelligence consultant and well-known author, analyst, and presenter.


Product Details

  • Paperback: 720 pages
  • Publisher: Wiley; 1 edition (September 28, 2010)
  • Language: English
  • ISBN-10: 0470635177
  • ISBN-13: 978-0470635179
  • Product Dimensions: 7.4 x 1.5 x 9.3 inches
  • Shipping Weight: 2.4 pounds
  • Average Customer Review: 4.6 out of 5 stars (13 customer reviews)
  • Amazon Best Sellers Rank: #368,640 in Books

Customer Reviews

Top Customer Reviews

Format: Paperback Verified Purchase
I wanted to do this review much sooner but I've been too busy using the book.

Jos and Roland have taken the proven formula they used in Pentaho Solutions and focused it on ETL and Kettle, AKA Pentaho Data Integration. Their magic formula is to seamlessly mix a product user's guide with equal parts real-world examples and best-practices training. With the addition of Matt Casters, Mr. Kettle himself, the depth of knowledge in the book is now equal to its breadth. The result is a book that you can read cover to cover to learn about all aspects of building and deploying ETL solutions, and one that is equally useful as a day-to-day reference.

The book is divided into five parts, starting with an obligatory Getting Started. Getting Started, however, goes beyond the traditional "here's how to install it" guide and presents a nice tutorial on the sometimes confusing terminology and practices used in the data world. It explains how Kettle fits into this world and covers the key concepts in Kettle. The first part ends with an excellent example ETL solution that populates a nontrivial yet easily understood star schema. The example covers fact and dimension tables, change data capture, generating date dimensions, and the ETL jobs and transformations required to populate the data.

The organization of the second part of the book is based on the 34 subsystems of ETL as defined by Ralph Kimball in "The Data Warehouse Lifecycle Toolkit", considered by many (including me) to be the bible of data warehousing. For each subsystem, Kettle Solutions refers to the original chapters that describe the topic and provides examples of how to solve those issues using Kettle. It is a must-have for anyone struggling with the concepts presented in the Kimball book.
Format: Paperback Verified Purchase
The book is great; the problem is that Pentaho has released updates to its product so frequently that this book is no longer current.
Format: Paperback
Recently, I received my own review copy of a long-awaited Pentaho book: Pentaho Kettle Solutions - Building Open Source ETL Solutions with Pentaho Data Integration.

The first book, Pentaho Solutions, was aimed at discovering the basics of BI and Pentaho usage. Now, with this new book, we go deeper into hardcore data processing and data warehousing using Kettle. But it is not exclusively focused on Kettle: a strong emphasis is placed on data processing basics, techniques, and theory (Codd vs. Kimball ...). Reading this book will take you to the next level on these two topics:

* Data processing and how to build and feed a data warehouse,
* Kettle development, customization, and advanced usage.

This book is a fantastic concentration of knowledge. You will learn about ETL basics, advanced topics, performance management, Kettle development, and cloud data processing. Matt, Roland, and Jos took on a risky challenge: writing a book that does the splits from basic knowledge to high-level techniques while staying focused on how to use Kettle to solve actual, concrete data problems.

They succeeded.

This book is now sitting on my reference BI shelf; it has entered my personal BI Book Hall of Fame.
Format: Kindle Edition Verified Purchase
I've given it 5 stars because for me the value I got out of it just in one chapter on the Data Vault was worth the money.

However, I would have wanted more coverage of Pentaho Data Integration (Kettle) itself. The online documentation is good but still leaves a lot to be desired in real, workable examples. The book does fill that need to an extent and covers some pretty decent logic flows.

It would have been nicer to have a step-by-step source-to-target guide with reasoning and explanations of the steps, and a LOT more focus on the tool than on ETL subsystems. I didn't care about the subsystem material because, as an experienced data integration expert, I don't need it.

What I wanted was a walkthrough of each step, debugging, tuning, patterns, and more on the tool's metadata, which is extremely powerful. I think it fell a bit short on this even though it did deliver to an extent. For someone wanting to learn Pentaho DI (Kettle) from scratch, it's really not enough, and it should have been.

The "Advanced Topics" are very well written and cover some ground that is inspiring.

Overall, it's a really good book.
Format: Paperback Verified Purchase
Finished _Pentaho Kettle Solutions_, finding it generally OK. For me, it spends too much time covering Kimball's "34 Subsystems of ETL", fitting Pentaho into that framework. However, I got quite a bit out of the fifth section of the book "Advanced Topics".
Format: Paperback Verified Purchase
While I haven't read this book end-to-end (and never planned to), it is my main reference for everything to do with PDI.

Everyone who is developing in PDI should have this book sitting beside them.
By VL on June 17, 2011
Format: Paperback Verified Purchase
Excellent book, very much hands-on. If you are sceptical about using this amazing open source ETL solution (or any sustainable open source for that matter), this book will surely put you at ease...