HBase: The Definitive Guide and over one million other books are available for Amazon Kindle. Learn more



or
Sign in to turn on 1-Click ordering
Sell Us Your Item
For a $7.25 Gift Card
Trade in
More Buying Choices
Have one to sell? Sell yours here
Start reading HBase: The Definitive Guide on your Kindle in under a minute.

Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.
Sorry, this item is not available in
Image not available for
Color:
Image not available

To view this video download Flash Player

 

HBase: The Definitive Guide [Paperback]

Lars George
4.4 out of 5 stars  See all reviews (8 customer reviews)

List Price: $39.99
Price: $32.05 & FREE Shipping. Details
You Save: $7.94 (20%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
Only 19 left in stock (more on the way).
Ships from and sold by Amazon.com. Gift-wrap available.
Want it tomorrow, May 23? Choose One-Day Shipping at checkout. Details

Formats

Amazon Price New from Used from
Kindle Edition $17.27  
Paperback $32.05  
Shop the new tech.book(store)
New! Introducing the tech.book(store), a hub for Software Developers and Architects, Networking Administrators, TPMs, and other technology professionals to find highly-rated and highly-relevant career resources. Shop books on programming and big data, or read this week's blog posts by authors and thought-leaders in the tech industry. > Shop now

Book Description

September 20, 2011

If you're looking for a scalable storage solution to accommodate a virtually endless amount of data, this book shows you how Apache HBase can fulfill your needs. As the open source implementation of Google's BigTable architecture, HBase scales to billions of rows and millions of columns, while ensuring that write and read performance remain constant. Many IT executives are asking pointed questions about HBase. This book provides meaningful answers, whether you’re evaluating this non-relational database or planning to put it into practice right away.

  • Discover how tight integration with Hadoop makes scalability with HBase easier
  • Distribute large datasets across an inexpensive cluster of commodity servers
  • Access HBase with native Java clients, or with gateway servers providing REST, Avro, or Thrift APIs
  • Get details on HBase’s architecture, including the storage format, write-ahead log, background processes, and more
  • Integrate HBase with Hadoop's MapReduce framework for massively parallelized data processing jobs
  • Learn how to tune clusters, design schemas, copy tables, import bulk data, decommission nodes, and many other tasks

Frequently Bought Together

HBase: The Definitive Guide + Hadoop: The Definitive Guide + Programming Hive
Price for all three: $91.11

Buy the selected items together

Customers Who Bought This Item Also Bought


Editorial Reviews

About the Author

Lars George has been involved with HBase since 2007, and became a full HBase committer in 2009. He has spoken at various Hadoop User Group meetings, as well as large conferences such as FOSDEM in Brussels. He also started the Munich OpenHUG meetings. He now works closely with Cloudera to support Hadoop and HBase in and around Europe through technical support, consulting work, and training.


Product Details

  • Paperback: 556 pages
  • Publisher: O'Reilly Media; 1 edition (September 20, 2011)
  • Language: English
  • ISBN-10: 1449396100
  • ISBN-13: 978-1449396107
  • Product Dimensions: 6 x 1.2 x 9 inches
  • Shipping Weight: 1.7 pounds (View shipping rates and policies)
  • Average Customer Review: 4.4 out of 5 stars  See all reviews (8 customer reviews)
  • Amazon Best Sellers Rank: #75,106 in Books (See Top 100 in Books)

More About the Author

Discover books, learn about writers, read author blogs, and more.

Customer Reviews

4.4 out of 5 stars
(8)
4.4 out of 5 stars
Most Helpful Customer Reviews
13 of 16 people found the following review helpful
5.0 out of 5 stars Not Likely to be Surpaseed October 5, 2011
Format:Paperback
When a book bills itself as "The Definitive Guide," well, that's a tall order to fill. But, except for updates as new releases of HBase roll out, I can't imagine another book surpassing this one by Lars George.

Lars George has been working with HBase since 2007 and is a full committer to the project as of 2009. He now works for Cloudera (a company providing a commercial flavor of Hadoop, as well as Hadoop support). After reading this book, there's no question in my mind that George has deep understanding, not only of HBase as a data solution, but of the internal workings of HBase.

George gives the background and history of HBase in the larger context of relational databases and NoSQL, which I found to be very helpful. The many diagrams throughout the book are extremely useful in explaining concepts, especially for those of us coming from a relational database background.

George has an excellent and clear writing style. Take, for example, the section where he discusses The Problem with Relational Database Systems, giving a quick rundown of the typical steps for getting an RDBMS to scale up. The flow of his summary reads like the increasing levels of panic that many of us have gone through when dealing with a database-backed application that will not scale.

As an example of how thorough and comprehensive the book is, look at chapter 2, where there is an extensive discussion of the type and class (not desktop PCs!) of machines suitable for running HBase. George gives a truly helpful set of configuration practices, even down to a recommendation for having redundant power supply units.

Another example of his thoroughness comes where George discusses delete methods (Chapter 3). He shows how you can use custom versioning, while admitting that the example is somewhat contrived. Indeed, right after elaborating the example, there is a distinct "Warning" box that admits that custom versioning is not actually recommended. So, even though you may not implement custom versioning, you do understand it as a feature that HBase provides.

Many of the programming examples come with excellent remarks or discussions of the tradeoffs implicit in the techniques, including performance and scaling concerns. Java developers will be most comfortable with the majority of examples, but they can be followed by anyone with some object-oriented programming experience.

I really appreciated the thorough discussion in chapter 8 ("Architecture") of subjects like B+ trees vs. Log-Structured Merge Trees (LSMs), the Write-Ahead Log, and seeks vs. transfers, topics which are relevant not only to HBase but to many database systems of varying architectures.

The level of thoroughness is also the book's only weakness. I'm not sure who the target audience for this book is, because it serves both developers and system or database administrators. While nearly every imaginable HBase topic is touched upon, some would have been better off merely listed, with appropriate references given to sources of more information (for example, all those hardware recommendations). The print edition of the book is 552 pages.

Still, a complaint that a book is too detailed shouldn't be interpreted as much of a complaint. Anyone with an interest in NoSQL databases in general, and HBase in particular should read and study this book. It's not likely to be superseded in the future.
Comment | 
Was this review helpful to you?
1 of 1 people found the following review helpful
2.0 out of 5 stars Probably a great book a couple of years ago February 15, 2013
By yepher
Format:Paperback
I am just getting started with HBase and was looking for a book that gets me up and running. I read through the first half of the book and was getting a pretty good feel for how it seems you should use HBase.

Unfortunately the API used are out of date. I was unable to get any of the examples to work with HBase 0.94.4. I tried downgrading HBase to 0.92.0 but it did not work either. The Maven pom files make it look easy to up the version but it turns out the API has changed quite a bit.

It would be really nice if the GitHub project was brought up to date so one reading through the book can at least get the examples to run with a current version of HBase. I tried several of the project forks but none seemed to work for me. The project README should at least be updated to point to the specific version of HBase that is needed to run the examples.

I made it to chapter four when I realized this book is mostly useless at this point.
Comment | 
Was this review helpful to you?
3.0 out of 5 stars Only book on HBase January 6, 2013
Format:Kindle Edition|Amazon Verified Purchase
When I purchased it was the only book on HBase and I found the money well-spent; although, I feel it lacks in many areas but can give you something to code after a few days of reading.
Comment | 
Was this review helpful to you?
Most Recent Customer Reviews
Search Customer Reviews
Only search this product's reviews

What Other Items Do Customers Buy After Viewing This Item?


Forums

There are no discussions about this product yet.
Be the first to discuss this product with the community.
Start a new discussion
Topic:
First post:
Prompts for sign-in
 


Listmania!


Create a Listmania! list

So You'd Like to...


Create a guide


Look for Similar Items by Category