Programming Books C Java PHP Python Learn more Browse Programming Books
Hadoop: The Definitive Guide and thousands of other textbooks are available for instant download on your Kindle Fire tablet or on the free Kindle apps for iPad, Android tablets, PC or Mac.
Buy New
$28.99
Qty:1
  • List Price: $49.99
  • Save: $21.00 (42%)
FREE Shipping on orders over $35.
In Stock.
Ships from and sold by Amazon.com.
Gift-wrap available.
Add to Cart
Trade in your item
Get a $9.50
Gift Card.
Have one to sell? Sell on Amazon
Flip to back Flip to front
Listen Playing... Paused   You're listening to a sample of the Audible audio edition.
Learn more
See all 2 images

Hadoop: The Definitive Guide Paperback – May 26, 2012

ISBN-13: 978-1449311520 ISBN-10: 1449311520 Edition: Third Edition

Buy New
Price: $28.99
52 New from $25.00 24 Used from $21.00
Rent from Amazon Price New from Used from
Kindle
"Please retry"
$8.36
Paperback
"Please retry"
$28.99
$25.00 $21.00
Amazon%20Web%20Services
Interested in Cloud Computing? Run virtually everything in the AWS Cloud. Web Apps, Big Data, and more. Get started for free.

Frequently Bought Together

Hadoop: The Definitive Guide + Programming Hive + Hadoop Operations
Price for all three: $99.58

Buy the selected items together
  • Programming Hive $30.59
  • Hadoop Operations $40.00

Customers Who Bought This Item Also Bought

NO_CONTENT_IN_FEATURE
Shop the New Digital Design Bookstore
Check out the Digital Design Bookstore, a new hub for photographers, art directors, illustrators, web developers, and other creative individuals to find highly rated and highly relevant career resources. Shop books on web development and graphic design, or check out blog posts by authors and thought-leaders in the design industry. Shop now

Product Details

  • Paperback: 688 pages
  • Publisher: Yahoo Press; Third Edition edition (May 26, 2012)
  • Language: English
  • ISBN-10: 1449311520
  • ISBN-13: 978-1449311520
  • Product Dimensions: 9.1 x 7 x 1.5 inches
  • Shipping Weight: 2.3 pounds (View shipping rates and policies)
  • Average Customer Review: 4.0 out of 5 stars  See all reviews (52 customer reviews)
  • Amazon Best Sellers Rank: #6,272 in Books (See Top 100 in Books)

Editorial Reviews

Book Description

Storage and Analysis at Internet Scale

About the Author

Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was as an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly, java.net and IBM's developerWorks, and has spoken at several conferences, including at ApacheCon 2008 on Hadoop. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK.


More About the Author

Discover books, learn about writers, read author blogs, and more.

Customer Reviews

Wouldn't it make sense to use a URL instead of the phrase "book's website?"
Mark
This is a great book if you intend to understand the whole HDFS and Mapreduce concept in detail.
Arijit Banerjee
I recommend this book to everyone interested in getting started in the Hadoop world!
Philip Ballew

Most Helpful Customer Reviews

52 of 57 people found the following review helpful By Mark on August 25, 2012
Format: Paperback Verified Purchase
I had read all the positive reviews and really had high hopes for the book, waited for the 3rd edition thinking it would be current, but I've mainly felt frustration in reading it once past the first few chapters.

Reference to the Bible in other reviews are apt. The book is a mishmash of chapters with a wide variety of styles and intents. The writing giving the overview is great. But other chapters are a reference manual dump with little motivation. Other chapters tried to be guided tutorial, but lacked in important details (or were out dated by changes). Wish it could have been written with a clearer editorial point of view, or better organized in sections with similar purposes.

Keeping up with a such a fast moving project with a paperback book is no doubt a difficult task. I didn't feel the book did a good job of dealing with the changes that happened with the shift to 1.x .

Most frustrating were the mentions of the "book's website" as a source of up-to-date information. Which website? (hadoopbook.com, oreilly.com, github.com). Wouldn't it make sense to use a URL instead of the phrase "book's website?"

Minor complaint, don't like the code listings without filenames.

Expect to find a lot of time looking for stuff on the web that should have been included in the book or at least documented with a concrete URLs.

There are certainly example of truly fine technical writing in the book. Just wish that level could have been maintained through out the book.
1 Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
152 of 176 people found the following review helpful By Frustrated Hadoop Learner on September 19, 2012
Format: Paperback Verified Purchase
I bought this book as a very experienced programmer but no prior experience with Hadoop, which I need to come up to speed on for a new project. I am extremely disappointed in the book and feel I wasted my money. If there's one thing you want from a book on a new technology, it's the ability to get a basic "Hello World" equivalent program running, from which you can then start iterating. This book completely falls down on this most basic requirement - when you get to the very first example program in the book, it tells you that you need to first compile a bunch of example code from the book's website. That shouldn't be required, but ok, whatever. Then when you go to the book's website, you are told that you first need to install a bunch of extra stuff covered later in the book before you can compile the libraries apparently needed to get anything at all to run. This really makes no sense at all - there's no way I should be having to read all the later chapters to figure out what these things are in order to get my very first example program running. Tossed it into the trash and off in search of a resource done by someone who understands how to structure a tutorial properly.
9 Comments Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
46 of 52 people found the following review helpful By Y. Yuan on October 12, 2012
Format: Paperback Verified Purchase
I purchased this book a few months ago based on many earlier 5-star reviews. I had high hopes that it would be as good as those reviewers highly praised. However, the book is actually unbelievably poorly organized - essentially written in a spaghetti fashion. Yes - it contains a lot of information about Hadoop, but with three basic issues: 1) examples are trivial and hard to get working due to insufficient, unclear or no procedures; 2) many subjects (e.g. streaming) are spread over several chapters and readers have to stitch them together after reading all relevant chapters; and 3) many stataments are either inaccurate or lack supportive data. Ironically, one has to apply MapReduce to all the subjects in order to sort out various subjects in a more logic order. I look forward to the 4th edition with significant quality improvement.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
39 of 44 people found the following review helpful By Big Data Paramedic VINE VOICE on April 12, 2014
Format: Paperback Verified Purchase
This book is the single best source to begin your career in Big Data Development. However this book should not be the first entry point, which will frustrate you. This review hopes to help the juniors and newbies, who want to enter the big data world.

Cloudera CCD-410 certification ranges between tough to very tough. Period.

TRAINING : You are not mandated to take a training. I took a relatively inexpensive training ($300) from edureka dot in, an online training website in India. They give a good overview at 10,000 feet are very good for the price,but no where close enough to get certified. Check out their first session available for free at Youtube. They do have steps to install your own VM, simple project , HIVE,PIG etc. If time and money permits, I strongly suggest going to official cloudera training. It costs about $3000 and includes a free test voucher , so effectively about $2700. Saves you months in preparation time and distinct advantage over your peers that should pay for itself.

Install VM, try few commands, PIG, hive commands, Also try Amazon elastic mapreduce which reduces lot of manual typing and allows you to focus on the coding itself.

LEARNING FROM THIS BOOK: After a training, start with this book. The first Eight chapters are critical (Approximately 300 Pages out of about 550 pages). If you are smart,sharp and young , expect to read these eight chapters about three times, more is just fine. Add some time to read rest of chapters once Or twice before the test and all the external links. If you are a busy professional, give a six month window to take the test. Knowing Java is a definitive plus. Buy the Cloudera mock examination after getting comfortable and familiar with Mapreduce($125). It is a very nice resource.
Read more ›
4 Comments Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again

Customer Images

Most Recent Customer Reviews

Search

What Other Items Do Customers Buy After Viewing This Item?