Data-Intensive Text Processing with MapReduce and over one million other books are available for Amazon Kindle. Learn more



or
Sign in to turn on 1-Click ordering
Sell Us Your Item
For a $12.75 Gift Card
Trade in
More Buying Choices
Have one to sell? Sell yours here
Start reading Data-Intensive Text Processing with MapReduce on your Kindle in under a minute.

Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.
Sorry, this item is not available in
Image not available for
Color:
Image not available

To view this video download Flash Player

 

Data-Intensive Text Processing with MapReduce (Synthesis Lectures on Human Language Technologies) [Paperback]

Jimmy Lin , Chris Dyer , Graeme Hirst
4.7 out of 5 stars  See all reviews (3 customer reviews)

List Price: $40.00
Price: $31.46 & FREE Shipping. Details
You Save: $8.54 (21%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.
Want it Tuesday, May 21? Choose One-Day Shipping at checkout. Details

Formats

Amazon Price New from Used from
Kindle Edition $9.99  
Paperback $31.46  
Shop the new tech.book(store)
New! Introducing the tech.book(store), a hub for Software Developers and Architects, Networking Administrators, TPMs, and other technology professionals to find highly-rated and highly-relevant career resources. Shop books on programming and big data, or read this week's blog posts by authors and thought-leaders in the tech industry. > Shop now

Book Description

April 30, 2010 Synthesis Lectures on Human Language Technologies
Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters, making distributed computing paradigms more crucial than ever. MapReduce is a programming model for expressing distributed computations on massive datasets and an execution framework for large-scale data processing on clusters of commodity servers. The programming model provides an easy-to-understand abstraction for designing scalable algorithms, while the execution framework transparently handles many system-level details, ranging from scheduling to synchronization to fault tolerance. This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader "think in MapReduce", but also discusses limitations of the programming model as well. Table of Contents: Introduction / MapReduce Basics / MapReduce Algorithm Design / Inverted Indexing for Text Retrieval / Graph Algorithms / EM Algorithms for Text Processing / Closing Remarks

Frequently Bought Together

Data-Intensive Text Processing with MapReduce (Synthesis Lectures on Human Language Technologies) + Hadoop in Action + Hadoop: The Definitive Guide
Price for all three: $86.57

Buy the selected items together
  • Hadoop in Action $27.22
  • Hadoop: The Definitive Guide $27.89

Customers Who Bought This Item Also Bought


Product Details

  • Paperback: 178 pages
  • Publisher: Morgan and Claypool Publishers (April 30, 2010)
  • Language: English
  • ISBN-10: 1608453421
  • ISBN-13: 978-1608453429
  • Product Dimensions: 7.5 x 0.4 x 9.2 inches
  • Shipping Weight: 11.4 ounces (View shipping rates and policies)
  • Average Customer Review: 4.7 out of 5 stars  See all reviews (3 customer reviews)
  • Amazon Best Sellers Rank: #67,846 in Books (See Top 100 in Books)

Customer Reviews

4.7 out of 5 stars
(3)
4.7 out of 5 stars
Share your thoughts with other customers
Most Helpful Customer Reviews
1 of 1 people found the following review helpful
5.0 out of 5 stars A Valuable Resource for Hadoop Programmers December 29, 2011
Format:Paperback|Amazon Verified Purchase
This book provides a valuable discussion of the strategies involved in developing MapReduce algorithms. Although I have done considerable parallel programming using MPI and OpenMP, I find MapReduce algorithms to be somewhat non-intuitive, and this book has helped me to overcome that barrier.
Comment | 
Was this review helpful to you?
5.0 out of 5 stars Educating book February 7, 2012
Format:Paperback|Amazon Verified Purchase
The book is educating and, hopefully, will be helpful for writing map-reduce programs. It concentrates not on API, but on algorithms, which is rare and should be appreciated. Text-processing is a good example of data-intensive processing, but the book may be useful in many other fields.
Comment | 
Was this review helpful to you?
0 of 1 people found the following review helpful
4.0 out of 5 stars Useful intro to the algorithm February 11, 2013
Format:Paperback|Amazon Verified Purchase
I bought this book for a project at work, to prototype a log analysis system using Hadoop. I haven't bought very many technical books in the last few years, but the quality of most online documentation for Hadoop is poor and books seemed like a better option. This book is a good intro to the Map Reduce algorithm, but once I got my head around that I didn't use it very much (probably because my problem space was nothing like the sort of text processing that the book focuses on).
Comment | 
Was this review helpful to you?
Search Customer Reviews
Only search this product's reviews

What Other Items Do Customers Buy After Viewing This Item?


Forums

There are no discussions about this product yet.
Be the first to discuss this product with the community.
Start a new discussion
Topic:
First post:
Prompts for sign-in
 


Listmania!


Create a Listmania! list

So You'd Like to...


Create a guide


Look for Similar Items by Category