See buying choices for this item to see if it's one of the millions that are eligible for Amazon Prime.

9 used & new from $47.81

Have one to sell? Sell yours here
 
 
Managing Gigabytes: Compressing and Indexing Documents and Images (Electrical Engineering)
 
Customer image from andrecornal
 
Tell the Publisher!
I’d like to read this book on Kindle

Don’t have a Kindle? Get yours here.
 
  

Managing Gigabytes: Compressing and Indexing Documents and Images (Electrical Engineering) (Hardcover)

by Ian H. Witten (Author), Alistair Moffat (Author), Timothy C. Bell (Author) "In 1911, Professor Lane Cooper published a concordance of William Wordsworth's poetry so that scholars could readily locate words in which they were interested..." (more)
Key Phrases: signature file index, symbolwise models, priming text, Trinity College, Vereenighde Nederlanden, Number Term (more...)
4.7 out of 5 stars See all reviews (11 customer reviews)


Available from these sellers.


2 new from $55.00 7 used from $47.81
Also Available in: List Price: Our Price: Other Offers:
Hardcover (2 Sub) $90.95 $72.76 38 used & new from $38.88

Customers Who Bought This Item Also Bought

Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series)(2nd Edition)

Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series)(2nd Edition)

by David A. Grossman
4.0 out of 5 stars (8)  $53.95
Introduction to Information Retrieval

Introduction to Information Retrieval

by Christopher D. Manning
4.3 out of 5 stars (9)  $48.00
Programming Collective Intelligence: Building Smart Web 2.0 Applications

Programming Collective Intelligence: Building Smart Web 2.0 Applications

by Toby Segaran
4.5 out of 5 stars (48)  $26.39
Lucene in Action (In Action series)

Lucene in Action (In Action series)

by Otis Gospodnetic
4.2 out of 5 stars (20)  $29.67
Google's PageRank and Beyond: The Science of Search Engine Rankings

Google's PageRank and Beyond: The Science of Search Engine Rankings

by Amy N. Langville
4.1 out of 5 stars (15)  $31.60
Explore similar items

Editorial Reviews

Amazon.com Review
Of all the tasks programmers are asked to perform, storing, compressing, and retrieving information are some of the most challenging--and critical to many applications. Managing Gigabytes: Compressing and Indexing Documents and Images is a treasure trove of theory, practical illustration, and general discussion in this fascinating technical subject.

Ian Witten, Alistair Moffat, and Timothy Bell have updated their original work with this even more impressive second edition. This version adds recent techniques such as block-sorting, new indexing techniques, new lossless compression strategies, and many other elements to the mix. In short, this work is a comprehensive summary of text and image compression, indexing, and querying techniques. The history of relevant algorithm development is woven well with a practical discussion of challenges, pitfalls, and specific solutions.

This title is a textbook-style exposition on the topic, with its information organized very clearly into topics such as compression, indexing, and so forth. In addition to diagrams and example text transformations, the authors use "pseudo-code" to present algorithms in a language-independent manner wherever possible. They also supplement the reading with mg--their own implementation of the techniques. The mg C language source code is freely available on the Web.

Alone, this book is an impressive collection of information. Nevertheless, the authors list numerous titles for further reading in selected topics. Whether you're in the midst of application development and need solutions fast or are merely curious about how top-notch information management is done, this hardcover is an excellent investment. --Stephen W. Plain

Topics covered: Text compression models, including Huffman, LZW, and their variants; trends in information management; index creation and compression; image compression; performance issues; and overall system implementation. --This text refers to the Hardcover edition.

Review
"The coverage of compression, file organizations, and indexing techniques for full text and document management systems is unsurpassed. Students, researchers, and practitioners will all benefit from reading this book." -- Bruce Croft, Director, Center for Intelligent Information Retrieval at the University of Massachusetts

"The new edition of Witten, Moffat, and Bell not only has newer and better text search algorithms but much material on image analysis and joint image/text processing. If you care about search engines, you need this book: it is the only one with full details of how they work. The book is both detailed and enjoyable; the authors have combined elegant writing with top-grade programming." -- Michael Lesk, National Science Foundation

"This book is the Bible for anyone who needs to manage large data collections. It's required reading for our search gurus at Infoseek. The authors have done an outstanding job of incorporating and describing the most significant new research in information retrieval over the past five years into this second edition." -- Steve Kirsch, Cofounder, Infoseek Corporation

"This book is the Bible for anyone who needs to manage large data collections. It's required reading for our search gurus at Infoseek. The authors have done an outstanding job of incorporating and describing the most significant new research in information retrieval over the past five years into this second edition."
—Steve Kirsch, Cofounder, Infoseek Corporation

"The new edition of Witten, Moffat, and Bell not only has newer and better text search algorithms but much material on image analysis and joint image/text processing. If you care about search engines, you need this book: it is the only one with full details of how they work. The book is both detailed and enjoyable; the authors have combined elegant writing with top-grade programming."
—Michael Lesk, National Science Foundation

"The coverage of compression, file organizations, and indexing techniques for full text and document management systems is unsurpassed. Students, researchers, and practitioners will all benefit from reading this book."
—Bruce Croft, Director, Center for Intelligent Information Retrieval at the University of Massachusetts -- Review --This text refers to the Hardcover edition.

See all Editorial Reviews


Product Details

  • Hardcover: 429 pages
  • Publisher: Kluwer Academic Publishers; 1st edition (January 15, 1994)
  • Language: English
  • ISBN-10: 0442018630
  • ISBN-13: 978-0442018634
  • Product Dimensions: 10 x 7.5 x 1.2 inches
  • Shipping Weight: 2.2 pounds
  • Average Customer Review: 4.7 out of 5 stars See all reviews (11 customer reviews)
  • Amazon.com Sales Rank: #2,184,928 in Books (See Bestsellers in Books)

    Popular in these categories: (What's this?)

    #14 in  Books > Computers & Internet > Graphic Design > Electronic Documents
    #68 in  Books > Computers & Internet > Programming > Algorithms > Compression

Inside This Book (learn more)
Browse and search another edition of this book.



Books on Related Topics (learn more)
 
 

What Do Customers Ultimately Buy After Viewing This Item?

Managing Gigabytes: Compressing and Indexing Documents and Images (Electrical Engineering)
63% buy the item featured on this page:
Managing Gigabytes: Compressing and Indexing Documents and Images (Electrical Engineering) 4.7 out of 5 stars (11)
Introduction to Information Retrieval
13% buy
Introduction to Information Retrieval 4.3 out of 5 stars (9)
$48.00
Google's PageRank and Beyond: The Science of Search Engine Rankings
11% buy
Google's PageRank and Beyond: The Science of Search Engine Rankings 4.1 out of 5 stars (15)
$31.60
Lucene in Action (In Action series)
7% buy
Lucene in Action (In Action series) 4.2 out of 5 stars (20)
$29.67

Suggested Tags from Similar Products

 (What's this?)
Be the first one to add a relevant tag (keyword that's strongly related to this product).
Check a corresponding box or enter your own tags in the field below.
(15)
(14)
(12)

Your tags: Add your first tag
 
Help others find this product — tag it for Amazon search
No one has tagged this product for Amazon search yet. Why not be the first to suggest a search for which it should appear?

Sell a Digital Version of This Book in the Kindle Store

If you are a publisher or author and hold the digital rights to a book, you can sell a digital version of it in our Kindle Store. Learn more

 

Customer Reviews

11 Reviews
5 star:
 (8)
4 star:
 (3)
3 star:    (0)
2 star:    (0)
1 star:    (0)
 
 
 
 
 
Average Customer Review
4.7 out of 5 stars (11 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

 
51 of 53 people found the following review helpful:
5.0 out of 5 stars The Wonderful Thing Is: It's the Only One, December 20, 2001
By Peter Norvig (Palo Alto, CA USA) - See all my reviews
(REAL NAME)   
This is the only book there is that will actually teach you how to build an information retrieval system (aka search engine). It discusses all the algorithms and tradeoffs, and comes with free downloadable source code to experiment with. Some of the material is standard, but covered in more implementation detail here than anywhere else. Some of the material is novel: you won't find better coverage of compression unless you hand-assemble twenty research papers, and reverse-engineer them to figure out how they're implemented. But with "Managing Gigabytes", it's all here. (Although, after a particularly envigorating discussion of how to string together a bunch of techniques to compress their corpus and save a couple 100MB, I did a check and found you could buy 512MB of RAM for less than the cost of the book. Knowledge is Power, but sometimes a little cash is more powerful.) The only negative is that this book is not called "Managing Terabytes", as the first edition promised/threatened it might be. RAM and disk are cheap, but not that cheap, and for now terabytes (and sometimes petabytes) are managed only by NASA, Google, and a few others. I can't wait to see the third edition!
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
14 of 14 people found the following review helpful:
4.0 out of 5 stars Good introduction to searching/indexing in data., December 29, 1999
By Amund Tveit (Trondheim Norway) - See all my reviews
MG gave a good introduction to the components of practical Information Retrieval (IR). You can clearly see that the authors have a genuine interest in the field! But, I would like some more theoretical analysis of the algorithms used(i.e. O-notation), and more focus on parallell implementations of IR systems. Another book related to the same area worth mentioning is "Modern Information Retrieval".
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
15 of 16 people found the following review helpful:
4.0 out of 5 stars Very clear, but misses some key real-world issues, August 14, 2001
By Edwin Young (Seattle, WA United States) - See all my reviews
(REAL NAME)   
As others have said, MG is a good introductory text for Information Retrieval. However I think it spends a little too much time on compression techniques and lacks a good discussion of incremental or on-line indexing. The book tends to assume that the set of texts to be searched is static - if new documents can be added or old ones deleted it makes the whole problem much harder and many of MG's techniques are no longer relevant. That said, I strongly look forward to Managing Terabytes (if it ever appears).
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)


Share your thoughts with other customers: Create your own review
 
 
Ad
 
Most Recent Customer Reviews

5.0 out of 5 stars one of the best book on search engineering
It has been 8 years since it was published and I could see it is still one of the best in IR field. Read more
Published on April 19, 2007 by Zhefu Zhang

5.0 out of 5 stars A Comprehensive Introduction To Text Retrieval Systems
A wonderful feature of this book spans out practicality for various topics including compresion algorithms and theory, document and imaging system and information retrieval. Read more
Published on July 30, 2005 by Gareth Louis

5.0 out of 5 stars Great Book on Information Retrieval
Managing Gigabytes is the best book out there on information retrieval. If you're interested in implementing your own IR system, there's nothing available that comes close to... Read more
Published on May 2, 2004

5.0 out of 5 stars This is a great book.
This is one of those rare books that succeeds both on a theoretical and practical level. The theory underlying management and retrieval of large collections of mixed text and... Read more
Published on September 17, 1999

5.0 out of 5 stars Compression, Algorithms, Full Text Retrieval
Managing Gigabytes is a must read for anyone iterested in how to transmit, access, store, and search large amounts of data. Read more
Published on September 14, 1999

5.0 out of 5 stars Best text available. Has no competition.
This text sets the standard for future information retrieval texts and has replaced the Salton books as the canonical academic text. Read more
Published on September 10, 1999

5.0 out of 5 stars Well, written, with plenty of nuts and bolts
I found MG exceedingly readable, and particularly useful. The ideas are very well explained, and the problems are solved in a stepwise fashion, leading from a simple, inefficient... Read more
Published on August 27, 1999 by nevill@cs.rutgers.edu

4.0 out of 5 stars This is an ideal entry text for IR related study.
This is a hands on text book. That is, words are easy to understand. It doesn't give you tons of links for you to retrieve papers before you can move on to the next section. Read more
Published on July 2, 1997

Only search this product's reviews



Customer Discussions

 Beta (What's this?)
New! See all customer communities, and bookmark your communities to keep track of them.
This product's forum (0 discussions)
  Discussion Replies Latest Post
  No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
  [Cancel]


   


Product Information from the Amapedia Community

Beta (What's this?)



Look for Similar Items by Category


Smooth Operator

Shop for garage door openers

Find garage door products (opener kits, remotes, mini-key-chain controls, and wireless-key entry systems) in the Hardware Store. Opening the garage door shouldn’t be a chore.

Shop all garage door hardware

 

Big Savings in Books

Bargain Books
Find great titles at fantastic prices in our Bargain Books Store.
 

Accessorize Your Tools

Shop for Tool Accessories
From drill bits to fasteners, find all the tool accessories you need in Home Improvement.

Shop for tool accessories

 

Add Flair to Your Hardware

Shop for cabinet knobs
Whether you're remodeling or just need to refresh a living space, cabinet knobs offer a great way to easily pull a room together.

Shop for cabinet knobs

 
Ad

 

Feedback

If you need help or have a question for Customer Service, contact us.
 Would you like to update product info or give feedback on images?
Is there any other feedback you would like to provide?

Your comments can help make our site better for everyone.



Where's My Stuff?

Shipping & Returns

Need Help?

Your Recent History

  (What's this?)
You have no recently viewed items or searches.

After viewing product detail pages or search results, look here to find an easy way to navigate back to pages you are interested in.

Look to the right column to find helpful suggestions for your shopping session.

Continue shopping: Top Sellers
Free
Free by Chris Anderson
Paranoia
Paranoia by Joseph Finder
My Soul to Lose
My Soul to Lose by Rachel Vincent
The Adventures of Sherlock Holmes
The Adventures of Sherlock Holmes by Arthur Conan, Sir, 1859-1930 Doyle

Conditions of Use | Privacy Notice © 1996-2009, Amazon.com, Inc. or its affiliates