Join Amazon Prime and ship Two-Day for free and Overnight for $3.99. Already a member? Sign in.

Quantity: 

or
Sign in to turn on 1-Click ordering.
 
   
More Buying Choices
27 used & new from $62.91

Have one to sell? Sell yours here
 
   
Tell a Friend
Managing Gigabytes: Compressing and Indexing Documents and Images (The Morgan Kaufmann Series in Multimedia Information and Systems)
 
 
Are You an Author or Publisher?
Find out how to publish your own Kindle Books
 
  
Managing Gigabytes: Compressing and Indexing Documents and Images (The Morgan Kaufmann Series in Multimedia Information and Systems) (Hardcover)
by Ian H. Witten (Author), Alistair Moffat (Author), Timothy C. Bell (Author) "In 1911, Professor Lane Cooper published a concordance of William Wordsworth's poetry so that scholars could readily locate words in which they were interested..." (more)
Key Phrases: signature file index, symbolwise models, priming text, Trinity College, Vereenighde Nederlanden, Number Term (more...)
  4.7 out of 5 stars 11 customer reviews (11 customer reviews)  

List Price: $81.95
Price: $72.40 & this item ships for FREE with Super Saver Shipping. Details
You Save: $9.55 (12%)
Upgrade this book for $16.39 more, and you can read, search, and annotate every page online. See details
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.

Want it delivered Tuesday, May 13? Choose One-Day Shipping at checkout. See details

27 used & new available from $62.91
Also Available in: List Price: Our Price: Other Offers:
Hardcover (1st) 8 used & new from $17.36
 
   

Better Together

Buy this book with Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series)(2nd Edition) by David A. Grossman today!

Managing Gigabytes: Compressing and Indexing Documents and Images (The Morgan Kaufmann Series in Multimedia Information and Systems) Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series)(2nd Edition)
Buy Together Today: $103.40

Customers Who Bought This Item Also Bought

Modern Information Retrieval

Modern Information Retrieval by Ricardo Baeza-Yates

4.4 out of 5 stars (9) 
Mining the Web: Discovering Knowledge from Hypertext Data

Mining the Web: Discovering Knowledge from Hypertext Data by Soumen Chakrabarti

4.4 out of 5 stars (9)  $64.70
Google's PageRank and Beyond: The Science of Search Engine Rankings

Google's PageRank and Beyond: The Science of Search Engine Rankings by Amy N. Langville

4.1 out of 5 stars (13)  $28.00
Foundations of Statistical Natural Language Processing

Foundations of Statistical Natural Language Processing by Christopher D. Manning

4.6 out of 5 stars (11)  $64.00
Lucene in Action (In Action series)

Lucene in Action (In Action series) by Otis Gospodnetic

4.5 out of 5 stars (15)  $29.67
Explore similar items : Books (50)

Editorial Reviews
Amazon.com
Of all the tasks programmers are asked to perform, storing, compressing, and retrieving information are some of the most challenging--and critical to many applications. Managing Gigabytes: Compressing and Indexing Documents and Images is a treasure trove of theory, practical illustration, and general discussion in this fascinating technical subject.

Ian Witten, Alistair Moffat, and Timothy Bell have updated their original work with this even more impressive second edition. This version adds recent techniques such as block-sorting, new indexing techniques, new lossless compression strategies, and many other elements to the mix. In short, this work is a comprehensive summary of text and image compression, indexing, and querying techniques. The history of relevant algorithm development is woven well with a practical discussion of challenges, pitfalls, and specific solutions.

This title is a textbook-style exposition on the topic, with its information organized very clearly into topics such as compression, indexing, and so forth. In addition to diagrams and example text transformations, the authors use "pseudo-code" to present algorithms in a language-independent manner wherever possible. They also supplement the reading with mg--their own implementation of the techniques. The mg C language source code is freely available on the Web.

Alone, this book is an impressive collection of information. Nevertheless, the authors list numerous titles for further reading in selected topics. Whether you're in the midst of application development and need solutions fast or are merely curious about how top-notch information management is done, this hardcover is an excellent investment. --Stephen W. Plain

Topics covered: Text compression models, including Huffman, LZW, and their variants; trends in information management; index creation and compression; image compression; performance issues; and overall system implementation.

Steve Kirsch, Cofounder, Infoseek Corporation
"This book is the Bible for anyone who needs to manage large data collections. It's required reading for our search gurus at Infoseek. The authors have done an outstanding job of incorporating and describing the most significant new research in information retrieval over the past five years into this second edition."

See all Editorial Reviews


Product Details
  • Hardcover: 519 pages
  • Publisher: Morgan Kaufmann; 2 Sub edition (May 15, 1999)
  • Language: English
  • ISBN-10: 1558605703
  • ISBN-13: 978-1558605701
  • Product Dimensions: 9.2 x 7.6 x 1.3 inches
  • Shipping Weight: 2.3 pounds (View shipping rates and policies)
  • Average Customer Review: 4.7 out of 5 stars 11 customer reviews (11 customer reviews)
  • Amazon.com Sales Rank: #216,034 in Books (See Bestsellers in Books)

    Popular in these categories: (What's this?)

    #3 in  Books > Computers & Internet > Programming > Algorithms > Compression
    #42 in  Books > Computers & Internet > Programming > Algorithms > Digital Image Processing

    (Publishers and authors: Improve Your Sales)
  • Also Available in: Hardcover (1st) |  All Editions

  •  Would you like to update product info or give feedback on images? (We'll ask you to sign in so we can get back to you)


Inside This Book (learn more)
First Sentence:
"In 1911, Professor Lane Cooper published a concordance of William Wordsworth's poetry so that scholars could readily locate words in which they were interested." Read the first page
Key Phrases - Statistically Improbable Phrases (SIPs): (learn more)
signature file index, symbolwise models, priming text, permuted text, inverted file entry, compressed inverted files, specified query terms, permuted string, inverted list, unary code, compression effectiveness, signature width, front coding, pyramid coding, ranked queries, minimal perfect hash function, compression subsystem, bilevel images, minimal perfect hashing, cosine method, inverted file index, ranked query, using arithmetic coding, adaptive pixel, query results page
Key Phrases - Capitalized Phrases (CAPs): (learn more)
Trinity College, Vereenighde Nederlanden, Number Term, Library o