or
Sign in to turn on 1-Click ordering.
 
 
Express Checkout with PayPhrase
What's this? | Create PayPhrase
More Buying Choices
31 used & new from $34.18

Have one to sell? Sell yours here
 
   
Mining the Web: Discovering Knowledge from Hypertext Data
 
 

Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)

~ (Author) "The World Wide Web is the largest and most widely known repository of hypertext..." (more)
Key Phrases: vicinity graph, bipartite cores, mixed hubs, Open Directory, Bibliographic Notes, World Wide Web (more...)
4.9 out of 5 stars  See all reviews (8 customer reviews)

List Price: $78.95
Price: $61.39 & this item ships for FREE with Super Saver Shipping. Details
You Save: $17.56 (22%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
Upgrade this book for $14.39 more, and you can read, search, and annotate every page online. See details
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.

Want it delivered Tuesday, November 17? Choose One-Day Shipping at checkout. Details
16 new from $44.99 15 used from $34.18

Formats

Amazon Price New from Used from
  Kindle Edition, August 15, 2002 $49.11 -- --
  Hardcover, October 22, 2002 $61.39 $44.99 $34.18

Frequently Bought Together

Mining the Web: Discovering Knowledge from Hypertext Data + Google's PageRank and Beyond: The Science of Search Engine Rankings + Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series)(2nd Edition)
Price For All Three: $132.31

Show availability and shipping details


Customers Who Bought This Item Also Bought

Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)

Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)

by Bing Liu
4.3 out of 5 stars (3)  $42.69
The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data

The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data

by Ronen Feldman
5.0 out of 5 stars (2)  $51.97
Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series)(2nd Edition)

Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series)(2nd Edition)

by David A. Grossman
4.0 out of 5 stars (8)  $37.80
Introduction to Information Retrieval

Introduction to Information Retrieval

by Christopher D. Manning
4.4 out of 5 stars (10)  $47.10
Foundations of Statistical Natural Language Processing

Foundations of Statistical Natural Language Processing

by Christopher D. Manning
4.6 out of 5 stars (11)  $67.03
Explore similar items

Editorial Reviews

Review

"This book, for the first time, makes it possible to offer Web Mining as a real course." -- Professor Jaideep Srivastava, University of Minnesota. -- Review


Review

"...solid and beneficial to readers interested in Web data mining, especially those interested in the details of algorithmic implementation." = Bernard J. Jansen, Information Processing & Management

"The treatment is systematic, comprehensive and in-depth, yet very lucid and accessible to a wide range of Web technology developers. The author's insights and depth of knowledge as on of the pioneering researchers on hypertext information mining and retrieval are also evident in the extensive and useful bibliographic notes provided at the end of each chapter..." - Professor Joydeep Ghosh, University of Texas, Austin

"The author has done the community a great service by synthesizing all the important work in this field into an excellent book, which introduces fairly sophisticated material in an easy-to-read manner. This book for the first time, makes it possible to offer Web Mining as a real course." - Professor Jaideep Srivastava, University of Minnesota

" Mining the Web: Discovering Knowledge from Hypertext from Hypertext Data, by Soumen Chakrabarti, focuses extensively on building a better search engine crawler...Chakrabarti's book begins with a discussion of search engine crawlers in a chapter titled "Crawling the Web." The discussion in this chapter is technical and detailed. Readers learn about features such as the robots.txt file that can be written in a certain way to stop crawlers from visiting a page...The most interesting part of the book is perhaps Chapter 7, "Social Network Analysis." In this chapter, the author presents the most famous search engine algorithms (e.g., PageRank, HITS, SALSA)." - Journal of Marketing Research, Sandeep Krishnamurthy

"All in all this is an excellent book. I enjoyed the book and highly recommend it as a textbook for web data mining classes at graduate or senior undergraduate levels. Chakrabarti has a rich vocabulary and is a gifted writer. I bet he will write new, good books in the future, and he should. I look forward to them." - Fazli Can - Miami University

Product Details

  • Hardcover: 344 pages
  • Publisher: Morgan Kaufmann; 1st edition (October 23, 2002)
  • Language: English
  • ISBN-10: 1558607544
  • ISBN-13: 978-1558607545
  • Product Dimensions: 9.7 x 7.6 x 1 inches
  • Shipping Weight: 1.4 pounds (View shipping rates and policies)
  • Average Customer Review: 4.9 out of 5 stars  See all reviews (8 customer reviews)
  • Amazon.com Sales Rank: #176,415 in Books (See Bestsellers in Books)

    Popular in this category: (What's this?)

    #59 in  Books > Computers & Internet > Databases > Data Mining

More About the Author

Soumen Chakrabarti
Discover books, learn about writers, read author blogs, and more.

Visit Amazon's Soumen Chakrabarti Page

Inside This Book (learn more)




What Do Customers Ultimately Buy After Viewing This Item?


Tags Customers Associate with This Product

 (What's this?)
Click on a tag to find related items, discussions, and people.
 
(1)

Your tags: Add your first tag
 

 

Customer Reviews

8 Reviews
5 star:
 (7)
4 star:
 (1)
3 star:    (0)
2 star:    (0)
1 star:    (0)
 
 
 
 
 
Average Customer Review
4.9 out of 5 stars (8 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

 
44 of 45 people found the following review helpful:
5.0 out of 5 stars Excellent, comprehensive, readable book on mining the Web, August 28, 2003
By David M. Pennock (Pasadena, CA United States) - See all my reviews
Executive summary: This is a fabulous book, written with care and
precision, easy to read yet covering in detail a wide variety of
the most beautiful and promising developments in data mining and
machine learning as it relates to the World Wide Web, including a
prescient vision of where the field is headed in the future.

More detail: There are science authors who are clear experts in
their field, yet have trouble communicating their knowledge. Then
there are science authors who write with clarity, but achieve it
by dumbing down technical details to cater to a broad readership.
Finally, there are authors who are experts and leaders in their
field, who are actively contributing to the forefront of research,
who are excellent writers, and who can communicate complex
concepts to a diverse audience with acumen, without glossing over
important details. Soumen Chakrabarti is one such author. "Mining
the Web" is a stunning achievement. It is an excellent summary of
the past decade or so of research in the area, covering nearly all
of the important bases, including the machinery of Web crawling,
Web information retrieval (i.e., search engines), clustering,
automated classification, semi-supervised approaches, social
network analysis, and focused crawling. Though Chakrabarti himself
has contributed prominently to the field, this book is not at all
the vehicle for self-promotion that other specialist texts
sometimes feel like. The book should be valuable to newcomers,
students, and experts alike, and could certainly serve as an
excellent course textbook. High-level concepts can be grasped with
little mathematical background, yet more technically sophisticated
readers will not be disappointed: most topics do include rigorous
coverage. The text is well organized, well written, and well
conceived. It's design, including generous and illuminating
figures and illustrations, possesses an artist's touch, perhaps
not surprising given that Chakrabarti designs his own font
libraries in his (apparently scant) spare time. It's hard to
imagine where Chakrabarti found the time to write such a
comprehensive and thoughtful book, but I'm not asking any
questions: I'm thrilled with the outcome. The book is a must-have
reference for anyone working in -- or aspiring to work in -- the
crossroads of Web algorithmics, data mining, and machine learning.

David M. Pennock
Senior Research Scientist, Overture Services, Inc.
[website]

Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
7 of 8 people found the following review helpful:
5.0 out of 5 stars A wonderful textbook for machine learning over the web, September 8, 2004
This book is one of the best computer science textbooks i have ever seen. Apart from the wealth of information and discussion on specific WEB crawling and data mining (chapters 2, 3, 7, 8), chapters 4, 5 and 6 constitute a wonderful summary of machine learning in general.

The book's discussion of unsupervised learning (the EM algorithm, advanced algorithms in which the number of clusters is not known in advance), supervised learning (Bayesian networks, entropian methods, SVMs), semisupervised learning, co-training and rule induction is extraordinary in that it is short, intuitive, does not sacrifice mathematical rigor, and accompanied by examples (all taken from information retreival over the web).
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
9 of 12 people found the following review helpful:
5.0 out of 5 stars The Best Web Data Mining Text, July 2, 2003
This book is simply the best web data mining text available. It is simultaneously broad and deep, covering a wide array of topics yet delving into the meatiest parts of Web data mining. Topics covered include classic information retrieval, graph theoretic approaches, Web measurements, and even machine learning methods such as clustering and text classification. One of the reasons why the book succeeds is that Chakrabarti is himself a major contributor to the field. His writing is always clear and precise probably because he frequently lectures on these topics. If you buy one book about data mining on the Web, this should be that book.
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)


Share your thoughts with other customers: Create your own review
 
 
Ad
 
Most Recent Customer Reviews

5.0 out of 5 stars comprehensive web mining book though 326 pages
I still gave it 5 stars though the effective page number is 326. There are mainly 3 sections in the book --- the first section is 79 pages walks you thru the basic structure of a... Read more
Published on April 23, 2007 by Zhefu Zhang

4.0 out of 5 stars Great coverage, but quite a few errors
The book is an absolute must for those working in information retrieval, and in particular web information retrieval and web mining. Read more
Published on June 3, 2005 by I. Christou

5.0 out of 5 stars Readable, approachable, informative
The field of relevance algorithms for the web is still relatively new and the author provides a clear, informative introduction to the still-developing field. Read more
Published on December 14, 2004 by Unknown Comic

5.0 out of 5 stars The best general purpose book on the subject I've seen
Probably not a book you're going to put on your coffee table, but if you've got any interest in this subject matter at all this is a book worth having. Read more
Published on October 5, 2004 by Niall O'Driscoll

5.0 out of 5 stars Much needed book on Web mining
This book is an excellent introduction to a number of techniques in information retrieval, machine learning, data mining, network analysis and the application of such techniques... Read more
Published on April 29, 2003 by Gautam Pant

Only search this product's reviews



Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 


Active discussions in related forums
Discussion Replies Latest Post
Anyone need psychology testbook- trying to sell a used copy 2 19 hours ago
textbook scam 72 22 hours ago
Textbooks for Kindle DX? 61 7 days ago
Search Customer Discussions
Search all Amazon discussions
   




Product Information from the Amapedia Community

Beta (What's this?)

Help us improve this fledgling article by editing it on Amapedia.com opens new browser window



Look for Similar Items by Category


Look for Similar Items by Subject

Ad
 

Feedback

If you need help or have a question for Customer Service, contact us.
 Would you like to update product info or give feedback on images?
Is there any other feedback you would like to provide?

Your comments can help make our site better for everyone.


Your Recent History

 (What's this?)

After viewing product detail pages or search results, look here to find an easy way to navigate back to pages you are interested in.