
Mining the Web: Discovering Knowledge from Hypertext Data 1st Edition

4.8 out of 5 stars 9 customer reviews
ISBN-13: 978-1558607545
ISBN-10: 1558607544

Frequently Bought Together

  • Mining the Web: Discovering Knowledge from Hypertext Data
  • Webbots, Spiders, and Screen Scrapers: A Guide to Developing Internet Agents with PHP/CURL
  • Spidering Hacks

Editorial Reviews

Review

"...solid and beneficial to readers interested in Web data mining, especially those interested in the details of algorithmic implementation." - Bernard J. Jansen, Information Processing & Management

"The treatment is systematic, comprehensive and in-depth, yet very lucid and accessible to a wide range of Web technology developers. The author's insights and depth of knowledge as one of the pioneering researchers on hypertext information mining and retrieval are also evident in the extensive and useful bibliographic notes provided at the end of each chapter..." - Professor Joydeep Ghosh, University of Texas, Austin

"The author has done the community a great service by synthesizing all the important work in this field into an excellent book, which introduces fairly sophisticated material in an easy-to-read manner. This book, for the first time, makes it possible to offer Web Mining as a real course." - Professor Jaideep Srivastava, University of Minnesota

"Mining the Web: Discovering Knowledge from Hypertext Data, by Soumen Chakrabarti, focuses extensively on building a better search engine crawler...Chakrabarti's book begins with a discussion of search engine crawlers in a chapter titled "Crawling the Web." The discussion in this chapter is technical and detailed. Readers learn about features such as the robots.txt file that can be written in a certain way to stop crawlers from visiting a page...The most interesting part of the book is perhaps Chapter 7, "Social Network Analysis." In this chapter, the author presents the most famous search engine algorithms (e.g., PageRank, HITS, SALSA)." - Sandeep Krishnamurthy, Journal of Marketing Research

"All in all this is an excellent book. I enjoyed the book and highly recommend it as a textbook for web data mining classes at graduate or senior undergraduate levels. Chakrabarti has a rich vocabulary and is a gifted writer. I bet he will write new, good books in the future, and he should. I look forward to them." - Fazli Can, Miami University

Book Description

The definitive book on mining the Web from the preeminent authority.


Product Details

  • Hardcover: 344 pages
  • Publisher: Morgan Kaufmann; 1 edition (October 23, 2002)
  • Language: English
  • ISBN-10: 1558607544
  • ISBN-13: 978-1558607545
  • Product Dimensions: 1 x 7.5 x 9.5 inches
  • Shipping Weight: 1.4 pounds
  • Average Customer Review: 4.8 out of 5 stars (9 customer reviews)
  • Amazon Best Sellers Rank: #1,228,885 in Books

Customer Reviews

5 star: 78%
4 star: 22%
3 star: 0%
2 star: 0%
1 star: 0%

Top Customer Reviews

Format: Hardcover
Executive summary: This is a fabulous book, written with care and
precision, easy to read yet covering in detail a wide variety of
the most beautiful and promising developments in data mining and
machine learning as it relates to the World Wide Web, including a
prescient vision of where the field is headed in the future.
More detail: There are science authors who are clear experts in
their field, yet have trouble communicating their knowledge. Then
there are science authors who write with clarity, but achieve it
by dumbing down technical details to cater to a broad readership.
Finally, there are authors who are experts and leaders in their
field, who are actively contributing to the forefront of research,
who are excellent writers, and who can communicate complex
concepts to a diverse audience with acumen, without glossing over
important details. Soumen Chakrabarti is one such author. "Mining
the Web" is a stunning achievement. It is an excellent summary of
the past decade or so of research in the area, covering nearly all
of the important bases, including the machinery of Web crawling,
Web information retrieval (i.e., search engines), clustering,
automated classification, semi-supervised approaches, social
network analysis, and focused crawling. Though Chakrabarti himself
has contributed prominently to the field, this book is not at all
the vehicle for self-promotion that other specialist texts
sometimes feel like. The book should be valuable to newcomers,
students, and experts alike, and could certainly serve as an
excellent course textbook.
46 people found this helpful.
Format: Hardcover
This book is one of the best computer science textbooks I have ever seen. Apart from the wealth of information and discussion on Web crawling and data mining specifically (chapters 2, 3, 7, 8), chapters 4, 5 and 6 constitute a wonderful summary of machine learning in general.

The book's discussion of unsupervised learning (the EM algorithm, advanced algorithms in which the number of clusters is not known in advance), supervised learning (Bayesian networks, entropian methods, SVMs), semisupervised learning, co-training and rule induction is extraordinary in that it is short and intuitive, does not sacrifice mathematical rigor, and is accompanied by examples (all taken from information retrieval over the web).
11 people found this helpful.
Format: Hardcover
The book is an absolute must for those working in information retrieval, and in particular web information retrieval and web mining. These areas are quite hot (again) both in academia and in industry. I personally enjoyed the fact that there is no discussion of semantic web research directions (Jena, OWL etc.) but others might not... The material is quite tightly brought together and very comprehensibly written. However, especially in chapters 4 and 5, there are many pages containing mathematical errors (either in the formulas or in the algorithms described). For this reason, I rate an otherwise excellent textbook with 4 stars.
9 people found this helpful.
Format: Hardcover
This book is simply the best web data mining text available. It is simultaneously broad and deep, covering a wide array of topics yet delving into the meatiest parts of Web data mining. Topics covered include classic information retrieval, graph theoretic approaches, Web measurements, and even machine learning methods such as clustering and text classification. One of the reasons why the book succeeds is that Chakrabarti is himself a major contributor to the field. His writing is always clear and precise, probably because he frequently lectures on these topics. If you buy one book about data mining on the Web, this should be that book.
9 people found this helpful.
Format: Hardcover
This book is an excellent introduction to a number of techniques in information retrieval, machine learning, data mining, and network analysis, and to the application of such techniques to the Web. It discusses many research issues and provides practical insights into constructing Web mining tools and systems. Chakrabarti has brought the wisdom of researchers in the area of Web mining to a wider audience. I think the book will prompt the development of new courses for graduate as well as senior undergraduate students.
The first part of the book deals with interesting practical and theoretical issues related to designing large-scale Web crawlers and search engines. Chapters 4 and 5 are a good introduction to various unsupervised and supervised learning methods. However, a proper understanding of advanced methods like LSI is possible only with an adequate foundation in linear algebra (you can get only a flavor of the technique in the book). Part III of the book is my personal favorite. It has a detailed description of various social network analysis methods, some of which have been applied by modern search engines like Google. Focused crawling, an area that the author has personally shaped, is also explained well. The book ends with a brief peek into the future of Web mining.
The comprehensive yet easy-to-read nature of the book makes it a valuable addition to my shelf. It is hard to find a comparable book in the area of Web mining.
13 people found this helpful.
