Customer Reviews


9 Reviews
5 star:
 (7)
4 star:
 (2)
3 star:    (0)
2 star:    (0)
1 star:    (0)
 
 
 
 
 
Average Customer Review
Share your thoughts with other customers
Create your own review
 
 
Only search this product's reviews
Most Helpful First | Newest First

46 of 47 people found the following review helpful:
5.0 out of 5 stars Excellent, comprehensive, readable book on mining the Web, August 28, 2003
By 
Dave P (New York, NY United States) - See all my reviews
This review is from: Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)
Executive summary: This is a fabulous book, written with care and
precision, easy to read yet covering in detail a wide variety of
the most beautiful and promising developments in data mining and
machine learning as it relates to the World Wide Web, including a
prescient vision of where the field is headed in the future.

More detail: There are science authors who are clear experts in
their field, yet have trouble communicating their knowledge. Then
there are science authors who write with clarity, but achieve it
by dumbing down technical details to cater to a broad readership.
Finally, there are authors who are experts and leaders in their
field, who are actively contributing to the forefront of research,
who are excellent writers, and who can communicate complex
concepts to a diverse audience with acumen, without glossing over
important details. Soumen Chakrabarti is one such author. "Mining
the Web" is a stunning achievement. It is an excellent summary of
the past decade or so of research in the area, covering nearly all
of the important bases, including the machinery of Web crawling,
Web information retrieval (i.e., search engines), clustering,
automated classification, semi-supervised approaches, social
network analysis, and focused crawling. Though Chakrabarti himself
has contributed prominently to the field, this book is not at all
the vehicle for self-promotion that other specialist texts
sometimes feel like. The book should be valuable to newcomers,
students, and experts alike, and could certainly serve as an
excellent course textbook. High-level concepts can be grasped with
little mathematical background, yet more technically sophisticated
readers will not be disappointed: most topics do include rigorous
coverage. The text is well organized, well written, and well
conceived. It's design, including generous and illuminating
figures and illustrations, possesses an artist's touch, perhaps
not surprising given that Chakrabarti designs his own font
libraries in his (apparently scant) spare time. It's hard to
imagine where Chakrabarti found the time to write such a
comprehensive and thoughtful book, but I'm not asking any
questions: I'm thrilled with the outcome. The book is a must-have
reference for anyone working in -- or aspiring to work in -- the
crossroads of Web algorithmics, data mining, and machine learning.

David M. Pennock
Senior Research Scientist, Overture Services, Inc.
[website]

Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


11 of 12 people found the following review helpful:
5.0 out of 5 stars A wonderful textbook for machine learning over the web, September 8, 2004
This review is from: Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)
This book is one of the best computer science textbooks i have ever seen. Apart from the wealth of information and discussion on specific WEB crawling and data mining (chapters 2, 3, 7, 8), chapters 4, 5 and 6 constitute a wonderful summary of machine learning in general.

The book's discussion of unsupervised learning (the EM algorithm, advanced algorithms in which the number of clusters is not known in advance), supervised learning (Bayesian networks, entropian methods, SVMs), semisupervised learning, co-training and rule induction is extraordinary in that it is short, intuitive, does not sacrifice mathematical rigor, and accompanied by examples (all taken from information retreival over the web).
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


7 of 9 people found the following review helpful:
4.0 out of 5 stars Great coverage, but quite a few errors, June 3, 2005
This review is from: Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)
The book is an absolute must for those working in information retrieval, and in particular web information retrieval and web mining. These areas are quite hot (again) both for the academics as well as for industry. I personally enjoyed the fact that there is no discussion of semantic web research directions (Jena, OWL etc.) but others might not... The material is quite tightly brought together and very comprehensibly written. However, especially in chapters 4 and 5 there are many pages containing mathematical errors (either in the formulas or in the algorithms described.) For this reason, I rate an otherwise excellent textbook with 4 stars.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


9 of 12 people found the following review helpful:
5.0 out of 5 stars The Best Web Data Mining Text, July 2, 2003
This review is from: Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)
This book is simply the best web data mining text available. It is simultaneously broad and deep, covering a wide array of topics yet delving into the meatiest parts of Web data mining. Topics covered include classic information retrieval, graph theoretic approaches, Web measurements, and even machine learning methods such as clustering and text classification. One of the reasons why the book succeeds is that Chakrabarti is himself a major contributor to the field. His writing is always clear and precise probably because he frequently lectures on these topics. If you buy one book about data mining on the Web, this should be that book.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


13 of 18 people found the following review helpful:
5.0 out of 5 stars Much needed book on Web mining, April 29, 2003
By 
Gautam Pant (Iowa City, IA United States) - See all my reviews
This review is from: Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)
This book is an excellent introduction to a number of techniques in information retrieval, machine learning, data mining, network analysis and the application of such techniques to the Web. It discusses many research issues as well as provides practical insights into constructing Web mining tools and systems. Chakrabarti has brought the wisdom of researchers in the area of Web mining to a wider audience. I think the book will prompt the development of new courses for graduate as well as senior undergraduate students.

The first part of the book deals with interesting practical and theoretical issues related with designing large-scale Web crawlers and search engines. Chapter 4 and 5 are a good introduction to various unsupervised and supervised learning methods. Although proper understanding of advanced methods like the LSI are possible only through adequate foundation in linear algebra (you can get only a flavor of the technique in the book). Part III of the book is my personal favorite. It has detailed description of various social network analysis methods, some of which have been applied by modern search engines like Google. Focused crawling, an area that the author has personally shaped, is also explained well. The book ends with a brief peek into the future of Web mining.

The comprehensive yet easy to read nature of the book makes it a valuable addition to my shelf. It is hard to find a comparable book in the area of Web mining.

Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


4 of 5 people found the following review helpful:
5.0 out of 5 stars Readable, approachable, informative, December 14, 2004
Amazon Verified Purchase(What's this?)
This review is from: Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)
The field of relevance algorithms for the web is still relatively new and the author provides a clear, informative introduction to the still-developing field. Many references to real problems are discussed, and the author avoids needless use of equations or symbolic logic when a simple textual explanation is more appropriate. This is the book that the authors of "Modelling the Internet and the Web" should have written. Avoid that book, it is a confusing disaster.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


6 of 8 people found the following review helpful:
5.0 out of 5 stars The best general purpose book on the subject I've seen, October 5, 2004
By 
Niall O'Driscoll (San Francisco, CA United States) - See all my reviews
(REAL NAME)   
Amazon Verified Purchase(What's this?)
This review is from: Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)
Probably not a book you're going to put on your coffee table, but if you've got any interest in this subject matter at all this is a book worth having. You can flip it open to just about any page and find something interesting. Most of the descriptions in this book move from general to specific, so you can jump around from chapter to chapter getting an overview, or dig more deeply when you want more detail. The references at the end of each chapter are also very useful. Whether you want a survey of the field or are trying to implement something specific, this book is a valuable resource.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


3 of 4 people found the following review helpful:
5.0 out of 5 stars comprehensive web mining book though 326 pages, April 23, 2007
This review is from: Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)
I still gave it 5 stars though the effective page number is 326. There are mainly 3 sections in the book --- the first section is 79 pages walks you thru the basic structure of a web search engine, the 2nd one talks about the learning process (clustering, classification and so on), yes, I know it is AI related stuffs, but this book does not have too much equation and is quite readable. From page 203 is the 3rd section --- application which includes page ranking and other interesting topics.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


4.0 out of 5 stars interesting information, November 6, 2010
Amazon Verified Purchase(What's this?)
This review is from: Mining the Web: Discovering Knowledge from Hypertext Data (Hardcover)
very interesting book.. data mining is one of the most interesting fields in computer science and this book covers very interesting parts I enjoyed reading it
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


Most Helpful First | Newest First

This product

Mining the Web: Discovering Knowledge from Hypertext Data
Mining the Web: Discovering Knowledge from Hypertext Data by Soumen Chakrabarti (Hardcover - October 23, 2002)
$85.95 $51.20
In Stock
Add to cart Add to wishlist