Mining the Web: Discovering Knowledge from Hypertext Data and over one million other books are available for Amazon Kindle. Learn more
Qty:1
  • List Price: $96.95
  • Save: $4.85 (5%)
Only 1 left in stock (more on the way).
Ships from and sold by Amazon.com.
Gift-wrap available.
Add to Cart
Want it tomorrow, April 22? Order within and choose One-Day Shipping at checkout. Details
Condition: Used: Good
Comment: Used Good condition book may have signs of cover wear and/or marks on corners and page edges. Inside pages may have highlighting, writing and underlining.
Add to Cart
Trade in your item
Get a $7.29
Gift Card.
Have one to sell?
Flip to back Flip to front
Listen Playing... Paused   You're listening to a sample of the Audible audio edition.
Learn more

Mining the Web: Discovering Knowledge from Hypertext Data Hardcover

ISBN-13: 978-1558607545 ISBN-10: 1558607544 Edition: 1st

See all 3 formats and editions Hide other formats and editions
Amazon Price New from Used from Collectible from
Kindle
"Please retry"
Hardcover
"Please retry"
$92.10
$32.00 $20.70
Paperback
"Please retry"
$12.88

Free%20Two-Day%20Shipping%20for%20College%20Students%20with%20Amazon%20Student



Frequently Bought Together

Mining the Web: Discovering Knowledge from Hypertext Data + Spidering Hacks + Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)
Price for all three: $157.01

Buy the selected items together

NO_CONTENT_IN_FEATURE

Shop the new tech.book(store)
New! Introducing the tech.book(store), a hub for Software Developers and Architects, Networking Administrators, TPMs, and other technology professionals to find highly-rated and highly-relevant career resources. Shop books on programming and big data, or read this week's blog posts by authors and thought-leaders in the tech industry. > Shop now

Product Details

  • Hardcover: 344 pages
  • Publisher: Morgan Kaufmann; 1 edition (October 23, 2002)
  • Language: English
  • ISBN-10: 1558607544
  • ISBN-13: 978-1558607545
  • Product Dimensions: 9.5 x 7.5 x 1 inches
  • Shipping Weight: 1.4 pounds (View shipping rates and policies)
  • Average Customer Review: 4.8 out of 5 stars  See all reviews (9 customer reviews)
  • Amazon Best Sellers Rank: #684,796 in Books (See Top 100 in Books)

Editorial Reviews

Review

"...solid and beneficial to readers interested in Web data mining, especially those interested in the details of algorithmic implementation." = Bernard J. Jansen, Information Processing & Management

"The treatment is systematic, comprehensive and in-depth, yet very lucid and accessible to a wide range of Web technology developers. The author's insights and depth of knowledge as on of the pioneering researchers on hypertext information mining and retrieval are also evident in the extensive and useful bibliographic notes provided at the end of each chapter..." - Professor Joydeep Ghosh, University of Texas, Austin

"The author has done the community a great service by synthesizing all the important work in this field into an excellent book, which introduces fairly sophisticated material in an easy-to-read manner. This book for the first time, makes it possible to offer Web Mining as a real course." - Professor Jaideep Srivastava, University of Minnesota

" Mining the Web: Discovering Knowledge from Hypertext from Hypertext Data, by Soumen Chakrabarti, focuses extensively on building a better search engine crawler...Chakrabarti's book begins with a discussion of search engine crawlers in a chapter titled "Crawling the Web." The discussion in this chapter is technical and detailed. Readers learn about features such as the robots.txt file that can be written in a certain way to stop crawlers from visiting a page...The most interesting part of the book is perhaps Chapter 7, "Social Network Analysis." In this chapter, the author presents the most famous search engine algorithms (e.g., PageRank, HITS, SALSA)." - Journal of Marketing Research, Sandeep Krishnamurthy

"All in all this is an excellent book. I enjoyed the book and highly recommend it as a textbook for web data mining classes at graduate or senior undergraduate levels. Chakrabarti has a rich vocabulary and is a gifted writer. I bet he will write new, good books in the future, and he should. I look forward to them." - Fazli Can - Miami University

Book Description

The definitive book on mining the Web from the preeminent authority.

More About the Author

Discover books, learn about writers, read author blogs, and more.

Customer Reviews

4.8 out of 5 stars
5 star
7
4 star
2
3 star
0
2 star
0
1 star
0
See all 9 customer reviews
This book is simply the best web data mining text available.
Dr. G. W. Flake
The book should be valuable to newcomers, students, and experts alike, and could certainly serve as an excellent course textbook.
Dave P
The comprehensive yet easy to read nature of the book makes it a valuable addition to my shelf.
Gautam Pant

Most Helpful Customer Reviews

46 of 47 people found the following review helpful By Dave P on August 28, 2003
Format: Hardcover
Executive summary: This is a fabulous book, written with care and
precision, easy to read yet covering in detail a wide variety of
the most beautiful and promising developments in data mining and
machine learning as it relates to the World Wide Web, including a
prescient vision of where the field is headed in the future.
More detail: There are science authors who are clear experts in
their field, yet have trouble communicating their knowledge. Then
there are science authors who write with clarity, but achieve it
by dumbing down technical details to cater to a broad readership.
Finally, there are authors who are experts and leaders in their
field, who are actively contributing to the forefront of research,
who are excellent writers, and who can communicate complex
concepts to a diverse audience with acumen, without glossing over
important details. Soumen Chakrabarti is one such author. "Mining
the Web" is a stunning achievement. It is an excellent summary of
the past decade or so of research in the area, covering nearly all
of the important bases, including the machinery of Web crawling,
Web information retrieval (i.e., search engines), clustering,
automated classification, semi-supervised approaches, social
network analysis, and focused crawling. Though Chakrabarti himself
has contributed prominently to the field, this book is not at all
the vehicle for self-promotion that other specialist texts
sometimes feel like. The book should be valuable to newcomers,
students, and experts alike, and could certainly serve as an
excellent course textbook.
Read more ›
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
11 of 12 people found the following review helpful By Ari Rappoport on September 8, 2004
Format: Hardcover
This book is one of the best computer science textbooks i have ever seen. Apart from the wealth of information and discussion on specific WEB crawling and data mining (chapters 2, 3, 7, 8), chapters 4, 5 and 6 constitute a wonderful summary of machine learning in general.

The book's discussion of unsupervised learning (the EM algorithm, advanced algorithms in which the number of clusters is not known in advance), supervised learning (Bayesian networks, entropian methods, SVMs), semisupervised learning, co-training and rule induction is extraordinary in that it is short, intuitive, does not sacrifice mathematical rigor, and accompanied by examples (all taken from information retreival over the web).
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
8 of 10 people found the following review helpful By I. Christou on June 3, 2005
Format: Hardcover
The book is an absolute must for those working in information retrieval, and in particular web information retrieval and web mining. These areas are quite hot (again) both for the academics as well as for industry. I personally enjoyed the fact that there is no discussion of semantic web research directions (Jena, OWL etc.) but others might not... The material is quite tightly brought together and very comprehensibly written. However, especially in chapters 4 and 5 there are many pages containing mathematical errors (either in the formulas or in the algorithms described.) For this reason, I rate an otherwise excellent textbook with 4 stars.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
9 of 12 people found the following review helpful By Dr. G. W. Flake on July 2, 2003
Format: Hardcover
This book is simply the best web data mining text available. It is simultaneously broad and deep, covering a wide array of topics yet delving into the meatiest parts of Web data mining. Topics covered include classic information retrieval, graph theoretic approaches, Web measurements, and even machine learning methods such as clustering and text classification. One of the reasons why the book succeeds is that Chakrabarti is himself a major contributor to the field. His writing is always clear and precise probably because he frequently lectures on these topics. If you buy one book about data mining on the Web, this should be that book.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again
13 of 18 people found the following review helpful By Gautam Pant on April 28, 2003
Format: Hardcover
This book is an excellent introduction to a number of techniques in information retrieval, machine learning, data mining, network analysis and the application of such techniques to the Web. It discusses many research issues as well as provides practical insights into constructing Web mining tools and systems. Chakrabarti has brought the wisdom of researchers in the area of Web mining to a wider audience. I think the book will prompt the development of new courses for graduate as well as senior undergraduate students.
The first part of the book deals with interesting practical and theoretical issues related with designing large-scale Web crawlers and search engines. Chapter 4 and 5 are a good introduction to various unsupervised and supervised learning methods. Although proper understanding of advanced methods like the LSI are possible only through adequate foundation in linear algebra (you can get only a flavor of the technique in the book). Part III of the book is my personal favorite. It has detailed description of various social network analysis methods, some of which have been applied by modern search engines like Google. Focused crawling, an area that the author has personally shaped, is also explained well. The book ends with a brief peek into the future of Web mining.
The comprehensive yet easy to read nature of the book makes it a valuable addition to my shelf. It is hard to find a comparable book in the area of Web mining.
Comment Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback. If this review is inappropriate, please let us know.
Sorry, we failed to record your vote. Please try again

Product Images from Customers

Search
ARRAY(0xa3dbd30c)