or
Sign in to turn on 1-Click ordering.
 
 
Express Checkout with PayPhrase
What's this? | Create PayPhrase
Sorry!
More Buying Choices
29 used & new from $24.35

Have one to sell? Sell yours here
 
   
Text Mining Application Programming (Programming Series)
 
See larger image
 
Tell the Publisher!
I’d like to read this book on Kindle

Don’t have a Kindle? Get your Kindle here.
 
  

Text Mining Application Programming (Programming Series) (Paperback)

~ (Author)
4.8 out of 5 stars  See all reviews (5 customer reviews)

List Price: $59.95
Price: $27.06 & this item ships for FREE with Super Saver Shipping. Details
You Save: $32.89 (55%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.

Want it delivered Wednesday, November 11? Choose One-Day Shipping at checkout. Details
16 new from $24.35 13 used from $24.87

Frequently Bought Together

Text Mining Application Programming (Programming Series) + The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data + Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)
Price For All Three: $114.81

Show availability and shipping details


Customers Who Bought This Item Also Bought

Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)

Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)

by Bing Liu
4.3 out of 5 stars (3)  $37.13
Building Search Applications: Lucene, LingPipe, and Gate

Building Search Applications: Lucene, LingPipe, and Gate

by Manu Konchady
4.5 out of 5 stars (4)  $40.45
Introduction to Information Retrieval

Introduction to Information Retrieval

by Christopher D. Manning
4.4 out of 5 stars (10)  $41.19
Programming Collective Intelligence: Building Smart Web 2.0 Applications

Programming Collective Intelligence: Building Smart Web 2.0 Applications

by Toby Segaran
4.5 out of 5 stars (52)  $26.39
Foundations of Statistical Natural Language Processing

Foundations of Statistical Natural Language Processing

by Christopher D. Manning
4.6 out of 5 stars (11)  $67.03
Explore similar items

Editorial Reviews

Product Description

Text Mining Application Programming teaches software developers how to mine the vast amounts of information available on the Web, internal networks, and desktop files and turn it into usable data. The book helps developers understand the problems associated with managing unstructured text, and explains how to build your own mining tools using standard statistical methods from information theory, artificial intelligence, and operations research. Each of the topics covered are thoroughly explained and then a practical implementation is provided. The book begins with a brief overview of text data, where it can be found, and the typical search engines and tools used to search and gather this text. It details how to build tools for extracting and using the text, and covers the mathematics behind many of the algorithms used in building these tools. From there you'll learn how to build tokens from text, construct indexes, and detect patterns in text. You'll also find methods to extract the names of people, places, and organizations from an email, a news article, or a Web page. The next portion of the book teaches you how to find information on the Web, the structure of the Web, and how to build spiders to crawl the Web. Text categorization is also described in the context of managing email. The final part of the book covers information monitoring, summarization, and a simple Question & Answer (Q&A) system. The code used in the book is written in Perl, but knowledge of Perl is not necessary to run the software. Developers with an intermediate level of experience with Perl can customize the software. Although the book is about programming, methods are explained with English-like pseudocode and the source code is provided on the CD-ROM. After reading this book, you'll be ready to tap into the bevy of information available online in ways you never thought possible.


About the Author

Manu Konchady (Oakton,VA) is a consultant working on open source text mining software. Previously, he worked at Mitre Corp. where he designed and developed software to mine the Internet. He received his Ph.D. in Information Technology from George Mason University and his articles have appeared in Dr. Dobb's Journal and Linux Journal.

Product Details


More About the Author

Manu Konchady
Discover books, learn about writers, read author blogs, and more.

Visit Amazon's Manu Konchady Page

What Do Customers Ultimately Buy After Viewing This Item?


Tags Customers Associate with This Product

 (What's this?)
Click on a tag to find related items, discussions, and people.
 
(3)

Your tags: Add your first tag
 

Sell a Digital Version of This Book in the Kindle Store

If you are a publisher or author and hold the digital rights to a book, you can sell a digital version of it in our Kindle Store. Learn more

 

Customer Reviews

5 Reviews
5 star:
 (4)
4 star:
 (1)
3 star:    (0)
2 star:    (0)
1 star:    (0)
 
 
 
 
 
Average Customer Review
4.8 out of 5 stars (5 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

 
6 of 7 people found the following review helpful:
5.0 out of 5 stars Good book to bootstrap yourself into Text Mining, May 3, 2008
I am a Java web/search programmer who wanted to "get into" text mining. I found this book an excellent resource for this. Text Mining is a field in which active research is still going on, and other Text Mining books I have looked at reflect this - the authors expect you to have a certain degree of mathematical background to understand what they are saying. This book explains briefly the math behind each of the approaches, but it focuses more on the algorithms that result from the math, so it is easier to read.

Of course, a side effect of this is that the approaches described are not necessarily the state of the art for solving any given problem, but once you get the basic approach to solving a problem, it is relatively easy to find and understand the documentation on the web for the more advanced approaches, since you now know what you are looking for and how it differs from your basic solution.

The book does have a (fairly long) chapter where it covers the math background necessary to get started with Text Mining. If you understand the stuff in there, you will actually be able to think up solutions to text mining problems that are unique to your own situation.

The algorithms in the book are in pseudo-code, but the book comes with a CD (or download from the author's sourceforge project textmine.sf.net) where you can see working Perl code.

Overall, I think this is one of the most useful books that I have purchased in a while. It should appeal most to programmer types who have programmed in their language(s) of choice for a while in areas other than text mining, wants to get into text mining, and doesn't want to spend a lot of time relearning high school and college math before starting off.
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
2 of 2 people found the following review helpful:
5.0 out of 5 stars Excellent!, July 4, 2008
By J. Carlson (Boston MA) - See all my reviews
(REAL NAME)   
This is a well written book, code is easy to download, and a number of topics. All in all though, the writing is clear and easily understood so it's well worth the money...
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
2 of 3 people found the following review helpful:
4.0 out of 5 stars A Great Subject, March 29, 2008
By C. Hivert (Foster City, CA, USA) - See all my reviews
(REAL NAME)   
Text mining is one of the most exciting subjects of the web, and too few books are dealing with it. This one is one of them, and it gives quite a few examples of text mining applications, like spam filters or search engine ranking algorithms. The style is easy to follow, and the concepts easy to understand given some maths background.

However, I expected more details, and a richer content overall, thus the four stars. This is still a good book.
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)


Share your thoughts with other customers: Create your own review
 
 
 
Most Recent Customer Reviews

5.0 out of 5 stars An excellent guide to mining the Net
Software developers learn how to mine information on the Web and turn it into valuable data; but developers need to understand how data mining works. Read more
Published on July 2, 2006 by Midwest Book Review

5.0 out of 5 stars How to Find Information
There is an old expression that half of knowing anything is knowing where to find it. And there is little more frustrating to be looking at 'My Computer' trying to find what you... Read more
Published on June 6, 2006 by John Matlock

Only search this product's reviews



Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 

Search Customer Discussions
Search all Amazon discussions
   




Product Information from the Amapedia Community

Beta (What's this?)


Look for Similar Items by Category


Look for Similar Items by Subject

 

Feedback

If you need help or have a question for Customer Service, contact us.
 Would you like to update product info or give feedback on images?
Is there any other feedback you would like to provide?

Your comments can help make our site better for everyone.


Your Recent History

 (What's this?)

After viewing product detail pages or search results, look here to find an easy way to navigate back to pages you are interested in.