Enter your mobile number below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.
Getting the download link through email is temporarily not available. Please check back later.

  • Apple
  • Android
  • Windows Phone
  • Android

To get the free app, enter your mobile phone number.

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications 1st Edition

3.5 out of 5 stars 15 customer reviews
ISBN-13: 978-0123869791
ISBN-10: 012386979X
Why is ISBN important?
ISBN
This bar-code number lets you verify that you're getting exactly the right version or edition of a book. The 13-digit and 10-digit formats both work.
Scan an ISBN with your phone
Use the Amazon App to scan ISBNs and compare prices.
Trade in your item
Get a $19.80
Gift Card.
Have one to sell? Sell on Amazon
Rent On clicking this link, a new layer will be open
$23.85 On clicking this link, a new layer will be open
Buy used On clicking this link, a new layer will be open
$40.90 On clicking this link, a new layer will be open
Buy new On clicking this link, a new layer will be open
$67.15 On clicking this link, a new layer will be open
More Buying Choices
29 New from $39.63 21 Used from $36.15
Free Two-Day Shipping for College Students with Prime Student Free%20Two-Day%20Shipping%20for%20College%20Students%20with%20Amazon%20Student


Windows10ForDummiesVideo
Windows 10 For Dummies Video Training
Get up to speed with Windows 10 with this video training course from For Dummies. Learn more.
$67.15 FREE Shipping. In Stock. Ships from and sold by Amazon.com. Gift-wrap available.
click to open popover

Frequently Bought Together

  • Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications
  • +
  • Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Google+, GitHub, and More
Total price: $98.43
Buy the selected items together

Editorial Reviews

Review

"They’ve done it again. From the same industry leaders who brought you the "bible" of data mining comes the definitive, go-to text mining resource. This book empowers you to dig in and seize value, with over two dozen hands-on tutorials that drive an incredible range of applications such as predicting marketing success and detecting customer sentiment, criminal lies, writing authorship, and patient schizophrenia. These step-by-step tutorials immediately place you in the practitioner’s driver’s seat, executing on text analytics. Beyond this, 17 more chapters cover the latest methods and the leading tools, making this the most comprehensive resource, and earning it a well-deserved place on your desk aside the authors’ data mining handbook." ― Eric Siegel, Ph.D., Founder, Predictive Analytics World, Text Analytics World and Prediction Impact, Inc.

“Of the number of statistics books that are published each year, only a few stand out as really being important, meaning that they positively influence how future research is done in the subject area of the text. I believe that Practical Text Mining is just such a book.” ― Joseph M. Hilbe, JD, PhD, Arizona State University and Jet Propulsion Laboratory

“When you want real help extracting insight from the mountains of text that you’re facing, this is the book to turn to for immediate practical advice.” ― Karl Rexer, PhD, President, Rexer Analytics, Boston, MA

"The underlying premise is that almost all data in databases takes the form of unstructured text, or summaries of unstructured text, and that historians, marketers, crime investigators, and others need to know how to search that text for meaningful patterns ― a very different process than reading. Contributors in a range of fields share their insights and experience with the process. After setting out the principles, they present tutorials and case studies, then move on to advanced topics." ― Reference and Research Book News, Inc. "The authors of Practical Text Mining and Statistical Analysis for Nonstructured Text Data Applications have managed to produce three books in one. First, in 17 chapters they give a friendly yet comprehensive introduction to the huge field of text mining, a field comprising techniques from several different disciplines and a variety of different tasks. Miner and his colleagues have produced a readable overview of the area that is sure to help the practitioner navigate this large and unruly ocean of techniques. Second, the authors provide a comprehensive list and review of both the commercial and free software available to perform most text data mining tasks. Finally, and most importantly, the authors have also provided an amazing collection of tutorials and case studies. The tutorials illustrate various text mining scenarios and paths actually taken by researchers, while the case studies go into even more depth, showing both the methodology used and the business decisions taken based on the analysis. These practical step-by-step guides are impressive not only in the breadth of their applications but in the depth and detail that each case study delivers. The studies are authored by several guest authors in addition to the book authors and are built on real problems with real solutions. These case studies and tutorials alone make the book worth having. I have never seen such a collection of real business problems published in any field, much less in such a new field as text mining. These, together with the explanations in the chapters, should provide the practitioner wishing to get a broad view of the text mining field an invaluable resource for both learning and practice. ― Richard De Veaux Professor of Statistics; Dept. of Mathematics and Statistics; Williams College; Williamstown MA 01267 "In writing Practical Text Mining and Statistical Analysis for Nonstructured Text Data Applications, the six authors (Miner, Delen, Elder, Fast, Hill, and Nisbet) accepted the daunting task of creating a cohesive operational framework from the disparate aspects and activities of text mining, an emerging field that they appropriately describe as the "Wild West" of data mining. Tapping into their unique expertise and applying a wide cross-application lens, they have succeeded in their mission. Rather than listing the facets of text mining simply as independent academic topics of discussion, the book leans much more to the practical, presenting a conceptual road map to assist users in correlating articulated text mining techniques to categories of actual commonly observed business needs. To finish out the job, summaries for some of the most prevalent commercial text mining solutions are included, along with examples. In this way, the authors have uniquely presented a text mining resource with value to readers across that breadth of business applications."  ― Gerard Britton, J.D. V.P., GRC Analytics, Opera Solutions LLC "Text Mining is one of those phrases people throw around as though it describes something singular. As the authors of Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications show us, nothing could be further from the truth. There is a rich, diverse ecosystem of text mining approaches and technologies available. Readers of this book will discover a myriad of ways to use these text mining approaches to understand and improve their business. Because the authors are a practical bunch the book is full of examples and tutorials that use every approach, multiple commercial and open source tools, and that show the power and trade-offs each involves. The case studies are worked through in detail by the authors so you can see exactly how things would be done and learn how to apply it to your own problems. If you are interested in text mining, and you should be, this book will give you a perspective that is broad, deep and approachable." ― James Taylor CEO Decision Management Solutions

About the Author

Dr. Gary Miner received a B.S. from Hamline University, St. Paul, MN, with biology, chemistry, and education majors; an M.S. in zoology and population genetics from the University of Wyoming; and a Ph.D. in biochemical genetics from the University of Kansas as the recipient of a NASA pre-doctoral fellowship. He pursued additional National Institutes of Health postdoctoral studies at the U of Minnesota and U of Iowa eventually becoming immersed in the study of affective disorders and Alzheimer's disease.

In 1985, he and his wife, Dr. Linda Winters-Miner, founded the Familial Alzheimer's Disease Research Foundation, which became a leading force in organizing both local and international scientific meetings, bringing together all the leaders in the field of genetics of Alzheimer's from several countries, resulting in the first major book on the genetics of Alzheimer’s disease. In the mid-1990s, Dr. Miner turned his data analysis interests to the business world, joining the team at StatSoft and deciding to specialize in data mining. He started developing what eventually became the Handbook of Statistical Analysis and Data Mining Applications (co-authored with Drs. Robert A. Nisbet and John Elder), which received the 2009 American Publishers Award for Professional and Scholarly Excellence (PROSE). Their follow-up collaboration, Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications, also received a PROSE award in February of 2013. Overall, Dr. Miner’s career has focused on medicine and health issues, so serving as the ‘project director’ for this current book on ‘Predictive Analytics of Medicine - Healthcare Issues’ fit his knowledge and skills perfectly.

Gary also serves as VP & Scientific Director of Healthcare Predictive Analytics Corp; as Merit Reviewer for PCORI (Patient Centered Outcomes Research Institute) that awards grants for predictive analytics research into the comparative effectiveness and heterogeneous treatment effects of medical interventions including drugs among different genetic groups of patients; additionally he teaches on-line classes in ‘Introduction to Predictive Analytics’, ‘Text Analytics’, and ‘Risk Analytics’ for the University of California-Irvine, and other classes in medical predictive analytics for the University of California-San Diego; he spends most of his time in his primary role as Senior Analyst-Healthcare Applications Specialist for Dell | Information Management Group, Dell Software (through Dell’s acquisition of StatSoft in April 2014).

Dr. John Elder heads the United States’ leading data mining consulting team, with offices in Charlottesville, Virginia; Washington, D.C.; and Baltimore, Maryland (www.datamininglab.com). Founded in 1995, Elder Research, Inc. focuses on investment, commercial, and security applications of advanced analytics, including text mining, image recognition, process optimization, cross-selling, biometrics, drug efficacy, credit scoring, market sector timing, and fraud detection. John obtained a B.S. and an M.E.E. in electrical engineering from Rice University and a Ph.D. in systems engineering from the University of Virginia, where he’s an adjunct professor teaching Optimization or Data Mining. Prior to 16 years at ERI, he spent five years in aerospace defense consulting, four years heading research at an investment management firm, and two years in Rice's Computational & Applied Mathematics Department.

Dr. Andrew Fast leads research in text mining and social network analysis at Elder Research. Dr. Fast graduated magna cum laude from Bethel University and earned an M.S. and a Ph.D. in computer science from the University of Massachusetts Amherst. There, his research focused on causal data mining and mining complex relational data such as social networks. At ERI, Andrew leads the development of new tools and algorithms for data and text mining for applications of capabilities assessment, fraud detection, and national security. Dr. Fast has published on an array of applications, including detecting securities fraud using the social network among brokers and understanding the structure of criminal and violent groups. Other publications cover modeling peer-to-peer music file sharing networks, understanding how collective classification works, and predicting playoff success of NFL head coaches (work featured on ESPN.com).

Thomas Hill received his Vordiplom in psychology from Kiel University in Germany and earned an M.S. in industrial psychology and a Ph.D. in psychology and quantitative methods from the University of Kansas. He was associate professor (and then research professor) at the University of Tulsa from 1984 to 2009, where he taught data analysis and data mining courses. He also has been vice president for Research and Development and then Analytic Solutions at StatSoft Inc., where he has been involved for over 20 years in the development of data analysis, data and text mining algorithms, and the delivery of analytic solutions. Dr. Hill joined Dell through Dell’s acquisition of StatSoft in April 2014, and he is currently the Executive Director for Analytics at Dell’s Information Management Group.

Dr. Hill has received numerous academic grants and awards from the National Science Foundation, the National Institute of Health, the Center for Innovation Management, the Electric Power Research Institute, and other institutions. He has completed diverse consulting projects with companies from practically all industries and has worked with the leading financial services, insurance, manufacturing, pharmaceutical, retailing, and other companies in the United States and internationally on identifying and refining effective data mining and predictive modeling solutions for diverse applications. Dr. Hill has published widely on innovative applications for data mining and predictive analytics. He is the author (with Paul Lewicki, 2005) of Statistics: Methods and Applications, the Electronic Statistics Textbook (a popular on-line resource on statistics and data mining), a co-author of Practical Text Mining and Statistical Analysis for Non-Structured Text Data Applications (2012); he is also a contributing author to the popular Handbook of Statistical Analysis and Data Mining Applications (2009).

Dr. Robert Nisbet was trained initially in Ecology and Ecosystems Analysis. He has over 30 years’ experience in complex systems analysis and modeling, most recently as a Researcher (University of California, Santa Barbara). In business, he pioneered the design and development of configurable data mining applications for retail sales forecasting, and Churn, Propensity-to-buy, and Customer Acquisition in Telecommunications Insurance, Banking, and Credit industries. In addition to data mining, he has expertise in data warehousing technology for Extract, Transform, and Load (ETL) operations, Business Intelligence reporting, and data quality analyses. He is lead author of the “Handbook of Statistical Analysis & Data Mining Applications” (Academic Press, 2009), and a co-author of "Practical Text Mining" (Academic Press, 2012). Currently, he serves as an Instructor in the University of California, Irvine Predictive Analytics Certification Program, teaching online courses in Effective Data preparation, and co-teaching Introduction to Predictive Analytics.

Dr. Dursun Delen is the William S. Spears Chair in Business Administration and Associate Professor of Management Science and Information Systems in the Spears School of Business at Oklahoma State University (OSU). He received his Ph.D. in industrial engineering and management from OSU in 1997. Prior to his appointment as an assistant professor at OSU in 2001, he worked for a privately owned research and consultancy company, Knowledge Based Systems Inc., in College Station, Texas, as a research scientist for five years, during which he led a number of decision support and other information systems-related research projects funded by federal agencies, including DoD, NASA, NIST and DOE.
NO_CONTENT_IN_FEATURE
New York Times best sellers
Browse the New York Times best sellers in popular categories like Fiction, Nonfiction, Picture Books and more. See more

Product Details

  • Hardcover: 1000 pages
  • Publisher: Academic Press; 1 edition (January 25, 2012)
  • Language: English
  • ISBN-10: 012386979X
  • ISBN-13: 978-0123869791
  • Product Dimensions: 7.6 x 1.7 x 9.3 inches
  • Shipping Weight: 4 pounds (View shipping rates and policies)
  • Average Customer Review: 3.5 out of 5 stars  See all reviews (15 customer reviews)
  • Amazon Best Sellers Rank: #685,387 in Books (See Top 100 in Books)

Customer Reviews

Top Customer Reviews

By Ian K. VINE VOICE on August 10, 2012
Format: Hardcover
The early chapters of this book are misleading. They provide a decent overview of the issues in text minding, but nothing more. The rest of the book does not build on these chapters. Instead the later chapters are nothing more than user manuals or tutorials on software tools (mostly commercial products, not open source tools) to do text mining. Most of these chapters are taken up by screen shots of the tools and notations on what to do at various points (e.g., "click OK").

This is a huge disappointment. This book does not contain the information that you would need to know if you were to develop your own software tools. Nor does it have the utility of a book on combining existing software (i.e., class libraries in Java or R libraries) to create applications. This book is basically an MBA's guide to text mining. Even as an "idiots guide to text mining" this book is of questionable value, since the tools and software interfaces will change. New tools will soon be available and replace the ones in the book.

I've spent the last year using the R mathematics language. The section on R and text mining is completely inadequate.

At best this is an expensive book that has a short self-life. For people who are not software engineers but want to do text mining this book might be useful. However, for a software engineer, the book is of little use. I am very disappointed. I suppose that the fault is partially mine for not looking at the table of contents carefully enough.

In summary: if you are an MBA or marketing professional without a quantitative background (e.g.
Read more ›
4 Comments 56 people found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse
Format: Hardcover
I picked up a friends copy while visiting him and I wasn't surprised - part of the book wants to sell you stuff (commercial, expensive, not really needed software). There is an obligatory mention of R but nothing concrete, truly useful.

Do your self a favor and pick any of the Orielly or Manning publishers books on the subject. It's a lot more practical, a lot more applied and something you can start doing right now. I am sure Amazon recommendation would be on this page :).

If you are new to text mining and have a little bit of programming experience pick up 'Natural Language Processing with Python' aka the NLTK book. It does teach you basics of python and the fundamentals of Text Mining in a very fluid, step by step manner. For creating robust pipelines look into 'Taming Text'. It's Java based so a background on Java would help. Thirdly, as you gain experience in this stuff, I would recommend looking at Mallet from Andrew McCallum (U of Mass., Amherst) - it's a java library that can be used from the command line and also used in your own code. It is seriously cool.

I know Text Mining is hot and there are commercial 'tools' out there that promise you the world. Nice looking charts and summaries are fine if you are an executive; however, if you want to make decisions on data that make an impact, I'd serious recommend picking up Python/Java/R and use libraries available for those to build complete data pipelines. Train models based on your text data and desired outcome, use those models to make predictions on fresh data, and make this automatic.

BTW - if you want to see real life Text Mining in action, look no further than Google and Facebook. Hint: they never use the commercial, point and click stuff this book advertises.
Comment 14 people found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse
Format: Hardcover Verified Purchase
It has a lot practical examples on how text mining is used in different industry. I like those examples. But the format and the structure of the book makes it like a user operation manual for those mining tools used in the book. I have expected depth analysis for each example.
Comment 5 people found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse
Format: Kindle Edition Verified Purchase
Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications

Where is the 'DataSets' folder in Kindle version? I don't have DVD for kindle version and feel that I am not able to get the full benefits of this book if the kindle 'Skip' this folder.
2 Comments 17 people found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse
Format: Kindle Edition Verified Purchase
I don't understand who would be the target audience of the book. Business people who don't know anything about text mining but who can afford to buy commercial text mining software which would costs thousands? More than half of the book is devoted to graphical, step by step instruction how to use a particular program such as Statistica? I'm deeply disappointed.
1 Comment 4 people found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse
Format: Hardcover Verified Purchase
This is the best book on text mining I have seen to date. The frameworks describing the applications and the way they explain the different concepts are simple and clear. The case studies using the different software applications really bring everything to life. It is my reference book in the matter. A great effort by the authors.
Comment One person found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse
Format: Hardcover Verified Purchase
Text analytics is a significant part of my work as a data analyst and analytics writer. This is the only book on text mining that I recommend to people who want to build a good working knowledge of text analytics for business and government applications.

This is not the book for programmers and linguists who want to learn how to build their own text analytics tools. And it's not for hackers who operate on a shoestring budget and lots of unpaid (or underpaid) labor. This book is for people who have other things to do, too, for people who pay for labor, rent and so forth, for people who want to understand how text analytics fits in with the data analysis methods that they already use for business.

The book covers text analytics concepts and philosophy, and includes many worked examples. The book gives the reader a good sense of what kinds of applications make sense for text analytics, what results can be expected,and how to analyze text.

The authors are competent analysts who work with real, live, paying clients. I've heard several of them speak, and found them to be some of the clearest and most informative speakers in the field. If I could have them all on call for expert advice, I would! Since that's not an option, I keep this book handy.
Comment One person found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse

Most Recent Customer Reviews

Set up an Amazon Giveaway

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications
Amazon Giveaway allows you to run promotional giveaways in order to create buzz, reward your audience, and attract new followers and customers. Learn more about Amazon Giveaway
This item: Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications