Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Google+, GitHub, and More Second Edition
| Matthew A. Russell (Author) Find all the books, read about the author, and more. See search results for this author |
Use the Amazon App to scan ISBNs and compare prices.
There is a newer edition of this item:
How can you tap into the wealth of social web data to discover who’s making connections with whom, what they’re talking about, and where they’re located? With this expanded and thoroughly revised edition, you’ll learn how to acquire, analyze, and summarize data from all corners of the social web, including Facebook, Twitter, LinkedIn, Google+, GitHub, email, websites, and blogs.
- Employ the Natural Language Toolkit, NetworkX, and other scientific computing tools to mine popular social web sites
- Apply advanced text-mining techniques, such as clustering and TF-IDF, to extract meaning from human language data
- Bootstrap interest graphs from GitHub by discovering affinities among people, programming languages, and coding projects
- Build interactive visualizations with D3.js, an extraordinarily flexible HTML5 and JavaScript toolkit
- Take advantage of more than two-dozen Twitter recipes, presented in O’Reilly’s popular "problem/solution/discussion" cookbook format
The example code for this unique data science book is maintained in a public GitHub repository. It’s designed to be easily accessible through a turnkey virtual machine that facilitates interactive learning with an easy-to-use collection of IPython Notebooks.
Frequently bought together

- +
Customers who viewed this item also viewed
Editorial Reviews
Review
When we first published "Mining the Social Web," I thought it was one of the most important books I worked on that year. Now that we're publishing a second edition (which I didn't work on), I find that I agree with myself. With this new edition, "Mining the Social Web" is more important than ever.
While we're seeing more and more cynicism about the value of data, and particularly "big data," that cynicism isn't shared by most people who actually work with data. Data has undoubtedly been overhyped and oversold, but the best way to arm yourself against the hype machine is to start working with data yourself, to find out what you can and can't learn. And there's no shortage of data around. Everything we do leaves a cloud of data behind it: Twitter, Facebook, Google+ -- to say nothing of the thousands of other social sites out there, such as Pinterest, Yelp, Foursquare, you name it. Google is doing a great job of mining your data for value. Why shouldn't you?
There are few better ways to learn about mining social data than by starting with Twitter; Twitter is really a ready-made laboratory for the new data scientist. And this book is without a doubt the best and most thorough approach to mining Twitter data out there. But that's only a starting point. We hear a lot in the press about sentiment analysis and mining unstructured text data; this book shows you how to do it. If you need to mine the data in web pages or email archives, this book shows you how. And if you want to understand how to people collaborate on projects, "Mining the Social Web" is the only place I've seen that analyzes GitHub data.
All of the examples in the book are available on Github. In addition to the example code, which is bundled into IPython notebooks, Matthew has provided a VirtualBox VM that installs Python, all the libraries you need to run the examples, the examples themselves, and an IPython server. Checking out the examples isr
About the Author
Matthew Russell, Chief Technology Officer at Digital Reasoning, Principal at Zaffra, and author of several books on technology including Mining the Social Web (O'Reilly, 2013), now in its second edition. He is passionate about open source software development, data mining, and creating technology to amplify human intelligence. Matthew studied computer science and jumped out of airplanes at the United States Air Force Academy. When not solving hard problems, he enjoys practicing Bikram Hot Yoga, CrossFitting and participating in triathlons.
I'd like to read this book on Kindle
Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.
Product details
- Publisher : O'Reilly Media; Second edition (November 5, 2013)
- Language : English
- Paperback : 448 pages
- ISBN-10 : 1449367615
- ISBN-13 : 978-1449367619
- Item Weight : 1.61 pounds
- Dimensions : 7 x 0.94 x 9.19 inches
- Best Sellers Rank: #1,384,554 in Books (See Top 100 in Books)
- #834 in Data Mining (Books)
- #909 in Social Media Guides
- #1,164 in Database Storage & Design
- Customer Reviews:
About the author

Matthew Russell (@ptwobrussell) is Chief Technology Officer at Digital Reasoning, Principal at Zaffra, and author of several books on technology including Mining the Social Web (O'Reilly, 2013), now in its second edition. He is passionate about open source software development, data mining, and creating technology to amplify human intelligence.
Matthew studied computer science and jumped out of airplanes at the United States Air Force Academy. When not solving hard problems, he enjoys practicing Bikram Hot Yoga, CrossFitting and participating in triathlons.
Customer reviews
Customer Reviews, including Product Star Ratings help customers to learn more about the product and decide whether it is the right product for them.
To calculate the overall star rating and percentage breakdown by star, we don’t use a simple average. Instead, our system considers things like how recent a review is and if the reviewer bought the item on Amazon. It also analyzed reviews to verify trustworthiness.
Learn more how customers reviews work on AmazonTop reviews from the United States
There was a problem filtering reviews right now. Please try again later.
Only three additional comments:
1) I'm a NON-TECHIE (healthcare) and this book provides such clear guidance that I feel capable of putting it to use, with NO prior experience with, or knowledge of programming. The value of that is not that I'll start programming, however, I can think for myself when it comes to potential uses, limitations, possibilities, challenges for how I want to put this to use.
2) You should get the E-BOOK VERSION, not the paper copy as I did, as it'll automatically be up to date, which is crucial as this is an interactive book.
3) This book should be required reading for everyone. Why? We need informed customers when it comes to digital products, especially as related to the social web, to understand what it is that you are sharing - blindly for most of us. You're essentially allowing people into your house to go through your drawers and documents. There are situations were you might want that. However, be aware of the kind of access your authorizing and to whom.
Matthew A. Russell - wow, well done!
The intro alone is more than worth the cost of admission - the easy installation of a VM saved me hours of dev ops headache I would have had to endure in my Python explorations. And it just gets better from there - everything is set up for the reader's convenience - just hit Ctrl-Enter and you're revealing the secrets of LinkedIn or Twitter or whatever.
The author clearly poured a lot of effort into his project, and it shows: this book sets a new standard for technical books (at least any technical books I've seen). If you have the slightest interest in the topic, check it out.
I also had the pleasure of interviewing Matthew Russell about this book on The Data Skeptic Podcast where we had the chance to have an interesting discussion about the book.
An addendum - SUPER fast support.
Mining the Social Web is exceptionally well written covering all major social media platforms. Mr. Russell is also very approachable and answers questions very quickly.
I really can't say enough good things about this book and how it sets the bar high for future technical books!
This book really gets you exactly what it says it does. You'll learn to work with the APIs for various social network websites. There's no extra cruft or fluff. You know exactly what you're getting and the book walks you through everything very cleanly at a very manageable pace.
Top reviews from other countries
This would have received one star for the ageing had it not been for the writing style. Fun to read.
You will be in no time connecting to Twitter / FB and understanding a lot of concepts about the web.
The best value will come if you have had some time to familiarize yourself with Python and IPython/Notebooks.
Highly highly recommend.
Si vous voulez comprendre le pourquoi du comment, allez plutôt voir "Doing Data Science" chez O'Reilly. Si vous voulez mettre les mains dans le cambouis, cet ouvrage est fait pour vous.



