| Digital List Price: | $24.99 |
| Kindle Price: | $18.04 Save $6.95 (28%) |
| Sold by: | Amazon.com Services LLC |
Your Memberships & Subscriptions
Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required.
Read instantly on your browser with Kindle for Web.
Using your mobile phone camera - scan the code below and download the Kindle app.
Follow the author
OK
Data Science Essentials in Python: Collect - Organize - Explore - Predict - Value (The Pragmatic Programmers) 1st Edition, Kindle Edition
Go from messy, unstructured artifacts stored in SQL and NoSQL databases to a neat, well-organized dataset with this quick reference for the busy data scientist. Understand text mining, machine learning, and network analysis; process numeric data with the NumPy and Pandas modules; describe and analyze data using statistical and network-theoretical methods; and see actual examples of data analysis at work. This one-stop solution covers the essential data science you need in Python.
Data science is one of the fastest-growing disciplines in terms of academic research, student enrollment, and employment. Python, with its flexibility and scalability, is quickly overtaking the R language for data-scientific projects. Keep Python data-science concepts at your fingertips with this modular, quick reference to the tools used to acquire, clean, analyze, and store data.
This one-stop solution covers essential Python, databases, network analysis, natural language processing, elements of machine learning, and visualization. Access structured and unstructured text and numeric data from local files, databases, and the Internet. Arrange, rearrange, and clean the data. Work with relational and non-relational databases, data visualization, and simple predictive analysis (regressions, clustering, and decision trees). See how typical data analysis problems are handled. And try your hand at your own solutions to a variety of medium-scale projects that are fun to work on and look good on your resume.
Keep this handy quick guide at your side whether you're a student, an entry-level data science professional converting from R to Python, or a seasoned Python developer who doesn't want to memorize every function and option.
What You Need:
You need a decent distribution of Python 3.3 or above that includes at least NLTK, Pandas, NumPy, Matplotlib, Networkx, SciKit-Learn, and BeautifulSoup. A great distribution that meets the requirements is Anaconda, available for free from www.continuum.io. If you plan to set up your own database servers, you also need MySQL (www.mysql.com) and MongoDB (www.mongodb.com). Both packages are free and run on Windows, Linux, and Mac OS.
- ISBN-13978-1680501841
- Edition1st
- PublisherPragmatic Bookshelf
- Publication dateAugust 10, 2016
- LanguageEnglish
- File size3263 KB
Customers who read this book also read
From the brand
-
The Pragmatic Programmers publishes hands-on, practical books on classic and cutting-edge software development and engineering management topics. We help professionals solve real-world problems, hone their skills, and advance their careers.
-
From the Publisher
About this Book
This book covers data acquisition, cleaning, storing, retrieval, transformation, visualization, elements of advanced data analysis (network analysis), statistics, and machine learning. It is not an introduction to data science or a general data science reference, although you’ll find a quick overview of how to do data science in Chapter 1, What Is Data Science?. I assume that you have learned the methods of data science, including statistics, elsewhere. The subject index at the end of the book refers to the Python implementations of the key concepts, but in most cases you will already be familiar with the concepts.
You’ll find a summary of Python data structures; string, file, and web functions; regular expressions; and even list comprehension in Chapter 2, Core Python for Data Science. This summary is provided to refresh your knowledge of these topics, not to teach them. There are a lot of excellent Python texts, and having a mastery of the language is absolutely important for a successful data scientist.
The first part of the book looks at working with different types of text data, including processing structured and unstructured text, processing numeric data with the NumPy and Pandas modules, and network analysis. Three more chapters address different analysis aspects: working with relational and non-relational databases, data visualization, and simple predictive analysis.
This book is partly a story and partly a reference. Depending on how you see it, you can either read it sequentially or jump right to the index, find the function or concept of concern, and look up relevant explanations and examples. In the former case, if you are an experienced Python programmer, you can safely skip Chapter 2, Core Python for Data Science. If you do not plan to work with external databases (such as MySQL), you can ignore Chapter 4, Working with Databases, as well. Lastly, Chapter 9, Probability and Statistics, assumes that you have no idea about statistics. If you do, you have an excuse to bypass the first two units and find yourself at Unit 47, Doing Stats the Python Way.
About the Audience
At this point, you may be asking yourself if you want to have this book on your bookshelf.
The book is intended for graduate and undergraduate students, data science instructors, entry-level data science professionals—especially those converting from R to Python—and developers who want a reference to help them remember all of the Python functions and options.
Is that you? If so, abandon all hesitation and enter.
Editorial Reviews
Review
- "This book does a fantastic job at summarizing the various activities when wrangling data with Python. Each exercise serves an interesting challenge that is fun to pursue. The book should no doubt be on the reading list of every aspiring data scientist." - Peter Hampton, Ulster University
- "Data Science Essentials in Python gets you to speed with the most common tasks and tools in the data science field. It's a quick introduction to many different techniques for fetching, cleaning, analyzing, and storing your data. This book helps you stay productive so you can spend less time on technology research and more on your intended research." - Jason Montojo, Coauthor of Practical Programming: An Introduction to Computer Science Using Python 3
- "For those who are highly curious and passionate about problem solving and making data discoveries, Data Science Essentials in Python provides deep insights and the right set of tools and techniques to start with. Well-drafted examples and exercises make it practical and highly readable." - Lokesh Kumar Makani, CASB expert, Skyhigh Networks
About the Author
Dmitry Zinoviev has an MS in Physics from Moscow State University and a PhD in Computer Science from Stony Brook University. His research interests include computer simulation and modeling, network science, social network analysis, and digital humanities. He has been teaching at Suffolk University in Boston, MA since 2001.
Product details
- ASIN : B01LZXVRJW
- Publisher : Pragmatic Bookshelf; 1st edition (August 10, 2016)
- Publication date : August 10, 2016
- Language : English
- File size : 3263 KB
- Simultaneous device usage : Unlimited
- Text-to-Speech : Enabled
- Screen Reader : Supported
- Enhanced typesetting : Enabled
- X-Ray : Not Enabled
- Word Wise : Not Enabled
- Sticky notes : On Kindle Scribe
- Print length : 225 pages
- Best Sellers Rank: #2,410,271 in Kindle Store (See Top 100 in Kindle Store)
- #827 in Data Modeling & Design (Kindle Store)
- #1,602 in Python Computer Programming
- #1,883 in Data Modeling & Design (Books)
- Customer Reviews:
About the author

Dmitry is a professor of Computer Science at Suffolk University. He loves C and Python programming, complex networks, computational social science, and digital humanities.
Customer reviews
Customer Reviews, including Product Star Ratings help customers to learn more about the product and decide whether it is the right product for them.
To calculate the overall star rating and percentage breakdown by star, we don’t use a simple average. Instead, our system considers things like how recent a review is and if the reviewer bought the item on Amazon. It also analyzed reviews to verify trustworthiness.
Learn more how customers reviews work on AmazonReviews with images

-
Top reviews
Top reviews from the United States
There was a problem filtering reviews right now. Please try again later.
pandas, regular expression, etc. Examples code are easy to understand and follow up. You can skim through the book for less than a week and still learn a lot on the fundamentals of data science! It will be a great handbook for you. I recommended it!
Top reviews from other countries
First two exercises of the first chapter took me all day today thanks mainly due to fighting through syntax errors but that's my fault for coming to it unfamiliar with Python. Very satisfying to get things working though and after looking ahead the future chapters have got you doing some pretty powerful stuff.
Detto questo è un libro organizzato in moduli piccoli e procede in modo progressivo.
Ottimo per la Data Science, è anche una miniera di consigli su funzioni Python magari poco note.






