- Paperback: 548 pages
- Publisher: O'Reilly Media; 1 edition (December 10, 2016)
- Language: English
- ISBN-10: 9781491912058
- ISBN-13: 978-1491912058
- ASIN: 1491912057
- Product Dimensions: 7 x 1.2 x 10 inches
- Shipping Weight: 1.8 pounds (View shipping rates and policies)
- Average Customer Review: 42 customer reviews
- Amazon Best Sellers Rank: #19,161 in Books (See Top 100 in Books)
Enter your mobile number or email address below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.
To get the free app, enter your mobile phone number.
Python Data Science Handbook: Essential Tools for Working with Data 1st Edition
Use the Amazon App to scan ISBNs and compare prices.
Frequently bought together
Customers who bought this item also bought
From the Publisher
|Data Science for Business||Data Science from Scratch||Doing Data Science||R for Data Science||Data Science at the Command Line||Python Data Science Handbook|
|What You Need to Know about Data Mining and Data-Analytic Thinking||First Principles with Python||Straight Talk from the Frontline||Visualize, Model, Transform, Tidy, and Import Data||Facing the Future with Time-Tested Tools||Tools and Techniques for Developers|
About the Author
Jake VanderPlas is a long-time user and developer of the Python scientific stack. He currently works as an interdisciplinary research director at the University of Washington, conducts his own astronomy research, and spends time advising and consulting with local scientists from a wide range of fields.
Try the Kindle edition and experience these great reading features:
42 customer reviews
Review this product
Read reviews that mention
Showing 1-8 of 42 reviews
There was a problem filtering reviews right now. Please try again later.
Apart from that major oversight, the book is ok. If you want to learn data science, this is not for you; it doesn't get into the fundamentals much at all. If you are an experienced R user looking for how to translate into python, this will get you started. The rest of my review comes from this perspective.
The book spends far too much time on low-level ipython, numpy, and matplotlib functionality (chapters 1, 2, and 4). You are rarely going to use this stuff.
The pandas section (chapter 3) is fine, but I was a little disappointed in the treatment of the grouping/aggregation functions. The book mentions the split-apply-combine paradigm of Hadley Wickham, but doesn't cover the topic in nearly as much detail as the paper of the same name. I was hoping to learn how to translate the dplyr verbs (group_by, filter, select, mutate, summarize, arrange) into pandas, but this book doesn't provide that. You will learn the basics of grouping and aggregation, but your code is going to be a lot more verbose than it was in R.
The machine learning case studies in chapter 5 are pretty nice - probably the only reason I would recommend this book. The chapter provides a good overview of the scikit-learn API and effective patterns for machine learning problems.
There is no one book for data science, and this one is no exception. Just keep that in mind before buying it.
Other than that, I am really happy with my purchase.
P.S. For those complaining about black and white graphs and diagrams - check the author's GitHub.
I have used it extensively for the intro to ML at Berkeley and for now the book belongs to my short list of desk reference books.