Other Sellers on Amazon
& FREE Shipping
91% positive over last 12 months
Usually ships within 4 to 5 days.
+ $3.99 shipping
87% positive over last 12 months
Usually ships within 2 to 3 days.
+ $3.99 shipping
91% positive over last 12 months
Usually ships within 3 to 4 days.
Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required. Learn more
Read instantly on your browser with Kindle Cloud Reader.
Using your mobile phone camera - scan the code below and download the Kindle app.
Hands-On Data Analysis with Pandas: Efficiently perform data collection, wrangling, analysis, and visualization using Python
| Stefanie Molin (Author) Find all the books, read about the author, and more. See search results for this author |
| Price | New from | Used from |
There is a newer edition of this item:
Enhance your purchase
Key Features
- Perform efficient data analysis and manipulation tasks using pandas
- Apply pandas to different real-world domains using step-by-step demonstrations
- Get accustomed to using pandas as an effective data exploration tool
Hands-On Data Analysis with Pandas will show you how to analyze your data, get started with machine learning, and work effectively with Python libraries often used for data science, such as pandas, NumPy, matplotlib, seaborn, and scikit-learn. Using real-world datasets, you will learn how to use the powerful pandas library to perform data wrangling to reshape, clean, and aggregate your data. Then, you will learn how to conduct exploratory data analysis by calculating summary statistics and visualizing the data to find patterns. In the concluding chapters, you will explore some applications of anomaly detection, regression, clustering, and classification, using scikit-learn, to make predictions based on past data.
By the end of this book, you will be equipped with the skills you need to use pandas to ensure the veracity of your data, visualize it for effective decision-making, and reliably reproduce analyses across multiple datasets.
What you will learn
- Understand how data analysts and scientists gather and analyze data
- Perform data analysis and data wrangling in Python
- Combine, group, and aggregate data from multiple sources
- Create data visualizations with pandas, matplotlib, and seaborn
- Apply machine learning (ML) algorithms to identify patterns and make predictions
- Use Python data science libraries to analyze real-world datasets
- Use pandas to solve common data representation and analysis problems
- Build Python scripts, modules, and packages for reusable analysis code
- Introduction to Data Analysis
- Working with Pandas DataFrames
- Data Wrangling with Pandas
- Aggregating Pandas DataFrames
- Visualizing Data with Pandas and Matplotlib
- Plotting with Seaborn and Customization Techniques
- Financial Analysis - Bitcoin and the Stock Market
- Rule-based Anomaly Detection
- Getting Started with Machine Learning in Python
- Making Better Predictions - Optimizing ML Models
- Machine Learning Anomaly Detection
- The Road Ahead
- ISBN-101789615321
- ISBN-13978-1789615326
- PublisherPackt Publishing
- Publication dateJuly 26, 2019
- LanguageEnglish
- Dimensions7.5 x 1.67 x 9.25 inches
- Print length740 pages
Frequently bought together

Products related to this item
Editorial Reviews
About the Author
Stefanie Molin is a data scientist and software engineer at Bloomberg LP in NYC, tackling tough problems in information security, particularly revolving around anomaly detection, building tools for gathering data, and knowledge sharing. She has extensive experience in data science, designing anomaly detection solutions, and utilizing machine learning in both R and Python in the AdTech and FinTech industries. She holds a B.S. in operations research from Columbia University's Fu Foundation School of Engineering and Applied Science, with minors in economics, and entrepreneurship and innovation. In her free time, she enjoys traveling the world, inventing new recipes, and learning new languages spoken among both people and computers.
Product details
- Publisher : Packt Publishing (July 26, 2019)
- Language : English
- Paperback : 740 pages
- ISBN-10 : 1789615321
- ISBN-13 : 978-1789615326
- Item Weight : 2.67 pounds
- Dimensions : 7.5 x 1.67 x 9.25 inches
- Best Sellers Rank: #1,085,610 in Books (See Top 100 in Books)
- #258 in Database Storage & Design
- #493 in Data Mining (Books)
- #813 in Data Processing
- Customer Reviews:
About the author

Stefanie Molin is a software engineer and data scientist at Bloomberg in New York City, where she tackles tough problems in information security, particularly those revolving around data wrangling/visualization, building tools for gathering data, and knowledge sharing. She holds a bachelor’s of science degree in operations research from Columbia University's Fu Foundation School of Engineering and Applied Science. She is currently pursuing a master’s degree in computer science, with a specialization in machine learning, from Georgia Tech. In her free time, she enjoys traveling the world, inventing new recipes, and learning new languages spoken among both people and computers.
Related products with free delivery on eligible orders
Customer reviews
Customer Reviews, including Product Star Ratings help customers to learn more about the product and decide whether it is the right product for them.
To calculate the overall star rating and percentage breakdown by star, we don’t use a simple average. Instead, our system considers things like how recent a review is and if the reviewer bought the item on Amazon. It also analyzed reviews to verify trustworthiness.
Learn more how customers reviews work on Amazon-
Top reviews
Top reviews from the United States
There was a problem filtering reviews right now. Please try again later.
As a Python programmer Learning Pandas I also appreciate a lot the quality of the code. Usually the code you find in data science examples get things done but it’s far from being of good quality. It’s very refreshing to see good examples written in a very pythonic way.
I keep this book next to my desk and I have the feeling I will be coming back to use it as a reference a lot in the coming weeks. The only way this could be improved would be to have the author sit next to you every time you need to use Pandas :)
I can't say enough good things about this book and about how effective it is. I started 10 days ago with a skill of python but no data science background. Now, nearly two weeks later, I'm making my scripts do all kinds of things that are complicated if you don't know Pandas e.g., aggregate statistics based on duration (day, week, ..). I genuinely found this book and the exercises a truly beneficial experience/resource. Honestly I usually don't trust this publisher. Most Packt books I own I think weren't worth their price. This book is surprisingly good quality. Finally they have a good author.


