Data Wrangling with Python: Tips and Tools to Make Your Life Easier 1st Edition
| Jacqueline Kazil (Author) Find all the books, read about the author, and more. See search results for this author |
Use the Amazon App to scan ISBNs and compare prices.
How do you take your data analysis skills beyond Excel to the next level? By learning just enough Python to get stuff done. This hands-on guide shows non-programmers like you how to process information thatâ??s initially too messy or difficult to access. You don't need to know a thing about the Python programming language to get started.
Through various step-by-step exercises, youâ??ll learn how to acquire, clean, analyze, and present data efficiently. Youâ??ll also discover how to automate your data process, schedule file- editing and clean-up tasks, process larger datasets, and create compelling stories with data you obtain.
- Quickly learn basic Python syntax, data types, and language concepts
- Work with both machine-readable and human-consumable data
- Scrape websites and APIs to find a bounty of useful information
- Clean and format data to eliminate duplicates and errors in your datasets
- Learn when to standardize data and when to test and script data cleanup
- Explore and analyze your datasets with new Python libraries and techniques
- Use Python solutions to automate your entire data-wrangling process
Frequently bought together

- +
- +
Customers who viewed this item also viewed
From the brand
-
Sharing the knowledge of experts
O'Reilly's mission is to change the world by sharing the knowledge of innovators. For over 40 years, we've inspired companies and individuals to do new things (and do them better) by providing the skills and understanding that are necessary for success.
Our customers are hungry to build the innovations that propel the world forward. And we help them do just that.
-
Editorial Reviews
About the Author
Katharine Jarmul is a Python developer who enjoys data analysis and acquisition, web scraping, teaching Python and all things Unix. She has worked at small and large start ups before starting her consulting career overseas. Originally from Los Angeles, she learned Python while working at the Washington Post in 2008. As one of the founders of PyLadies (http://pyladies.org/), Katharine hopes to promote diversity in Python and other open source languages through education and training. She has led numerous workshops and tutorials ranging from beginner to advanced topics in Python. For more information on upcoming trainings, reach out to her on Twitter (http://twitter.com/kjam) or her her web site (http://kjamistan.com/).
Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.
Product details
- Publisher : O'Reilly Media; 1st edition (March 15, 2016)
- Language : English
- Paperback : 505 pages
- ISBN-10 : 1491948817
- ISBN-13 : 978-1491948811
- Item Weight : 1.93 pounds
- Dimensions : 7 x 1.02 x 9.19 inches
- Best Sellers Rank: #1,166,076 in Books (See Top 100 in Books)
- #706 in Data Mining (Books)
- #749 in Data Modeling & Design (Books)
- #991 in Database Storage & Design
- Customer Reviews:
About the author

Discover more of the author’s books, see similar authors, read author blogs and more
Customer reviews
Customer Reviews, including Product Star Ratings help customers to learn more about the product and decide whether it is the right product for them.
To calculate the overall star rating and percentage breakdown by star, we don’t use a simple average. Instead, our system considers things like how recent a review is and if the reviewer bought the item on Amazon. It also analyzed reviews to verify trustworthiness.
Learn more how customers reviews work on AmazonTop reviews from the United States
There was a problem filtering reviews right now. Please try again later.
otherwise if you don't mind using python 2.x, this seems like a good book
First and foremost, it is well written: the grammar and punctuation are correct, the example code and explanations are clear and insightful. Most computer books are agonizing to read, but this was a joy. Although it does not provide comprehensive examples of all the modules and data structures, which one might use in data wrangling, it does provide an example for each type of data you might want to import: CSV, JSON, XML, Excel and PDF. Plus there are chapters on using web scraping, APIs, and both Relational and NoSQL databases. Moreover, each example is explained with a step-by-step narrative, that shows the beginner not just how to do it, but how to think about working through the process. This is why I recommended the book to colleague, who is an atmospheric scientist learning to program with python.








