Other Sellers on Amazon
+ $3.99 shipping
91% positive over last 12 months
& FREE Shipping
97% positive over last 12 months
+ $3.99 shipping
91% positive over last 12 months
Usually ships within 4 to 5 days.
Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required. Learn more
Read instantly on your browser with Kindle for Web.
Using your mobile phone camera - scan the code below and download the Kindle app.
Follow the Authors
OK
Data Science (The MIT Press Essential Knowledge series) Illustrated Edition
| Price | New from | Used from |
|
Audible Audiobook, Unabridged
"Please retry" |
$0.00
| Free with your Audible trial | |
|
Audio CD, Audiobook, CD, Unabridged
"Please retry" | $29.99 | — |
Enhance your purchase
The goal of data science is to improve decision making through the analysis of data. Today data science determines the ads we see online, the books and movies that are recommended to us online, which emails are filtered into our spam folders, and even how much we pay for health insurance. This volume in the MIT Press Essential Knowledge series offers a concise introduction to the emerging field of data science, explaining its evolution, current uses, data infrastructure issues, and ethical challenges.
It has never been easier for organizations to gather, store, and process data. Use of data science is driven by the rise of big data and social media, the development of high-performance computing, and the emergence of such powerful methods for data analysis and modeling as deep learning. Data science encompasses a set of principles, problem definitions, algorithms, and processes for extracting non-obvious and useful patterns from large datasets. It is closely related to the fields of data mining and machine learning, but broader in scope. This book offers a brief history of the field, introduces fundamental data concepts, and describes the stages in a data science project. It considers data infrastructure and the challenges posed by integrating data from multiple sources, introduces the basics of machine learning, and discusses how to link machine learning expertise with real-world problems. The book also reviews ethical and legal issues, developments in data regulation, and computational approaches to preserving privacy. Finally, it considers the future impact of data science and offers principles for success in data science projects.
- ISBN-100262535432
- ISBN-13978-0262535434
- EditionIllustrated
- PublisherThe MIT Press
- Publication dateApril 13, 2018
- LanguageEnglish
- Dimensions5 x 0.57 x 6.9 inches
- Print length280 pages
![]() |
Frequently bought together

- +
- +
Customers who viewed this item also viewed
Editorial Reviews
About the Author
Product details
- Publisher : The MIT Press; Illustrated edition (April 13, 2018)
- Language : English
- Paperback : 280 pages
- ISBN-10 : 0262535432
- ISBN-13 : 978-0262535434
- Item Weight : 8.8 ounces
- Dimensions : 5 x 0.57 x 6.9 inches
- Best Sellers Rank: #99,443 in Books (See Top 100 in Books)
- #58 in Data Processing
- #78 in Computer Hacking
- #1,214 in Engineering (Books)
- Customer Reviews:
About the authors

John Kelleher is the Academic Leader of the Information, Communication and Entertainment research institute at the Technological University Dublin. His areas of expertise include artificial intelligence, data analytics and machine learning, natural language processing, spatial cognition, and text analytics. John has worked in a number of different academic and research focused institutes, including Dublin City University, Media Lab Europe, and DFKI (the German Centre for Artificial Intelligence Research).

Brendan Tierney, Oracle ACE Director, is an independent consultant and lectures on Data Mining and Advanced Databases in the Dublin Institute of Technology in Ireland. He has 23+ years of extensive experience working in the areas of Data Mining, Data Warehousing, Data Architecture and Database Design. Brendan has worked on projects in Ireland, UK, across Europe, Canada and USA. Brendan is the editor of the UKOUG Oracle Scene magazine and deputy chair of the OUG Ireland BI SIG. Brendan is a regular speaker at conferences across Europe and the USA and has written technical articles for OTN, Oracle Scene, IOUG SELECT Journal and ODTUG Technical Journal.
twitter : @brendantierney
blog : www.oralytics.com
Customer reviews
Customer Reviews, including Product Star Ratings help customers to learn more about the product and decide whether it is the right product for them.
To calculate the overall star rating and percentage breakdown by star, we don’t use a simple average. Instead, our system considers things like how recent a review is and if the reviewer bought the item on Amazon. It also analyzed reviews to verify trustworthiness.
Learn more how customers reviews work on Amazon-
Top reviews
Top reviews from the United States
There was a problem filtering reviews right now. Please try again later.
-History
-Applications (Prediction, clustering, anomaly detection)
-Tools of Data Science (Bayes Rule, Logistic Regression, Neural Networks, Decision Trees)
-Ethical concerns (Where do we cross the line between privacy, security and applications of the Data Science?)
-Growth of Data Science (I wish the authors would've shared how to get into the career field more. Since applying association rule here, anyone that reads the book is likely to be interested in Data Science).
Top reviews from other countries
It covers the recent literature on such computational methods from, the current applications and the challenges behind Data Science. The book also talks about the various types of data along with the use cases like nominal/ordinal (categorical) and numeric data. Eventually, getting to what I think is the best chapter in the book is 'Machine Learning 101', which easily explains the types of what's the difference between supervised learning (classification/regression problems) and unsupervised learning (clustering, segmentation etc.). Only Maths (Algebra/statistics) up to high school/college level is needed to understand the principles of how most of the algorithms are set-up.
The only thing I think this book was disappointing at was the explanation of Deep Learning, which I feel was slightly brushed over compared to Machine Learning, when in some way, Deep Learning may have deserved its own chapter.
Finally, the book ended on the legislation side of Data Ethics, such as GDPR and the trade-off between accurate analysis and privacy among users of the internet/digital applications, again illustrating the future path for Data Science.
I would recommend this book as a handy Data Science reference.









