Bioinformatics Data Skills: Reproducible and Robust Research with Open Source Tools 1st Edition
| Vince Buffalo (Author) Find all the books, read about the author, and more. See search results for this author |
Use the Amazon App to scan ISBNs and compare prices.
Rather than teach bioinformatics as a set of workflows that are likely to change with this rapidly evolving field, this book demsonstrates the practice of bioinformatics through data skills. Rigorous assessment of data quality and of the effectiveness of tools is the foundation of reproducible and robust bioinformatics analysis. Through open source and freely available tools, you'll learn not only how to do bioinformatics, but how to approach problems as a bioinformatician.
- Go from handling small problems with messy scripts to tackling large problems with clever methods and tools
- Focus on high-throughput (or "next generation") sequencing data
- Learn data analysis with modern methods, versus covering older theoretical concepts
- Understand how to choose and implement the best tool for the job
- Delve into methods that lead to easier, more reproducible, and robust bioinformatics analysis
Customers who viewed this item also viewed
From the brand
-
Sharing the knowledge of experts
O'Reilly's mission is to change the world by sharing the knowledge of innovators. For over 40 years, we've inspired companies and individuals to do new things (and do them better) by providing the skills and understanding that are necessary for success.
Our customers are hungry to build the innovations that propel the world forward. And we help them do just that.
-
Editorial Reviews
About the Author
Don't have a Kindle? Compra tu Kindle aquí, or download a FREE Kindle Reading App.
Product details
- Publisher : O'Reilly Media; 1st edition (August 18, 2015)
- Language : English
- Paperback : 536 pages
- ISBN-10 : 1449367372
- ISBN-13 : 978-1449367374
- Item Weight : 1.9 pounds
- Dimensions : 7 x 1.09 x 9.19 inches
- Best Sellers Rank: #420,140 in Books (See Top 100 in Books)
- #76 in Bioinformatics (Books)
- #302 in Biochemistry (Books)
- #906 in Software Development (Books)
- Customer Reviews:
About the author

Discover more of the author’s books, see similar authors, read author blogs and more
Customer reviews
Customer Reviews, including Product Star Ratings help customers to learn more about the product and decide whether it is the right product for them.
To calculate the overall star rating and percentage breakdown by star, we don’t use a simple average. Instead, our system considers things like how recent a review is and if the reviewer bought the item on Amazon. It also analyzed reviews to verify trustworthiness.
Learn more how customers reviews work on AmazonTop reviews from the United States
There was a problem filtering reviews right now. Please try again later.
The author is very adept at leading the audience into a topic in a step-by-step fashion, probably due to his experience in interacting with people who want to get into bioinformatics but only have very limited knowledge of unix and programing. For instance, although he pointed out that SQL was beyond the scope of this book, he nonetheless introduced a series of sqlite operations that should suffice for most research projects except those aiming at providing complex database for large group.
Highly recommended for scientists with background in biology but want to get into data analysis.
The negative I would say is that I've had trouble motivating myself to go through the middle third of the book ("Introduction to R" and "Range Data") chapters. While proper programmers can and do float easily among languages, and I strongly appreciate the Unix chapters, it feels like people fall more into R or Python camps, and while I could be wrong, I don't feel like I should really need to add R at this time.
Pros: great range of topics covered: bash shell, git, ssh, working with data, data compression
Cons: the author is a little long winded and and it does not have great examples
Top reviews from other countries
However, the book doesn't have that much info on Bioinformatics. 70% of the book is about Linux command line, git source control and very basic R. So if you are a comp sci or a programmer then chances that you know all this stuff are pretty high.
The actual bioinformatics read starts at the end of the book.
I was hoping to see at least some info on de novo assembly, sequence alignment, NGS data processing, Galaxy and BioPython.
If you are a tech savvy then I recommend: Bioinformatics: A Practical Handbook Of Next Generation Sequencing And Its Applications
This book has a good listing and a brief explanation of all necessary tools to conduct a full analysis.
So if you are doing next generation sequence analysis and if you want to know good practice then this is a great book, but if you want to get a complete range of bioinformatics data skills then this is going to be one of the many books that you need. The first half is definitely worth reading for ALL bioinformaticians but the second half needs a firmer direction.









