- Paperback: 126 pages
- Publisher: O'Reilly Media; 1 edition (November 5, 2011)
- Language: English
- ISBN-10: 1449309925
- ISBN-13: 978-1449309923
- Product Dimensions: 7 x 0.3 x 9.2 inches
- Shipping Weight: 7.8 ounces (View shipping rates and policies)
- Average Customer Review: 3.6 out of 5 stars See all reviews (11 customer reviews)
- Amazon Best Sellers Rank: #781,998 in Books (See Top 100 in Books)
Enter your mobile number or email address below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.
To get the free app, enter your mobile phone number.
Parallel R 1st Edition
Use the Amazon App to scan ISBNs and compare prices.
Featured Arduino & Raspberry Pi Resources
Check out these resources for all you need to know on Arduino & Raspberry Pi.
Frequently Bought Together
Data Analysis in the Distributed World
About the Author
Q Ethan McCallum is a consultant, writer, and technology enthusiast, though perhaps not in that order. His work has appeared online on The O’Reilly Network and Java.net, and also in print publications such as C/C++ Users Journal, Doctor Dobb’s Journal, and Linux Magazine. In his professional roles, he helps companies to make smart decisions about data and technology.
Stephen Weston has been working in high performance and parallelcomputing for over 25 years. He was employed at Scientific Computing Associates in the 90's, working on the Linda programming system, invented by David Gelernter. He was also a founder of Revolution Computing, leading the development of parallel computing packages for R, including nws, foreach, doSNOW, and doMC. He works at Yale University as an HPC Specialist.
Top Customer Reviews
The book describes 6 approaches to distributed computing:
The chapter starts by showing you how to create a socket cluster on a single machine (later sections discuss MPI clusters, and socket clusters of several machines). Then a section describes how to initialize workers, with a later section giving a slightly advanced discussion on how functions are serialized to workers.
There's a great demonstration (including graphs) of why/when you should use clusterApplyLB instead of clusterApply. There's also a fantastic discussion on potential I/O issues (probably one of the most surprising/confusing issues to people new to distributed computing) and how parApply handles them. Then the authors provide a very useful parApplyLB function.
There are a few (but very important!) paragraphs on random number generation using the rsprng and rlecuyer packages.
The chapter starts by noting that the multicore package only works on a single computer running a POSIX compliant operating system (i.e. most anything except Windows).
The next section describes the mclapply function, and also explains how mclapply creates a cluster each time it's called, why this isn't a speed issue, and how it is actually beneficial. The next few sections describe some of the optional mclapply arguments, and how you can achieve load balancing with mclapply.Read more ›
I would pay many tenfolds the price for more information in this book. The author is definitely an expert: I hope he writes the right book soon as there is a market for it.
R is a great tool and many of us are very interested in parallel --but this book for some will be just an appetizer.
Most Recent Customer Reviews
Very slim book. At the time the first one out there. By now, however, there must be better ones. Does anyone know?Published on September 20, 2014 by Dirk Dittmer
Single source of truth for an area not well understood by most R programmer. Useful if you are looking for speed fromR.Published on June 14, 2013 by S. Wang