Amazon.com: Fault Tolerance in Distributed Systems (0076092031703): Pankaj Jalote: Books


or
Sign in to turn on 1-Click ordering.
or
Amazon Prime Free Trial required. Sign up when you check out. Learn More
More Buying Choices
Have one to sell? Sell yours here
Fault Tolerance in Distributed Systems
 
See larger image
 
Tell the Publisher!
I'd like to read this book on Kindle

Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.

Fault Tolerance in Distributed Systems [Paperback]

Pankaj Jalote (Author)
4.0 out of 5 stars  See all reviews (2 customer reviews)

List Price: $74.67
Price: $59.13 & this item ships for FREE with Super Saver Shipping. Details
You Save: $15.54 (21%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.
Only 1 left in stock--order soon (more on the way).
Want it delivered Tuesday, February 28? Choose One-Day Shipping at checkout. Details
Textbook Student FREE Two-Day Shipping for students on millions of items. Learn more


Book Description

April 16, 1994 0133013677 978-0133013672 1

Fault tolerance is an approach by which reliability of a computer system can be increased beyond what can be achieved by traditional methods. While hardware supported fault tolerance has been well-documented, the newer, software supported fault tolerance techniques have remained scattered throughout the literature. Comprehensive and self-contained, this book organizes that body of knowledge with a focus on fault tolerance in distributed systems. (The uniprocess case is treated as a special case of distributed systems.) KEY TOPICS: Treats fault tolerant distributed systems as consisting of levels of abstraction, providing different tolerant services. For researchers/practitioners working in the area of fault tolerance.


Frequently Bought Together

Customers buy this book with Fault-Tolerant Systems $62.08

Fault Tolerance in Distributed Systems + Fault-Tolerant Systems
Price For Both: $121.21

Show availability and shipping details

  • This item: Fault Tolerance in Distributed Systems

    In Stock.
    Ships from and sold by Amazon.com.
    This item ships for FREE with Super Saver Shipping. Details

  • Fault-Tolerant Systems

    In Stock.
    Ships from and sold by Amazon.com.
    This item ships for FREE with Super Saver Shipping. Details


Customers Who Bought This Item Also Bought


Editorial Reviews

From the Publisher

Fault tolerance is an approach by which reliability of a computer system can be increased beyond what can be achieved by traditional methods. While hardware supported fault tolerance has been well-documented, the newer, software supported fault tolerance techniques have remained scattered throughout the literature. Comprehensive and self-contained, this book organizes that body of knowledge with a focus on fault tolerance in distributed systems. (The uniprocess case is treated as a special case of distributed systems.)

From the Back Cover

Fault tolerance is an approach by which reliability of a computer system can be increased beyond what can be achieved by traditional methods. While hardware supported fault tolerance has been well-documented, the newer, software supported fault tolerance techniques have remained scattered throughout the literature. Comprehensive and self-contained, this book organizes that body of knowledge with a focus on fault tolerance in distributed systems. (The uniprocess case is treated as a special case of distributed systems.) Treats fault tolerant distributed systems as consisting of levels of abstraction, providing different tolerant services. For researchers/practitioners working in the area of fault tolerance.

Product Details

  • Paperback: 448 pages
  • Publisher: Prentice Hall; 1 edition (April 16, 1994)
  • Language: English
  • ISBN-10: 0133013677
  • ISBN-13: 978-0133013672
  • Product Dimensions: 9.6 x 7.3 x 1 inches
  • Shipping Weight: 1.6 pounds (View shipping rates and policies)
  • Average Customer Review: 4.0 out of 5 stars  See all reviews (2 customer reviews)
  • Amazon Best Sellers Rank: #1,833,112 in Books (See Top 100 in Books)

More About the Authors

Discover books, learn about writers, read author blogs, and more.

 

Customer Reviews

2 Reviews
5 star:    (0)
4 star:
 (2)
3 star:    (0)
2 star:    (0)
1 star:    (0)
 
 
 
 
 
Average Customer Review
4.0 out of 5 stars (2 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

6 of 6 people found the following review helpful:
4.0 out of 5 stars Academic textbook for a course in fault tolerance, March 25, 2007
By 
Dmitry Dvoinikov (Ekaterinburg, Russia) - See all my reviews
This review is from: Fault Tolerance in Distributed Systems (Paperback)
It's a pity that I got my hands on this book so late. It would have been much better if I got it soon after it was published in 1994, when I was in the university. Because the book is largely academic. To quote:

-- QUOTE

This book is an attempt to organize the body of knowledge in the area of software fault tolerance. ... [It] can be used as a textbook for a graduate/senior level course on fault tolerance ... or for a professional course in fault tolerance. It can also be used as a reference by researchers/practitioners ...

-- END QUOTE

The book has a nice systematic approach in that it attempts to clearly define what is a system, what is a failure and so on and so forth.

It takes the route of explaining that the distributed system is built around a set of communicating processes running on a different nodes and how redundancy, anything unnecessary and existing entirely for the ability for tolerating faults, is added.

But the big problem is - the book focuses on the joints, not on the bones, so to speak. It tells you about processes running on different computers and how they talk to each other and what can be done to ensure those conversations have certain properties. It speaks about joints - (network) protocols required for the processes to become a distributed system. In that the book reminds a lot of Tannenbaum and van Steen's "Distributed Systems: Principles and Paradigms".

But where it concerns the bones, the processes, all it says is "the process saves its state to persistent storage" or "the process recovers to the most recently established checkpoint". Uh-huh, sounds great, thanks. Needs hell lot of work to build a process in such a way that its state as a whole can be saved and restored from a stable storage. There are other problems for sure.

Eight out of nine book's chapters keep going around like that, telling you about all sorts of protocols for negotiations, clock synchronization, such and such broadcast, distributed snapshots, distributed transactions, voting and duplicating running processes. To be fair, there are quite a few interesting protocols that are nice to be familiar with, just in case.

The ninth and the last chapter in 40 pages touches writing fault-tolerant processes. It explains how redundancy can be added to the code and suggests a specific approach to cutting and wrapping all your code to special small boxes to ensure certain behaviour. Although it's difficult to argue with that, it's definetely not the only possible way.

Throughout the book, nearly everything of use quickly gets so complicated, that it's impractical for every use, and the author frequently admits that. I can see it working in calculations, numerical algorithms, even in the ever so proudly sounding aircraft control, in other words - where there is a single simple input, single simple output, simple logic, totally deterministic, no concurrency or shared state.

-- QUOTE

The schemes discussed above [...] require each process to be deterministic, i.e. given the same inputs, the process performs the same actions. Both of these assumptions do not hold, for example, in languages like CSP and Ada ...

-- END QUOTE

I'd say, it'not just CSP and Ada which do not have total determinism, but a lot of real systems too, no matter which language.

The book has next to none practical examples, you hardly ever get a name of the system which implements this or that, and may be a couple of times you get a brief description of a specific implementation. Most of the time such referencing is done in a scientific way, like "Aristole has shown this in [Ars/1378BC]". Good if you have access to the sources and/or time to look it up.

The book indeed makes a nice textbook for a course, but less so a practical reference. Although the matters discussed in it are unlikely to become obsolete, there probably are a lot of newer books on the subject. Will go look for them.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


4 of 7 people found the following review helpful:
4.0 out of 5 stars Somewhat outdated but comprehensive, October 22, 2000
By A Customer
This review is from: Fault Tolerance in Distributed Systems (Paperback)
This book is already somewhat outdated (six years) in relation to the cutting edge of fault tolerance research but it's a good and comprehensive introduction to the subject, and great to programmers looking for some understaning of fault tolerance, as commercial tools still have much to go before catching up with what is in this book. This field is getting more and more important as business systems are being moved to the internet and need to remain operarional 24/7.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No

Share your thoughts with other customers: Create your own review
 
 
 
Only search this product's reviews



Tag this product

 (What's this?)
Think of a tag as a keyword or label you consider is strongly related to this product.
Tags will help all customers organize and find favorite items.
Your tags: Add your first tag
 

Sell a Digital Version of This Book in the Kindle Store

If you are a publisher or author and hold the digital rights to a book, you can sell a digital version of it in our Kindle Store. Learn more

Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 


Active discussions in related forums
Search Customer Discussions
Search all Amazon discussions
   
Related forums


Listmania!


Create a Listmania! list

So You'd Like to...


Create a guide


Look for Similar Items by Category


Look for Similar Items by Subject