Have one to sell? Sell yours here
UNIX Fault Management: A Guide for System Administrators
 
See larger image
 
Tell the Publisher!
I'd like to read this book on Kindle

Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.

UNIX Fault Management: A Guide for System Administrators [Paperback]

Brad Stone (Author), Julie Symons (Author)
3.5 out of 5 stars  See all reviews (2 customer reviews)


Available from these sellers.



Book Description

Hewlett-Packard Professional Books December 10, 1999
If you're responsible for maintaining the integrity and availability of a mission-critical UNIX system, this is the first book that brings together all the information you need most. UNIX Fault Management Administrator's Handbook describes exactly how to implement appropriate, cost-effective system monitoring on any UNIX server, including systems configured as high availability clusters. You'll find detailed descriptions of fault monitoring tools and monitoring frameworks to help you make better purchasing decisions; a detailed overview of the monitoring tasks operators perform; and specific techniques for investigating and recovering from problems. The book includes coverage of monitoring systems, disks, networks, applications, and databases, as well as specific fault management techniques for large-scale enterprises.

Editorial Reviews

From the Inside Flap

Preface

This book is intended for system administrators and operators who are responsible for maintaining the integrity and availability of mission-critical UNIX systems. The book provides a description of the fault monitoring tools and techniques available for UNIX servers, including systems that are configured as high availability clusters.

This book can therefore be a handy quick reference for an operator trying to troubleshoot a problem in the customer environment, by pointing out where to find key diagnostic messages and describing how to take recovery actions.

A system administrator responsible for the initial configuration and administration of UNIX systems will also find this book useful because it describes the procedures to follow to set up the appropriate levels of system monitoring. The product descriptions can also help in making purchasing decisions as the customer determines the appropriate amount of event monitoring needed in their environment.

An overview of the tasks performed by an operator is provided, with details on how events are received and processed. The remainder of the book focuses on the types of events that can be received, how they are detected, how operators receive event notifications, and how problems can be investigated and recovery performed. The goal is to introduce the necessary tools, but not to show how every possible problem can be solved.

This book provides numerous descriptions of how fault management tools and products can be used to solve a variety of problems. Many of the chapters are focused on specific computer components, such as disks or databases, to be helpful to operators with specific roles. Here is a description of the individual chapters:

Chapter 1, "Analyzing the Role of System Operators," describes the tasks performed by a system operator and the evolution of fault management.

Chapter 2, "Enumerating Possible Events," describes the various types of events that are interesting to monitor on a UNIX system.

Chapter 3, "Using Monitoring Frameworks," describes monitoring frameworks and the administrative tasks that must be done before they can be used.

Chapter 4, "Monitoring the System," describes the tools and products used to monitor the UNIX server.

Chapter 5, "Monitoring the Disks," describes the tools and products used to monitor external disk devices.

Chapter 6, "Monitoring the Network," provides an overview of the many tools available for detecting problems and events related to the use of the network.

Chapter 7, "Monitoring the Application," describes methods for monitoring the response times and availability of critical applications.

Chapter 8, "Monitoring the Database," focuses specifically on tools to detect problems and events related to database usage.

Chapter 9, "Enterprise Management," discusses the problems with trying to deal with fault management for the large-scale customer enterprise.

Chapter 10, "UNIX Futures," discusses the future plans of some of the major UNIX system vendors in the area of fault management.

Appendix A, "Standards," describes fault management standards that have emerged and how you can benefit from them.

The Glossary contains the important terms used in the book, and their definitions.

Although it is assumed that most customers concerned about fault management will implement high availability solutions, this book does not describe how to create highly available computing environments. Readers needing additional information on high availability may check Hewlett-PackardÕs external Web site on high availability (hp/go/ha) or read Clusters for High Availability by Peter Weygant.

In general, this book does not discuss the configuration and installation of the hardware and software components of your UNIX system. You should rely on your vendorsÕ product manuals for this.

Many of the examples used in this book were created on HP-UX servers. Other UNIX platforms behave similarly, and we note when tools are supported only on certain UNIX platforms.

From the Back Cover

Maximize UNIX system integrity and availability in mission-critical environments!

If you're responsible for maintaining the integrity and availability of a mission-critical UNIX system, then you need UNIX Fault Management: A Guide for System Administrators, the first book that brings together all of the monitoring and fault management information. Expert UNIX system management engineers Brad Stone and Julie Symons show you exactly how to implement appropriate, cost-effective system monitoring on any UNIX server -- including systems configured as high availability clusters. You'll learn how to:

  • Plan for-and establish-cost-effective, reliable system monitoring procedures
  • Monitor systems, disks, networks, applications, and databases
  • Detect, investigate, and recover from server problems
  • Implement best practices for high availability in enterprise-class UNIX installations-including clusters
  • Take advantage of key fault management trends, new standards, and new technologies

This book contains detailed descriptions of fault monitoring tools and monitoring frameworks to help you make better purchasing decisions. You'll also find a handy quick reference of monitoring tasks and techniques for operators -- including specific, step-by-step recovery solutions. If you can't afford one nanosecond more downtime than necessary, you can't afford to be without UNIX Fault Management.


Product Details

  • Paperback: 400 pages
  • Publisher: Prentice Hall PTR; 1st edition (December 10, 1999)
  • Language: English
  • ISBN-10: 013026525X
  • ISBN-13: 978-0130265258
  • Product Dimensions: 9.2 x 7 x 0.7 inches
  • Shipping Weight: 1.3 pounds
  • Average Customer Review: 3.5 out of 5 stars  See all reviews (2 customer reviews)
  • Amazon Best Sellers Rank: #3,662,340 in Books (See Top 100 in Books)

More About the Author

Discover books, learn about writers, read author blogs, and more.

 

Customer Reviews

2 Reviews
5 star:
 (1)
4 star:    (0)
3 star:    (0)
2 star:
 (1)
1 star:    (0)
 
 
 
 
 
Average Customer Review
3.5 out of 5 stars (2 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

2 of 2 people found the following review helpful:
2.0 out of 5 stars Not up to date, June 7, 2003
By 
Vinicio Valencia (MIAMI, FLORIDA United States) - See all my reviews
(REAL NAME)   
This review is from: UNIX Fault Management: A Guide for System Administrators (Paperback)
This book has an excellent style for explaining you all the things related to fault management with UNIX plattforms and dealing with databases. There's a LOT of information about SNMP and MIB's. But it's approach to fault management is primarly on tools, and the tools described are out of date or even some of those companies no longer exists (like platinum, bought by CA); the scripts provided are somehow simple; the database fault management techniques described are very, very generic and basic. Perhaps it can give you a basic idea and starting point to to this.
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No


3 of 12 people found the following review helpful:
5.0 out of 5 stars Best bet to learn about Unix Fault Management, May 27, 2000
By 
Surya Kiran (Chennai, India) - See all my reviews
This review is from: UNIX Fault Management: A Guide for System Administrators (Paperback)
The book is simply superb....one must have this book at his/her disposal in order to learn thoroughly about Fault Management of Unix Systems
Help other customers find the most helpful reviews 
Was this review helpful to you? Yes No

Share your thoughts with other customers: Create your own review
 
 
 
Only search this product's reviews



Tag this product

 (What's this?)
Think of a tag as a keyword or label you consider is strongly related to this product.
Tags will help all customers organize and find favorite items.
Your tags: Add your first tag
 

Sell a Digital Version of This Book in the Kindle Store

If you are a publisher or author and hold the digital rights to a book, you can sell a digital version of it in our Kindle Store. Learn more

Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 

Search Customer Discussions
Search all Amazon discussions
   


Listmania!


Create a Listmania! list

So You'd Like to...


Create a guide


Look for Similar Items by Category


Look for Similar Items by Subject