OSU Resource Monitoring - LifeKeeper for Linux LIVE - 9.8.1

LifeKeeper for Linux
LifeKeeper for Linux Release Notes
- IMPORTANT NOTICES
- Overview
- New Features
- Bug Fixes / Hotfixes
- Discontinued Features
- LifeKeeper Components
- System Requirements
- Storage and Adapter Options
- Open Source Packages
- Known Issues
- Technical Notes
- Upgrades
LifeKeeper for Linux Getting Started Guide
LifeKeeper for Linux Installation Guide
- Software Packaging
- Planning Your LifeKeeper Environment
- Setting Up Your LifeKeeper Environment
- Installing the Software
- How to Use Setup Scripts
- Verifying the LifeKeeper Installation
- Upgrading LifeKeeper
- Upgrading the OS / Kernel on a node with LifeKeeper (OS Patching)
LifeKeeper for Linux Technical Documentation
- Documentation and Training
- lkbackup
- LifeKeeper
- DataKeeper
- Command Line Interface
Application Recovery Kits
- Apache Recovery Kit Administration Guide
  - LifeKeeper Documentation and Apache References
  - Apache Recovery Kit Requirements
  - Configuring Apache Web Server with LifeKeeper
    - Configuration Definitions and Examples
      - Active/Standby and Active/Active Configurations
    - Configuration Considerations for Apache Web Server
  - LifeKeeper Configuration Tasks for Apache
  - Apache Web Server Troubleshooting
  - Apache Recovery Kit Operations Overview
- DB2 Recovery Kit Administration Guide
  - DB2 Documentation and References
  - DB2 Recovery Kit Hardware and Software Requirements
  - DB2 Recovery Kit Overview
  - Configuring the LifeKeeper for Linux DB2 Recovery Kit
  - LifeKeeper for Linux DB2 Recovery Kit Configuration Tasks
  - DB2 Troubleshooting
  - Setting Up DB2 to use Raw I/O
- Recovery Kit for EC2™ Administration Guide
  - Recovery Kit for EC2™ Requirements
  - Recovery Kit for EC2™ Configuration Examples
  - Recovery Kit for EC2™ Overview
  - Recovery Kit for EC2™ Configuration
- LB Health Check Kit Administration Guide
  - Configuration Examples
  - Upgrading Generic Application Kit for Load Balancer Health Checks to LB Health Check Kit
  - Creating/Extending/Modifying a Resource
  - Operations Overview
  - Setting up LB Health Check from the Command Line (LKCLI)
  - Tuning Load Balancer Health Check Parameters
- Logical Volume Manager Recovery Kit Administration Guide
  - Documentation and References
  - Logical Volume Manager Recovery Kit Requirements
    - Logical Volume Manager Hardware and Software Requirements
  - Overview
    - Logical Volume Manager Recovery Kit Notes and Restrictions
  - LifeKeeper Logical Volume Manager Hierarchy Creation and Administration
  - Logical Volume Manager Troubleshooting
  - LVM Recovery Kit Operations Overview
- IP Recovery Kit Administration Guide
  - IP Recovery Kit Principles of Operation
  - IP Recovery Kit Requirements
  - IP Recovery Kit Configuration
  - IP Recovery Kit (IPv4) Operations Overview
- Recovery Kit for MySQL Administration Guide
  - Recovery Kit for MySQL Hardware and Software Requirements
  - Recovery Kit for MySQL Configuration
  - Managing MySQL Resource Hierarchies
  - MySQL Troubleshooting
  - Recovery Kit for MySQL Operations Overview
- WebSphere MQ Recovery Kit Administration Guide
  - MQ Recovery Kit Abbreviations
  - MQ Recovery Kit Requirements
  - Appendix A – Sample mqs.ini Configuration File
  - Appendix B – Sample qm.ini Configuration File
  - Appendix C – WebSphere MQ Configuration Sheet
- NAS Recovery Kit Administration Guide
  - NAS Documentation and References
  - NAS Recovery Kit Hardware and Software Requirements
  - NAS Recovery Kit Overview
  - Configuring the LifeKeeper for Linux NAS Recovery Kit
    - NAS Configuration Considerations
    - NAS Configuration Examples
  - LifeKeeper Configuration Tasks for NAS
  - NAS Troubleshooting
    - NAS Error Messages
    - LifeKeeper GUI Related Errors
- NFS Server Recovery Kit Administration Guide
  - NFS Server Recovery Kit Overview
  - NFS Server Recovery Kit Requirements
  - NFS Server Recovery Kit Configuration Considerations
- Recovery Kit for Oracle Cloud Infrastructure Administration Guide
  - Principles of Operation
  - Resource Monitoring and Local Recovery
  - Requirements
  - Recovery Kit for Oracle Cloud Infrastructure Notes
  - Configuration
  - Troubleshooting
    - Known Issues / Restrictions
    - Error Messages
- Oracle Recovery Kit Administration Guide
  - Oracle Recovery Kit Hardware and Software Requirements
  - Configuring Oracle with LifeKeeper
  - LifeKeeper Configuration Tasks for Oracle
  - Oracle Troubleshooting
    - Oracle Known Issues and Restrictions
  - Oracle Appendix
- PostgreSQL Recovery Kit Administration Guide
  - PostgreSQL Resource Hierarchy
  - PostgreSQL Hardware and Software Requirements
  - PostgreSQL Configuration Considerations
    - Protecting PostgreSQL Best Practices
    - Using Mirrored File Systems with DataKeeper
  - PostgreSQL Installation
  - PostgreSQL Administration
  - PostgreSQL Troubleshooting
    - PostgreSQL General Tips
    - PostgreSQL Tunables
- Postfix Recovery Kit Administration Guide
  - Postfix Hardware and Software Requirements
    - Postfix Recovery Kit Installation
  - Configuring the LifeKeeper for Linux Postfix Recovery Kit
  - Postfix Configuration Validation
    - LifeKeeper Configuration Tasks for Postfix
  - Postfix Troubleshooting
- Quick Service Protection (QSP) Recovery Kit
- Recovery Kit for Route 53™ Administration Guide
  - Recovery Kit for Route 53™ Requirements
  - Example Configurations
  - Recovery Kit for Route 53™ Configuration
  - Recovery Kit for Route 53™ Troubleshooting
  - Recovery Kit for Route 53™ Operations Overview
- Samba Recovery Kit Administration Guide
  - Samba Recovery Kit Requirements
  - Samba Recovery Kit Installation
  - Samba Recovery Kit Overview
  - Configuring Samba with LifeKeeper
  - Samba Configuration Steps
  - LifeKeeper Configuration Tasks for Samba
  - Samba Hierarchy Administration
    - Modifying the Samba Configuration File
    - Maintaining the smvpasswd File
  - Samba Troubleshooting
- SAP Recovery Kit Administration Guide
  - SAP Abbreviations and Definitions
  - LifeKeeper – SAP Icons
  - SAP Recovery Kit Overview
  - LifeKeeper SAP Solution Page
  - SAP Hardware and Software Requirements
  - SAP Configuration Considerations
  - SAP Installation
  - SAP Administration
  - SAP Troubleshooting
  - SAP Maintenance Mode
  - Custom and Maintenance-Mode Behavior via Policies
  - tset Errors Appear in the LifeKeeper Log File
- SAP HANA Recovery Kit Administration Guide
  - Upgrading from pre-9.7.0
  - Upgrading the SAP HANA Database
  - SAP HANA Supported Configurations
  - SAP HANA Recovery Kit Hardware and Software Requirements
  - SAP HANA Recovery Kit Overview
  - Configuring SAP HANA with LifeKeeper
  - SAP HANA Resource Configuration Tasks
  - SAP HANA Resource Hierarchy Administration
  - SAP HANA Troubleshooting
- SAP MaxDB Recovery Kit Administration Guide
  - SAP MaxDB Recovery Kit Hardware and Software Requirements
  - SAP MaxDB Recovery Kit Overview
    - SAP MaxDB Resource Hierarchy
  - SAP MaxDB Configuration Considerations
  - Configuring SAP MaxDB with LifeKeeper
  - SAP MaxDB Resource Configuration Tasks
  - SAP MaxDB Resource Hierarchy Administration
  - SAP MaxDB Troubleshooting
    - SAP MaxDB Recovery Kit Error Messages
  - Appendix – Creating Device Spaces Using Raw I/O with SAP MaxDB
    - Raw I/O Setup Steps
    - Adding a Device Space after Creating a Hierarchy
- Sybase ASE Recovery Kit Administration Guide
  - Sybase ASE Recovery Kit Overview
  - Sybase ASE Recovery Kit Hardware and Software Requirements
  - Sybase ASE Recovery Kit Configuration Considerations
  - Installing and Configuring Sybase ASE with LifeKeeper
  - Sybase ASE Recovery Kit Administration
  - Appendix – Creating Device Spaces Using Raw I/O with Sybase ASE
- VMDK Shared Storage Recovery Kit Administration Guide
  - VMDK Documentation and References
  - VMDK Hardware and Software Requirements
  - VMDK Recovery Kit Overview
  - Configuring the VMDK Recovery Kit
    - VMDK Configuration Considerations
    - VMDK Configuration Examples
  - LifeKeeper VMDK Recovery Kit Configuration Tasks
  - VMDK Troubleshooting
    - VMDK Error Messages
Parameters List
- EC2 Parameters List
- IP Parameters List
- LB Health Check Parameters List
- MQ Parameters List
- NFS Parameters List
- Recovery Kit for Oracle Cloud Infrastructure Parameters List
- Oracle Parameters List
- PostgreSQL Parameters List
- Quorum Parameters List
- Route53 Parameters List
- SAP Parameters List
- DataKeeper Parameters List
- Standby Node Health Check Parameters List
- SAP HANA Parameters List
- SAP MaxDB Parameters List
Search for an Error Code
- Combined Message Catalog
LifeKeeper for Linux Support Matrix
Supported Storage
Evaluation Guides
- DataKeeper for Linux Evaluation Guide
- LifeKeeper Evaluation Guide for Cloud Environments
Quick Start Guides
- AWS Direct Connect Quick Start Guide
- Microsoft Azure Quick Start Guide
- Connection Between LifeKeeper Cluster and Clients Using AWS Transit Gateway Quick Start Guide
- Multi-VPC Cluster Configuration Using AWS VPC Peering Connections Quick Start Guide
- Apache/MySQL Cluster Using Both Shared and Replicated Storage
LifeKeeper Single Server Protection
- LifeKeeper Single Server Protection for Linux Release Notes
- LifeKeeper Single Server Protection for Linux Introduction
- LifeKeeper Single Server Protection for Linux Installation Guide
- LifeKeeper Single Server Protection for Linux Technical Documentation
Product Support Schedule
LifeKeeper Web Management Console (LKWMC)
- Architecture
- System Requirements
- Getting Started
- LKWMC GUI Operations and Layout
- Known Issues and Restrictions

Download as PDF

For each out-of-service (OSU) resource, lkcheck periodically calls the OSUquickCheck script for the resource. The OSUquickCheck script performs a quick health check for the resource. If it determines that the resource cannot start successfully, it changes the state of the resource to OSF and sends this information to the administrator by email or SNMP event forwarding. This monitoring is performed at the same interval as the normal LifeKeeper resource monitoring (/etc/default/LifeKeeper setting LKCHECKINTERVAL).

Monitored Resources

The following can be monitored with OSU Resource Monitoring:

Resource Name	Monitoring Details
IP Resource	Verify the NIC link is up (disable with /etc/default/LifeKeeper setting IP_NOLINKCHECK=1). Also, verify network reachability (if a ping list is configured).
Disk or DMMP resource(s)	Verify that the paths to the monitored disk are functional by using commands for each resource.
NAS Resource	Verify that NFS access is available for the NFS server. Refer to NAS Configuration Considerations for information on the timeout value for NFS access.

OSU Resource Monitoring Configuration

Set the SNHC_IPCHECK and SNHC_DISKCHECK settings in the /etc/default/LifeKeeper configuration file. You may also need to configure the following setting. See Standby Node Health Check Parameters List for details.

SNHC_IPCHECK_SLEEPTIME

Recovery from Failure

If an error is detected during OSU resource monitoring, the state of the corresponding resource is changed to OSF (out of service with failure). When the status is changed, OSU resource monitoring is no longer performed for the resource. After checking the details of the notified failure and addressing it, you should change the resource state to OSU. The state can be changed from OSF to OSU using the following command:

/opt/LifeKeeper/lkadm/bin/retstate <resource tag>

Node Monitoring

LifeKeeper Administration Overview

Feedback

Post your comment on this topic.