Node Monitoring

If all resources on a node are out of service, LifeKeeper treats the node as a standby node and runs the node monitoring script. The script monitors CPU and memory utilization. If it determines that a switchover to the node would not succeed because of high CPU or memory load, it notifies the administrator by email or SNMP event forwarding. Monitoring runs at the same interval as normal LifeKeeper resource monitoring, which is set by LKCHECKINTERVAL in /etc/default/LifeKeeper.

Monitored Resources

The following can be monitored with Node Monitoring:

Resource Name	Monitoring Details
CPU Utilization	Checks CPU utilization using the /proc/stat file
Memory Utilization	Checks memory utilization using the /proc/meminfo file
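
For illustration, the two checks in the table can be approximated with a short Python sketch. This is not the LifeKeeper node monitoring script; the function names are hypothetical, and the formulas (busy time as total minus idle and iowait, memory in use as MemTotal minus MemAvailable) are assumptions about one reasonable way to derive utilization from these files.

```python
def cpu_times(stat_text):
    """Return (busy, total) jiffies from the aggregate 'cpu' line of /proc/stat."""
    for line in stat_text.splitlines():
        fields = line.split()
        if fields and fields[0] == "cpu":
            values = [int(v) for v in fields[1:]]
            # Field order: user nice system idle iowait irq softirq steal ...
            # Guest time is already included in user, so only the first 8 are summed.
            total = sum(values[:8])
            idle = values[3] + (values[4] if len(values) > 4 else 0)
            return total - idle, total
    raise ValueError("no aggregate cpu line found")


def cpu_utilization(before_text, after_text):
    """Percent CPU busy between two snapshots of /proc/stat."""
    busy0, total0 = cpu_times(before_text)
    busy1, total1 = cpu_times(after_text)
    delta = total1 - total0
    return 100.0 * (busy1 - busy0) / delta if delta > 0 else 0.0


def memory_utilization(meminfo_text):
    """Percent memory in use, assuming MemAvailable is present in /proc/meminfo."""
    info = {}
    for line in meminfo_text.splitlines():
        key, _, rest = line.partition(":")
        parts = rest.split()
        if parts:
            info[key.strip()] = int(parts[0])  # values are reported in kB
    total = info["MemTotal"]
    available = info["MemAvailable"]
    return 100.0 * (total - available) / total
```

On a live node, the inputs would be the contents of /proc/stat (read twice, some seconds apart) and /proc/meminfo. CPU counters in /proc/stat are cumulative since boot, which is why two snapshots are needed to get a utilization figure for a given period.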

Node Monitoring Configuration

Set SNHC_CPUCHECK and SNHC_MEMCHECK in the /etc/default/LifeKeeper configuration file, and also configure the following settings. See the Standby Node Health Check Parameters List for details.

SNHC_CPUCHECK_THRESHOLD

SNHC_CPUCHECK_TIME

SNHC_MEMCHECK_THRESHOLD

SNHC_MEMCHECK_TIME
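
As a quick sanity check, a sketch like the following can report which of the settings named above are absent from the configuration file. It assumes the file uses shell-style KEY=value lines with `#` comments; the function names are hypothetical, and the sketch does not validate the values themselves (see the Standby Node Health Check Parameters List for those).

```python
# Setting names taken from this page; everything else is illustrative.
SNHC_KEYS = (
    "SNHC_CPUCHECK",
    "SNHC_CPUCHECK_THRESHOLD",
    "SNHC_CPUCHECK_TIME",
    "SNHC_MEMCHECK",
    "SNHC_MEMCHECK_THRESHOLD",
    "SNHC_MEMCHECK_TIME",
)


def read_settings(config_text):
    """Parse KEY=value lines, skipping blank lines and '#' comments."""
    settings = {}
    for line in config_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip().strip('"')
    return settings


def missing_snhc_settings(config_text):
    """Return the node monitoring settings not defined in the given text."""
    settings = read_settings(config_text)
    return [key for key in SNHC_KEYS if key not in settings]
```

For example, passing the contents of /etc/default/LifeKeeper to `missing_snhc_settings` returns an empty list when all six settings are defined.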
