The following should be considered before using the LifeKeeper NFS Server Recovery Kit:
- The NFS file system to be placed under LifeKeeper protection must be exported by the primary server (the server where the NFS resource is being created). This implies that NFS is running and the underlying file system is mounted.
- When you export a read/write file system, use the sync option. This option requests that all file system writes be committed to disk before the write request completes. If the sync option is not used with an NFS file system under LifeKeeper protection, data may be lost during a failover.
- The underlying file system must be on a shared device and mounted with write permission.
- If the underlying file system is already protected by LifeKeeper, it must be in service on the primary server and have the highest priority. If the underlying file system is not under LifeKeeper protection, then the Recovery Kit will place it under protection.
- The NFSv4 root and any bind mounts must be exported by the primary server (the server where the NFS resource is being created). This implies NFS is running and the underlying file systems (root and bind mounts) are mounted. The root and bind mounts must be on shared devices that LifeKeeper can protect. If the underlying file systems are already protected by LifeKeeper, they must be in service on the primary server and have the highest priority. If the file systems are not already protected, then the recovery kit will place them under protection. Note: If the NFSv4 root is /export with three subdirectories called bin, opt and other, local file systems /usr/local/bin and /usr/local/opt are bind mounted into the NFSv4 pseudo file system, then /export, /usr/local/bin and /usr/local/opt must be mounted on a shared device with /export mounted with write permission, /usr/local/bin and /usr/local/opt must be bind mounted into /export (mount – -binda /usr/local/bin /export/bin and mount – -bind /usr/local/opt /export/bin) and the /export, export/bin and /export/opt must all be exported. See the configuration example Active/Standby for NFSv4 for further information.
- The NFS Server Recovery Kit requires an IP resource that must be created and in service on the primary server. The IP resource must also have its highest priority on the primary server.
- Before creation of the NFS resource, clients must be able to mount the NFS file system using the LifeKeeper-protected IP address.
- When you extend an NFS file system resource, the file system must mount at the same mount point on each server.
- When protecting NFSv4 root exports, /var/lib/nfs is moved to the NFSv4 root file system which must have write permissions set when mounted. To provide continued access, a symbolic link is created from /var/lib/nfs to the NFSv4 root. Because of this, Active/Active NFSv4 configurations are not supported nor are configuations with NFSv2/v3 and NFSv4. NFSv2/v3 exports should use the bind features of NFSv4 instead of attempting to make them standalone NFSv2/v3 exports.
- The Oracle Recovery Kit supports NFSv3 for shared database storage. NFSv4 is not supported by the Oracle Recovery Kit at this time due to NFSv4 file locking mechanisms.
- When using a system that adopts Linux kernel 3.12 or later as an NFS client, the file lock at the client side is lost when the communication is disconnected depending on specifications of the kernel. Therefore, takeover of the lock is not guaranteed during switchover or failover.
- In NFS v4, the file lock is lost when the client cannot communicate with the server for more than a certain period of time. This period of time is set as the variable NFS_V4_LEASE_TIME and the default value is 10 (seconds). When the communication between the client and the server is disconnected for a longer time than the period set in NFS_V4_LEASE_TIME during a switchover or failover, the file lock set by the client is forcibly disabled. In order to change this value, set an appropriate value of the environment variable NFS_V4_LEASE_TIME in /etc/default/LifeKeeper.
If you increase the value of NFS_V4_LEASE_TIME, the above mentioned problem will less likely to occur but it will take time to reconnect to the client after the server switching.
Post your comment on this topic.