Cause: ======= A fault tolerance member presents this warning advisory message when it detects that too few group members are active. This situation is usually transient and resolves itself quickly without intervention. However, if the situation persists, it might indicate problems that require attention.
This warning indicates that the following conditions all hold simultaneously:
This member is inactive.
This member will not activate; that is, its rank indicates that it should remain inactive.
The number of members broadcasting heartbeat messages is still less than the active goal parameter.
Notice that a member does not receive this advisory if it is either active or about to activate.
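The three conditions above can be read as a single predicate. The following is a minimal sketch of that logic only, not the library's implementation: the names MemberView, is_active, will_activate, heartbeat_senders, active_goal and should_warn_too_few_active are hypothetical, and the rank decision is assumed to be already computed and passed in as a boolean.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MemberView:
    """Hypothetical snapshot of what one member knows about its group."""
    is_active: bool         # this member is currently active
    will_activate: bool     # this member's rank says it should activate
    heartbeat_senders: int  # members currently broadcasting heartbeat messages
    active_goal: int        # the group's active goal parameter


def should_warn_too_few_active(view: MemberView) -> bool:
    """Return True only when all three conditions hold at the same time."""
    # An active member, or one about to activate, does not receive the advisory.
    if view.is_active or view.will_activate:
        return False
    # Inactive and staying inactive: warn only if heartbeats fall short of the goal.
    return view.heartbeat_senders < view.active_goal


if __name__ == "__main__":
    # Inactive member, not due to activate, one heartbeat sender, goal of two.
    print(should_warn_too_few_active(MemberView(False, False, 1, 2)))
```

Because the check returns early for active or activating members, only members that are inactive and expected to stay inactive ever report the shortfall, which matches the note above.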
Resolution: ========= This warning can indicate any of several situations:
Network connectivity is erratic. The network repeatedly separates into two or more disconnected parts and then reconnects. Notify your network administrator immediately.
Member processes terminate immediately upon activation. New members activate to replace them, resulting in a cascade of failures. Possible causes include errors in program code and transient network overload.