Cluster is not coming up properly. Error, "Failed connecting to [Metaspace:xxxx] with error SYS_ERROR (connect_timeout - connect timeout). Retrying in [30] seconds".

Cluster is not coming up properly. Error, "Failed connecting to [Metaspace:xxxx] with error SYS_ERROR (connect_timeout - connect timeout). Retrying in [30] seconds".

book

Article ID: KB0093300

calendar_today

Updated On:

Products Versions
TIBCO BusinessEvents Enterprise Edition -
Not Applicable -

Description

Description:
Cluster is not coming up properly. Error, "Failed connecting to [Metaspace:xxxx] with error SYS_ERROR (connect_timeout - connect timeout). Retrying in [30] seconds".


Symptoms:
Error reported:
 
2014 Sep 11 17:32:52:282 GMT -7 -CS-P01 Warning [main] - [as.runtime.cluster.om.daoprovider] Failed connecting to [Metaspace:xxxx] with error SYS_ERROR (connect_timeout - connect timeout). Retrying in [30] seconds
2014 Sep 11 17:32:52:284 GMT -7 -CS-P01 Debug [main] - [runtime.session] Error performing initAll()
java.lang.RuntimeException: Unable to connect to metaspace [aaaaa] even after 5 tries
        at com.tibco.cep.runtime.service.dao.impl.tibas.ASDaoProvider.initHook(SourceFile:344)
        at com.tibco.cep.runtime.service.om.impl.AbstractDaoProvider.init(AbstractDaoProvider.java:65)
        at com.tibco.cep.runtime.service.cluster.MultiAgentCluster.init(MultiAgentCluster.java:112)
        at com.tibco.cep.runtime.session.impl.RuleServiceProviderImpl.initCluster(RuleServiceProviderImpl.java:707)
        at com.tibco.cep.runtime.session.impl.RuleServiceProviderImpl.configure(RuleServiceProviderImpl.java:271)
        at com.tibco.cep.container.standalone.BEMain.jumpStart(SourceFile:149)
        at com.tibco.cep.container.standalone.BEMain.main(SourceFile:69)
2014 Sep 11 17:32:52:284 GMT -7 xxxx-CS-P01 Fatal [main] - [container.standalone] Unable to connect to metaspace [aaaaa] even after 5 tries
java.lang.RuntimeException: Unable to connect to metaspace [aaaaa] even after 5 tries
        at com.tibco.cep.runtime.service.dao.impl.tibas.ASDaoProvider.initHook(SourceFile:344)
        at com.tibco.cep.runtime.service.om.impl.AbstractDaoProvider.init(AbstractDaoProvider.java:65)
        at com.tibco.cep.runtime.service.cluster.MultiAgentCluster.init(MultiAgentCluster.java:112)
        at com.tibco.cep.runtime.session.impl.RuleServiceProviderImpl.initCluster(RuleServiceProviderImpl.java:707)
        at com.tibco.cep.runtime.session.impl.RuleServiceProviderImpl.configure(RuleServiceProviderImpl.java:271)
        at com.tibco.cep.container.standalone.BEMain.jumpStart(SourceFile:149)
        at com.tibco.cep.container.standalone.BEMain.main(SourceFile:69
Cause:
Question checklist.

1). What is the Cluster architecture? How many seeders, physical machines?

2). Are you able to use 'as-admin' to connect to the metaspace? What's is the space status?

3). Do you have access to the AS logs?

4).  Check the physical machines resources usage such as CPU/Memory usage percentages.

5). Did this Cluster work properly before? Any operations applied on the metaspace before this error was seen?

6). Check the AS log to see if the discovery node has started. If you see a message such as, "waiting for discovery nodes, retry count [6], discovery [tcp://], it shows that the discovery node has not started.

7). Check the CDD configuration.

8). Check the listening URL settings. If you use, for example, tcp://22519, confirm that the engine you want to start uses tcp://<hostname>:22519 and not any other value.

9). To check the hostname, ping it.

The hostname must be the hostname on the local machine. You can type the command "hostname" in the terminal window and use this name.

Issue/Introduction

Cluster is not coming up properly. Error, "Failed connecting to [Metaspace:xxxx] with error SYS_ERROR (connect_timeout - connect timeout). Retrying in [30] seconds".

Resolution

In this case, the reason for the error was the : (colon)  missing in the listening URL settings. Once added back, it should work properly.