How a Cluster Works in Unity Connection 10.x
How a Cluster Works in Unity Connection 10.x
The Unity Connection cluster feature provides high availability voice messaging through two Unity Connection servers that are configured in a cluster. Under normal conditions, the Unity Connection servers are both active so that:
- The cluster can be assigned a DNS name that is shared by the Unity Connection servers.
- Clients such as email applications and the web tools available through the Cisco Personal Communications Assistant (PCA) can connect to either Unity Connection server.
- Phone systems can send calls to either Unity Connection server.
- Incoming phone traffic load is balanced between the Unity Connection servers by the phone system, PIMG/TIMG units, or other gateways that are required for the phone system integration.
Each server in the cluster is responsible for handling a share of the incoming calls for the cluster (answering phone calls and taking messages). The server with Primary status is responsible for the following functions:
- Homing and publishing the database and message store, which are both replicated to the other server.
- Sending message notifications and MWI requests (the Connection Notifier service is activated).
- Sending SMTP notifications and VPIM messages (the Connection Message Transfer Agent service is activated).
- Synchronizing voice messages in Unity Connection and Exchange mailboxes, if the single-inbox feature is turned on (the Unity Connection Mailbox Sync service is activated).
When one of the servers stops functioning (for example, when it is shut down for maintenance), the remaining server assumes responsibility for handling all incoming calls for the cluster. The remaining server also assumes responsibility for the database and message store, which are both replicated to the other server when the connection and its functionality are restored.
When the server that stopped functioning is able to resume its normal functions and is activated, it resumes responsibility for handling its share of incoming calls for the cluster.
To monitor the status of the servers, the Connection Server Role Manager service runs in Cisco Unity Connection Serviceability on both servers. This service performs the following functions:
- Starts the applicable services on each server, depending on server status.
- Determines whether critical processes (such as voice message processing, database replication, voice message synchronization with Exchange, and message store replication) are functioning normally.
- Initiates changes to server status when the server with Primary status is not functioning or when critical services are not running.
Note the following limitations when the publisher server is not functioning:
- If the Unity Connection cluster is integrated with an LDAP directory, directory synchronization does not occur, although authentication continues to work when only the subscriber server is functioning. When the publisher server is functioning again, directory synchronization resumes.
- If a Digital Network includes the Unity Connection cluster, directory updates do not occur, although messages continue to be sent to and from the cluster when only the subscriber server is functioning. When the publisher server is functioning again, directory updates resume.
The Connection Server Role Manager service sends a keep-alive events between the publisher and subscriber servers to confirm that the servers are functioning and connected. If one of the servers stops functioning or the connection between the servers is lost, the Connection Server Role Manager service waits for the keep-alive events and may require 30 to 60 seconds to detect that the other server is not available. While the Connection Server Role Manager service is waiting for the keep-alive events, users signing in to the server with Secondary status will not be able to access their mailbox or send messages, because the Connection Server Role Manager service has not yet detected that the server with Primary status (which has the active message store) is unavailable. In this situation, callers who attempt to leave a message may hear dead air or may not hear the recording beep.
NoteIt is recommended to import and delete the LDAP users from the publisher node only. It is recommended to import and delete the LDAP users from the publisher node only.
Licenses for an Unity Connection 10.x Cluster
A Cisco Unity Connection cluster requires a license for each Unity Connection server. The license that has the MAC address of the publisher server must be installed on the publisher server. The license that has the MAC address of the subscriber server must be installed on the subscriber server.
For information on managing licenses, see the “Managing Licenses in Cisco Unity Connection 10.x” chapter of the System Administration Guide for Cisco Unity Connection Release 10.x at http://www.cisco.com/en/US/docs/voice_ip_comm/connection/10x/administration/guide/10xcucsagx.html .
About the Unity Connection Publisher Server
The first Cisco Unity Connection server that is configured in the cluster is the publisher server. The Cluster Management page in Cisco Unity Connection Serviceability identifies the publisher server.
The publisher server assumes responsibility for publishing the database and message store when the cluster is functioning normally.
When the publisher server does not have Primary status (for example, when the administrator manually changes the status of the other server to Primary, which automatically changes the status of the publisher server to Secondary), the other server assumes responsibility for publishing the database and message store.
Server Status Functions in the Unity Connection 10.x Cluster
Each server in the cluster has a status that appears on the Cluster Management page of Cisco Unity Connection Serviceability. The status indicates the functions that the server is currently performing in the cluster, as described in Table 3-1 .
Server Assignments and Usage of Voice Messaging Ports in Unity Connection
In a Cisco Unity Connection cluster, the servers share the same phone system integrations. Each server is responsible for handling a share of the incoming calls for the cluster (answering phone calls and taking messages).
Depending on the phone system integration, each voice messaging port is either assigned to a specific server or used by both servers. Table 3-2 describes the port assignments.
Requirements for a Unity Connection 10.x Cluster
For current Cisco Unity Connection cluster requirements, see System Requirements for Cisco Unity Connection Release 10.x at www.cisco.com/en/US/docs/voice_ip_comm/connection/10x/requirements/10xcucsysreqs.html .
Effects on Calls in Progress When Server Status Changes in Unity Connection
When the status of a Cisco Unity Connection server changes, the effects on calls in progress depend on the final status of the server that is handling a call and on the condition of the network. Table 3-3 describes the effects.
If network connections are lost, then calls in progress may be dropped, depending on the nature of the network problem.
Effects on Unity Connection 10.x Web Applications When Server Status Changes
Normal functioning of the following web applications are not affected when the server status changes:
Effects of Stopping a Critical Service on Unity Connection 10.x
Critical services are necessary for the normal functioning of the Cisco Unity Connection system. The effects of stopping a critical service depend on the server and its status. Table 3-4 describes the effects.
Effects of a Split-Brain Condition in Unity Connection 10.x
When the servers in a Cisco Unity Connection cluster have Primary status at the same time (for example, when the servers have lost their connection with each other), both servers handle incoming calls (answer phone calls and take messages), send message notifications, send MWI requests, accept changes to the administrative interfaces (such as Unity Connection Administration), and synchronize voice messages in Unity Connection and Exchange mailboxes if single inbox is turned on. However, the servers do not replicate the database and message store to each other and do not receive replicated data from each other.
When the connection between the servers is restored, the status of the servers temporarily changes to Split Brain Recovery while the data is replicated between the servers and MWI settings are coordinated. During the time when the server status is Split Brain Recovery, the Connection Message Transfer Agent service and the Connection Notifier service (in Cisco Unity Connection Serviceability) are stopped on both servers, so Unity Connection does not deliver any messages and does not send any message notifications. The Unity Connection Mailbox Sync service is also stopped, so Unity Connection does not synchronize voice messages with Exchange (single inbox). The message stores are also briefly dismounted, so that Unity Connection tells users who are trying to retrieve their messages at this point that their mailboxes are temporarily unavailable.
When the recovery process is complete, the Connection Message Transfer Agent service and the Connection Notifier service are started on the publisher server. Delivery of the messages that arrived while during the recovery process may take additional time, depending on the number of messages to be delivered. The Connection Message Transfer Agent service and the Connection Notifier service are started on the subscriber server. Finally, the publisher server has Primary status and the subscriber server has Secondary status. At this point, the Unity Connection Mailbox Sync service is started on the server with Primary status, so that Unity Connection can resume synchronizing voice messages with Exchange if single inbox is turned on.
Events When Server Status Changes in Unity Connection 10.x
This section describes the events that take place when server status changes in the following situations:
- Automatic Change of Server Status Initiated by a Unity Connection Server with Primary Status
- Automatic Change of Server Status Initiated by a Unity Connection Server with Secondary Status
- Manual Change of Unity Connection Server Status Initiated by Administrator
Automatic Change of Server Status Initiated by a Unity Connection Server with Primary Status
1. The Unity Connection Server Role Manager service on the server with Primary status detects an unrecoverable failure (for example, the database fails or a critical service is stopped).
2. The Unity Connection Server Role Manager service on the server with Primary status notifies the Unity Connection Server Role Manager service on the other server to change its status.
3. The Unity Connection Server Role Manager service on both servers posts alarms that it is initiating a change of status.
4. The Unity Connection Server Role Manager service on the server with Primary status sets its status in the database to Secondary.
5. The Unity Connection Server Role Manager service on the other server (the server that originally had Secondary status) sets its status in the database to Primary.
6. The Unity Connection Server Role Manager service on the server that now has Primary status starts the critical services on that server.
7. The data connector detects the changed server status and sets the connections to use the database on the server that now has Primary status.
8. If possible, database and message store replication continues between the servers.
9. The Unity Connection Server Role Manager service on the server that now has Primary status posts an alarm that the change of status is complete.
Automatic Change of Server Status Initiated by a Unity Connection Server with Secondary Status
1. The Unity Connection Server Role Manager service on the server with Secondary status does not receive contact from the Unity Connection Server Role Manager service on the server with Primary status.
2. The Unity Connection Server Role Manager service on the server with Secondary status confirms its network connection by pinging the local host and other known remote servers.
3. If the network connection is confirmed, the Unity Connection Server Role Manager service on the server with Secondary status posts an alarm that it is initiating a change of status.
If the network connection is not available, the status does not change and the remaining events do not occur.
4. The Unity Connection Server Role Manager service on the server with Secondary status sets its status in the database to Primary.
5. The Unity Connection Server Role Manager service on the server that now has Primary status starts the critical services on that server.
6. The data connector detects the changed status and sets the connections to use the database on the server that now has Primary status.
7. If possible, database and message store replication continues between the servers.
8. The Unity Connection Server Role Manager service on the server that now has Primary status posts an alarm that the change of status is complete.
Manual Change of Unity Connection Server Status Initiated by Administrator
1. In Cisco Unity Connection Serviceability, the administrator manually initiates a change of server status.
2. The Unity Connection Server Role Manager service on the server with Secondary status notifies the Unity Connection Server Role Manager service on the server with Primary status to initiate change of status.
3. The Unity Connection Server Role Manager service on the both servers posts alarms that the change of status is being initiated.
4. The Unity Connection Server Role Manager service on the server with Primary status sets its status in the database to Secondary.
5. The Unity Connection Server Role Manager service on the other server (the server that originally had Secondary status) sets its status in the database to Primary.
6. The Unity Connection Server Role Manager service on the server that now has Primary status starts the critical services on that server.
7. The data connector detects the changed status and sets the connections to use the database on the server that now has Primary status.
8. Database and file replication continues between the servers.
9. The Unity Connection Server Role Manager service on the server that now has Primary status posts an alarm that the change of status is complete.