Help us improve your experience.

Let us know what you think.

Do you have time for a two-minute survey?

header-navigation
keyboard_arrow_up
list Table of Contents
file_download PDF
{ "lLangCode": "en", "lName": "English", "lCountryCode": "us", "transcode": "en_US" }
English
ON THIS PAGE
keyboard_arrow_right

Troubleshoot RabbitMQ Cluster Failure

Release: Paragon Automation 23.2
{}
Change Release
date_range 26-Sep-22

Problem

The RabbitMQ cluster fails as a result of a power outage.

Solution

If the RabbitMQ cluster fails as a result of a power outage, the RabbitMQ message bus may not restart properly. To identify the status of the RabbitMQ message bus, run the kubectl get po -n northstar -l app=rabbitmq command. This command should show three pods with their status as Running. For example:

content_copy zoom_out_map
$ kubectl get po -n northstar -l app=rabbitmq
NAME READY STATUS RESTARTS AGE
rabbitmq-0 1/1 Running 0 10m
rabbitmq-1 1/1 Running 0 10m
rabbitmq-2 1/1 Running 0 9m37s

If the status of one or more pods is Error, use the following recovery procedure:

  1. Delete RabbitMQ.

    kubectl delete po -n northstar -l app=rabbitmq

  2. Check the status of the pods.

    kubectl get po -n northstar -l app=rabbitmq.

    Repeat kubectl delete po -n northstar -l app=rabbitmq until the status of all pods is Running.

  3. Restart the Paragon Pathfinder application.

    content_copy zoom_out_map
    kubectl rollout restart deploy -n northstar
footer-navigation