Troubleshooting a Fabric Link Failure in an SRX Chassis Cluster
Problem
Description
The fabric link fails to come up in an SRX chassis cluster.
Environment
SRX chassis cluster
Symptoms
The status of the fabric link is displayed
as down
in the output of the show
chassis cluster interfaces
command. Here are sample outputs
for an SRX branch device and a high-end SRX device.
{primary:node0} root@SRX_Branch> show chassis cluster interfaces Control link 0 name: fxp1 Control link status: Up Fabric interfaces: Name Child-interface Status fab0 ge-0/0/2 down fab0 fab1 ge-9/0/2 down fab1 Fabric link status: down
{primary:node0} root@SRX_HighEnd> show chassis cluster interfaces Control link 0 name: em0 Control link 1 name: em1 Control link status: up Fabric interfaces: Name Child-interface Status fab0 ge-0/0/5 down fab0 Fabric link status: down
Diagnosis
Are the fabric link ports connected through a switch?
Yes: Remove the switch and connect the fabric link ports directly. Reboot the secondary node and check whether the fabric link is up.
If the link is up, then there might be an issue in the chassis cluster setup on the Layer 2 switch network. See SRX Series Gateway Cluster Deployment in Layer 2 Network.
If the link is down, proceed to Step 2.
No: Proceed to Step 2.
-
Are the link LEDs for the fabric link ports on both the nodes lit green?
-
Yes: The physical link is up, but the fabric packets are not being processed. To eliminate possible issues with the port:
-
Reconfigure the fabric link on a different port, connect the cable to the new port, and reboot the secondary node.
-
Check whether the fabric link status is up:
-
If the link is up, the issue is resolved.
There might be a hardware issue with the onboard ports or interface module ports on which you had previously configured the fabric link. Verify the interface statistics by using the
show interfaces interface-name
command. Open a case with your technical support representative to resolve the issue with the ports. Proceed to Data Collection for Customer Support. -
If the link is still down, open a case with your technical support representative. Proceed to Data Collection for Customer Support.
-
-
-
No: The fabric link cable might be faulty. Proceed to Step 3.
-
-
Change the cable connecting the fabric link ports and check the link LED. Is the LED lit green?
-
Yes: This indicates that the original cable was faulty. Reboot both the nodes simultaneously to come out of the bad state. If the fabric link does not come up after the reboot:
-
Reconfigure the fabric link on a different port, connect the cable to the new port, and reboot the secondary node.
-
Check whether the fabric link status is up:
-
If the link is up, the issue is resolved.
There might be a hardware issue with the onboard ports or interface module ports on which you had previously configured the fabric link. Verify the interface statistics by using the
show interfaces interface-name
command. Open a case with your technical support representative to resolve the issue with the ports. Proceed to Data Collection for Customer Support. -
If the link is still down, open a case with your technical support representative. Proceed to Data Collection for Customer Support.
-
-
-
No: The transceivers might be faulty. Proceed to step 4.
-
-
If the fabric link port is an SFP or XFP port, change the transceivers on both the nodes. Ensure that you use transceivers provided by Juniper Networks and that the transceivers are of the same type (such as LX or SX). Is the fabric link up now?
-
Yes: The issue is resolved.
The original transceivers used on the fabric link ports might be faulty. Open a case with your technical support representative to resolve the issue with the transceivers. Proceed to Data Collection for Customer Support.
-
No: Continue to troubleshoot this issue with your technical support representative. Proceed to Data Collection for Customer Support.
-