Help us improve your experience.

Let us know what you think.

Do you have time for a two-minute survey?

header-navigation
keyboard_arrow_up
close
keyboard_arrow_left
QFX5240 Switch Hardware Guide
Table of Contents Expand all
list Table of Contents
file_download PDF
{ "lLangCode": "en", "lName": "English", "lCountryCode": "us", "transcode": "en_US" }
English
keyboard_arrow_right

Troubleshoot the QFX5240 Switch

date_range 29-Mar-24

QFX5240 Troubleshooting Resources Overview

To troubleshoot a QFX5240 problem, you can use:

  • Junos Evolved OS CLI

    The CLI is the primary tool for controlling and troubleshooting hardware, Junos OS Evolved, routing protocols, and network connectivity. CLI commands display information from routing tables, information specific to routing protocols, and information about network connectivity derived from the ping and traceroute utilities. For information about using the CLI to troubleshoot Junos OS Evolved, see the appropriate Junos OS Evolved configuration guide.

  • Alarms and LEDs on the network ports, management panel, and components

    When the Routing Engine detects an alarm condition, it lights the red or yellow alarm LED on the management panel as appropriate. In addition, you can also use component LEDs and network port LEDs to troubleshoot the QFX5240 switch. For more information, see QFX5240 Management Panel.

  • JTAC

    If you need assistance during troubleshooting, you can contact the Juniper Networks Technical Assistance Center (JTAC) by using the Web or by telephone. If you encounter software problems, or problems with hardware components not discussed here, contact JTAC.

  • Knowledge Base articles–Knowledge Base.

QFX5240 Alarm Messages Overview

When a QFX5240 switch detects an alarm condition, it lights the red or yellow alarm LED on the management panel as appropriate. To view a more detailed description of the alarm cause, issue the show system alarms operational CLI command.

content_copy zoom_out_map
user@host> show system alarms
2 alarms currently active
Alarm time               Class  Description
2019-01-22 16:32:54 PST  Major  PEM 1 Absent
2019-01-22 16:31:04 PST  Minor  Host 0 Disk 2 Labelled incorrectly

For thermal problems, the show chassis temperature-thresholds CLI command shows the cutoff temperatures for each level of alarm:

content_copy zoom_out_map
user@host> show chassis temperature-thresholds
                                        Fan speed      Yellow alarm      Red alarm      Fire Shutdown
                                       (degrees C)      (degrees C)     (degrees C)      (degrees C)
Item                                  Normal  High   Normal  Bad fan   Normal  Bad fan     Normal
Routing Engine 0 CPU Temperature          65    70       95       92       97       94        100
Routing Engine 0 Ch-0 DIMM-0 Temp         55    62       80       77       82       79         85
Routing Engine 0 Ch-1 DIMM-0 Temp         55    62       80       77       82       79         85
FPC 0 Sensor TH5 Max Reading              75    80      100       97      105      102        110
FPC 0 et-0/0/0                            53    60       73       73       76       76         77
FPC 0 et-0/0/4                            53    60       73       73       76       76         77
FPC 0 MB Middle Right Rear                45    50       80       77       85       82         90
FPC 0 MB Middle Left Rear                 45    50       80       77       85       82         90
FPC 0 MB Left Rear                        45    50       80       77       85       82         90
FPC 0 MB Left Front                       45    50       80       77       85       82         90
FPC 0 MB Right Rear                       45    50       80       77       85       82         90
FPC 0 MB Right Front                      45    50       80       77       85       82         90
FPC 0 MB OSFP_GRP1_3V3                    75    80      115      112      120      117        125
FPC 0 MB OSFP_GRP2_3V3                    75    80      115      112      120      117        125
FPC 0 MB OSFP_GRP3_3V3                    75    80      115      112      120      117        125
FPC 0 MB VDD_0P75                         75    80      115      112      120      117        125
FPC 0 MB TRVDD1_0V9_0V75                  75    80      115      112      120      117        125
FPC 0 MB TRVDD0_0V9_0V75                  75    80      115      112      120      117        125
FPC 0 FB Exhaust Left                     40    45       65       62       70       67         75
FPC 0 FB Exhaust Right                    40    45       65       62       70       67         75

You can also calculate the percentage of fan RPM, or duty cycle. Use the following command as root:

content_copy zoom_out_map
root@re0.~#i2cget -y -f 13 0x66 0x11
0x07

In this example, the system returned the hexadecimal value 0x07. If you convert that value to decimal, you get 7. Then use this formula to calculate the duty cycle:

content_copy zoom_out_map
Duty cycle = (value returned + 1)*6.25%

In this example, duty cycle = (7 +1)*6.25 = 50%

Chassis Alarm Messages

Chassis alarms indicate a failure on the device or one of its components. Chassis alarms are preset and cannot be modified.

Chassis alarms on QFX5240 switches have two severity levels:

  • Major (red)—Indicates a critical situation on the device that has resulted from one of the conditions described in Table 1. A red alarm condition requires immediate action.

  • Minor (yellow)—Indicates a noncritical condition on the device that, if left unchecked, might cause an interruption in service or degradation in performance. A yellow alarm condition requires monitoring or maintenance.

Table 1 describes the chassis alarm messages on a QFX5240.

Junos OS Evolved systems, such as QFX5240, are based on a new alarm infrastructure, which does not support all power supplies and fan alarms. Table 1 shows these alarms.

Table 1: Chassis Alarm Messages for QFX5240

Component

Alarm Type

CLI Message

Recommended Action

Fans

Red (major)

Fan Tray fan-tray-number Absent

Install fan modules in the slots where they are absent.

Fan Tray fan-tray-number Failure

Remove and check the fan module for obstructions. Reinsert the fan module. If the problem persists, replace the fan module.

sensor-location Temp Sensor Too Hot

Check the environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. If the condition persists, the device might shut down.

Yellow (minor)

FAN fan-number Fan Sensor Fail

Remove and check the fan module for obstructions. Reinsert the fan module. If the problem persists, check the system log for the message related to the sensor and report the message to customer service.

sensor-location Temp Sensor Too Warm

Check the environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor.

Power Supplies

Red (major)

PEM pem-number Not Powered

Install a power supply into the empty slot and ensure the power supply is powered.

Temperature sensors

Major (red)

FPC 0 Temperature Hot

Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. If the condition persists, the device might shut down.

Minor (yellow)

FPC 0 Temperature Warm

Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor.

FPC 0 Temp Sensor Fail

Check the system log for the following error message and report the message to customer support:

Routing Engine

Major (red)

RE RE number /var partition is full

File storage is at capacity. Reduce unnecessary files to free space.

Minor (yellow)

RE RE number /var partition is high

File storage is reaching capacity. Reduce unnecessary files to free space.

Management Ethernet interface

Major (red)

Management interface management-interface-name down on node

Check whether a cable is connected to the management Ethernet interface, or whether the cable is defective. Replace the cable, if required.

footer-navigation