Help us improve your experience.

Let us know what you think.

Do you have time for a two-minute survey?

 
 

thermal-health-check

Syntax

Hierarchy Level

Description

Enable thermal health check, and configure an action to be taken on detection of a thermal health event such as power leakage on PTX5K, MX10K, PTX10K and QFX10K devices. The thermal check feature monitors the PSM power output and FRU power consumption every minute. When the PSM power output exceeds the FRU power consumption by a default threshold for three consecutive iterations, it assumes a thermal health event and takes an action based on user configuration.

Note:

The default threshold for QFX10002 devices is 100 W and for other devices is 600 W.

The default action is set to none. It can be configured to shutdown all PSMs when thermal health check fails. The default shutdown timer is 900 seconds but can be configured. For example, the command provided below enables thermal health check, and shutdowns the system after 10 seconds of thermal health check failure:

You can enable the PSM watchdog to shutdown the system in case a thermal health event causes Junos to go down. Please note that PEM firmware upgrade is required for the thermal health check and PSM watchdog feature.

You can enable the fet-failure-check option to monitor a failing power supply due to a Field-effect Transistor (FET) failure and take corrective action. You can choose to shutdown a reporting PSM if a redundant power supply is available, raise an alarm and log the events when a risk of thermal event is determined.

Note:

The fet-failure-check option is supported on MX10K and PTX10K devices.

Options

fet-failure-check Enable FET failure detection, and configure an action to be taken upon FET failure.
action-onfail Choose an action to be performed on detection of a thermal health event. The following options are available:
  • auto-shutdown—The software shuts down the reporting PSM based on redundancy when a thermal health event is detected.

  • none—The software raises a major alarm when a thermal health event is detected.

Required Privilege Level

interface—To view this statement in the configuration.

interface-control—To add this statement to the configuration.

Release Information

Statement introduced in Junos OS Release 20.1R1.

fet-failure-check option introduced in Junos OS Release 21.2R1.