- play_arrow Overview
- play_arrow MPC-specific
- play_arrow Fabric Management
- play_arrow Power Management
- play_arrow Environment Monitoring
- play_arrow Network Services Mode
- play_arrow Packet Scheduling Mode
- play_arrow Configuration Statements and Operational Commands
Platform Resiliency
This section covers generic information about the platform resiliency feature.
Resiliency represents the system's ability to anticipate, withstand, and rapidly recover from disruptions while maintaining critical functionality. This capability monitors the health status of various device components and handles faults by taking necessary actions.
Resiliency is a comprehensive solution applied at all levels of the system.
Platform resiliency is supported for multiple hardware components such as:
- CPU
- BIOS
- Memory
Storage
- USB
Temperature Sensors
- Management Ethernet
- FPGA
- Optics
- Fan/Fan Tray
- Power Supply Module
I2C Access
When a hardware failure occurs, the software performs the following actions:
Logs the message to give clear indication of failure details, including but not limited to time stamp, module name, component name and failure details.
Checks if system correlation is required for this fault.
Raises/clears alarms, if applicable.
Sends an SNMP trap, if applicable.
Glows the FRU fault if LED is present
Performs local action such as self-healing or taking the component out of service.
Platform-Specific Resiliency Support
Use Feature Explorer to know if your platform supports this feature.
Use the following table to review platform-specific behaviors for your platforms.
Platform | Support |
---|---|
EX4000, EX4100 | Platform resiliency is supported for Fan/Fan Tray, PEM/PSU, Temperature Sensor, FPGA, PFE, uBoot, Management Ethernet, Storage-eUSB/eMMC, I2C access, CPU. |
SRX4700 | Platform resiliency is supported for CPU, Memory, Storage, PCIe, Temperature sensor, voltage sensor, Fan/Fan Tray, PSU, Fabric links, Control Ethernet, BIOS, USB, FPGA/CPLD, Optics. |
EX4400 | Platform resiliency is supported for Storage, Temperature Sensor, PEM/PSM, BIOS, Fan/Fan Tray, I2C. |
PTX10001-36MR | Platform Resiliency is supported for BIOS, CPU, Temperature sensors, voltage sensors, memory, PCIe, Storage, Management Ethernet, FPGA. |
PTX10003, PTX10004, PTX10008, PTX100016 | Platform resiliency is supported for BIOS, Storage, CPU, Temperature Sensor, Voltage sensor, SIB faults, PSM, Fan/Fan Tray, I2C Access, PCIe, Linecard, FPGA/CPLD. |
PTX10002-36QDD | Platform resiliency is supported for FPGA/CPLD, PCIe, PSU, FAN, PTP FPGA, BITS (Timing), Optics, Fan/Fan Tray, PSU, Power Distribution Board (PDU), Port LED board. |
MX304 | Platform resiliency is supported for Fan/Fan Tray, PEM, Timing Board, FPGA, Fabric links, Optics, CPU, BIOS, Memory, Storage, Control Ethernet, PCIe. |
ACX Series | Platform resiliency is supported for BIOS, CPU, Memory, PSM, Storage, Fan/Fan Tray, FPGA/CPLD, PTP FPGA, BITS (Timing), USB, Management Ethernet, Optics. |
QFX5240-64OD/QD , QFX5241-64OD/QD and QFX5241E-64OD, QFX5230-64CD | Platform resiliency is supported for CPU, BIOS, Memory, USB port, Management Ethernet Ports, FPGA board, Optics panel, Fan/Fan tray, PSM. |
MX Series | Platform resiliency is supported for CPU, BIOS, PCIe, CPLD, Storage, I2C Access, Temperature Sensors, Optics, Fan/Fan Tray, PEM, Ethernet Links, PCH Interfaces, FPGA, USB, Clocking. |