About Environmental Monitoring
Environmental monitoring of chassis components provides early warning indications of possible component failure. This warning helps you to ensure the safe and reliable operation of your system and avoid network interruptions.
This section describes how to monitor critical system components so that you can identify and rapidly correct hardware-related problems.
Using CLI Commands to Monitor your Environment
Enter the show environment [all | counters | history | location | sensor | status | summary | table] command to display system status information. Keyword descriptions are listed in the following table.
Keyword |
Purpose |
---|---|
all |
Displays a detailed listing of all the environmental monitor parameters (for example, the power supplies, temperature readings, voltage readings, and so on). This is the default. |
counters |
Displays operational counters. |
history |
Displays the sensor state change history. |
location |
Displays sensors by location. |
sensor |
Displays the sensor summary. |
status |
Displays field-replaceable unit (FRU) operational status and power and power supply fan sensor information. |
summary |
Displays the summary of all the environment monitoring sensors. |
table |
Displays a sensor state table. |
Displaying Environment Conditions
Supervisor modules and their associated line cards support multiple temperature sensors per card. The environment condition output includes the temperature reading from each sensor and the temperature thresholds for each sensor. These line cards support three thresholds: warning, critical, and shutdown.
The following example illustrates how to display the environment condition on a supervisor module. The thresholds appear within parentheses.
Device# show environment
Number of Critical alarms: 0
Number of Major alarms: 0
Number of Minor alarms: 0
Slot Sensor Current State Reading Threshold(Minor,Major,Critical,Shutdown)
---------- -------------- --------------- ------------ ---------------------------------------
R0 Temp: InltFrnt Normal 27 Celsius (45 ,50 ,55 ,60 )(Celsius)
R0 Temp: InltRear Normal 28 Celsius (45 ,50 ,55 ,60 )(Celsius)
R0 Temp: OtltFrnt Normal 35 Celsius (75 ,80 ,85 ,90 )(Celsius)
R0 Temp: OtltRear Normal 43 Celsius (75 ,80 ,85 ,90 )(Celsius)
R0 Temp: UADP_0_0 Normal 54 Celsius (105,110,120,124)(Celsius)
R0 Temp: UADP_0_1 Normal 53 Celsius (105,110,120,124)(Celsius)
R0 Temp: UADP_0_2 Normal 53 Celsius (105,110,120,124)(Celsius)
R0 Temp: UADP_0_3 Normal 55 Celsius (105,110,120,124)(Celsius)
R0 Temp: UADP_0_4 Normal 54 Celsius (105,110,120,124)(Celsius)
R0 Temp: UADP_0_5 Normal 55 Celsius (105,110,120,124)(Celsius)
R0 Temp: UADP_0_6 Normal 64 Celsius (105,110,120,124)(Celsius)
R0 Temp: UADP_0_7 Normal 59 Celsius (105,110,120,124)(Celsius)
R0 Temp: UADP_0_8 Normal 55 Celsius (105,110,120,124)(Celsius)
<output truncated>
The following example illustrates how to display the LED status on a supervisor module.
Device# show hardware led
SWITCH: 1
SYSTEM: GREEN
Line Card : 1
PORT STATUS: (48) Fo1/0/1:BLACK Fo1/0/2:BLACK Fo1/0/3:BLACK Fo1/0/4:BLACK Fo1/0/5:BLACK Fo1/0/6:BLACK Fo1/0/7:BLACK Fo1/0/8:BLACK Fo1/0/9:BLACK Fo1/0/10:BLACK Fo1/0/11:BLACK Fo1/0/12:BLACK Fo1/0/13:BLACK Fo1/0/14:BLACK Fo1/0/15:BLACK Fo1/0/16:BLACK Fo1/0/17:BLACK Fo1/0/18:BLACK Fo1/0/19:BLACK Fo1/0/20:BLACK Fo1/0/21:GREEN Fo1/0/22:BLACK Fo1/0/23:BLACK Fo1/0/24:BLACK Hu1/0/25:GREEN Hu1/0/26:BLACK Hu1/0/27:BLACK Hu1/0/28:BLACK Hu1/0/29:BLACK Hu1/0/30:BLACK Hu1/0/31:BLACK Hu1/0/32:BLACK Hu1/0/33:BLACK Hu1/0/34:BLACK Hu1/0/35:BLACK Hu1/0/36:BLACK Hu1/0/37:BLACK Hu1/0/38:BLACK Hu1/0/39:BLACK Hu1/0/40:BLACK Hu1/0/41:BLACK Hu1/0/42:BLACK Hu1/0/43:BLACK Hu1/0/44:BLACK Hu1/0/45:BLACK Hu1/0/46:BLACK Hu1/0/47:BLACK Hu1/0/48:BLACK
BEACON: BLACK
STATUS: GREEN
Line Card : 2
PORT STATUS: (48) Fo2/0/1:BLACK Fo2/0/2:GREEN Fo2/0/3:GREEN Fo2/0/4:GREEN Fo2/0/5:GREEN Fo2/0/6:GREEN Fo2/0/7:GREEN Fo2/0/8:GREEN Fo2/0/9:GREEN Fo2/0/10:GREEN Fo2/0/11:GREEN Fo2/0/12:GREEN Fo2/0/13:GREEN Fo2/0/14:GREEN Fo2/0/15:GREEN Fo2/0/16:GREEN Fo2/0/17:GREEN Fo2/0/18:GREEN Fo2/0/19:GREEN Fo2/0/20:GREEN Fo2/0/21:GREEN Fo2/0/22:GREEN Fo2/0/23:GREEN Fo2/0/24:BLACK Hu2/0/25:BLACK Hu2/0/26:BLACK Hu2/0/27:BLACK Hu2/0/28:BLACK Hu2/0/29:BLACK Hu2/0/30:BLACK Hu2/0/31:BLACK Hu2/0/32:BLACK Hu2/0/33:BLACK Hu2/0/34:BLACK Hu2/0/35:BLACK Hu2/0/36:BLACK Hu2/0/37:BLACK Hu2/0/38:BLACK Hu2/0/39:BLACK Hu2/0/40:BLACK Hu2/0/41:BLACK Hu2/0/42:BLACK Hu2/0/43:BLACK Hu2/0/44:BLACK Hu2/0/45:BLACK Hu2/0/46:BLACK Hu2/0/47:BLACK Hu2/0/48:BLACK
BEACON: BLACK
STATUS: GREEN
MODULE: slot 3
SUPERVISOR: ACTIVE
PORT STATUS: (0)
BEACON: BLACK
STATUS: GREEN
SYSTEM: GREEN
ACTIVE: GREEN
MODULE: slot 4
SUPERVISOR: STANDBY
PORT STATUS: (0)
BEACON: BLACK
STATUS: GREEN
SYSTEM: GREEN
ACTIVE: AMBER
Line Card : 5
PORT STATUS: (48) Twe5/0/1:BLACK Twe5/0/2:GREEN Twe5/0/3:GREEN Twe5/0/4:GREEN Twe5/0/5:GREEN Twe5/0/6:GREEN Twe5/0/7:GREEN Twe5/0/8:GREEN Twe5/0/9:GREEN Twe5/0/10:GREEN Twe5/0/11:GREEN Twe5/0/12:GREEN Twe5/0/13:GREEN Twe5/0/14:GREEN Twe5/0/15:GREEN Twe5/0/16:GREEN Twe5/0/17:GREEN Twe5/0/18:GREEN Twe5/0/19:GREEN Twe5/0/20:GREEN Twe5/0/21:GREEN Twe5/0/22:GREEN Twe5/0/23:GREEN Twe5/0/24:GREEN Twe5/0/25:GREEN Twe5/0/26:GREEN Twe5/0/27:GREEN Twe5/0/28:GREEN Twe5/0/29:GREEN Twe5/0/30:GREEN Twe5/0/31:GREEN Twe5/0/32:GREEN Twe5/0/33:GREEN Twe5/0/34:GREEN Twe5/0/35:GREEN Twe5/0/36:GREEN Twe5/0/37:GREEN Twe5/0/38:GREEN Twe5/0/39:GREEN Twe5/0/40:GREEN Twe5/0/41:GREEN Twe5/0/42:GREEN Twe5/0/43:GREEN Twe5/0/44:GREEN Twe5/0/45:GREEN Twe5/0/46:GREEN Twe5/0/47:BLACK Twe5/0/48:BLACK
BEACON: BLACK
STATUS: GREEN
Line Card : 6
PORT STATUS: (48) Twe6/0/1:BLACK Twe6/0/2:GREEN Twe6/0/3:GREEN Twe6/0/4:GREEN Twe6/0/5:GREEN Twe6/0/6:GREEN Twe6/0/7:GREEN Twe6/0/8:GREEN Twe6/0/9:GREEN Twe6/0/10:GREEN Twe6/0/11:GREEN Twe6/0/12:GREEN Twe6/0/13:GREEN Twe6/0/14:GREEN Twe6/0/15:GREEN Twe6/0/16:GREEN Twe6/0/17:GREEN Twe6/0/18:GREEN Twe6/0/19:GREEN Twe6/0/20:GREEN Twe6/0/21:GREEN Twe6/0/22:GREEN Twe6/0/23:GREEN Twe6/0/24:GREEN Twe6/0/25:GREEN Twe6/0/26:GREEN Twe6/0/27:GREEN Twe6/0/28:GREEN Twe6/0/29:GREEN Twe6/0/30:GREEN Twe6/0/31:GREEN Twe6/0/32:GREEN Twe6/0/33:GREEN Twe6/0/34:GREEN Twe6/0/35:GREEN Twe6/0/36:BLACK Twe6/0/37:BLACK Twe6/0/38:BLACK Twe6/0/39:BLACK Twe6/0/40:GREEN Twe6/0/41:GREEN Twe6/0/42:GREEN Twe6/0/43:GREEN Twe6/0/44:GREEN Twe6/0/45:GREEN Twe6/0/46:BLACK Twe6/0/47:BLACK Twe6/0/48:BLACK
BEACON: BLACK
STATUS: GREEN
RJ45 CONSOLE: GREEN
GigabitEthernet0/0 (MGMT): GREEN
TenGigabitEthernet0/1 (SFP MGMT): BLACK
FANTRAY STATUS: GREEN
FANTRAY BEACON: BLACK
Displaying On Board Failure Logging (OBFL) information
The OBFL feature records operating temperatures, hardware uptime, interrupts, and other important events and messages that can assist with diagnosing problems with line cards and supervisor modules installed in a switch. Data is logged to files stored in nonvolatile memory. When the onboard hardware is started up, a first record is made for each area monitored and becomes a base value for subsequent records. The OBFL feature provides a circular updating scheme for collecting continuous records and archiving older (historical) records, ensuring accurate data about the system. Data is recorded in one of two formats: continuous information that displays a snapshot of measurements and samples in a continuous file, and summary information that provides details about the data being collected. The data is displayed using the show logging onboard command. The message “No historical data to display” is seen when historical data is not available.
Device# show logging onboard RP active voltage detail
--------------------------------------------------------------------------------
VOLTAGE SUMMARY INFORMATION
--------------------------------------------------------------------------------
Number of sensors : 33
--------------------------------------------------------------------------------
Sensor ID Normal Range Maximum Sensor Value
--------------------------------------------------------------------------------
CPU_P5V 0 0 - 5 5
CPU_P3V3 1 0 - 5 3
CPU_P2V5_VPP 2 0 - 5 2
CPU_PVCCSCFUSESUS 3 0 - 5 1
CPU_PVCCIN 4 0 - 5 1
CPU_P1V5_PCH 5 0 - 5 1
CPU_PVCCKRHV 6 0 - 5 1
CPU_P1V2_VDDQ 7 0 - 5 1
CPU_P1V05_COMBINED 8 0 - 5 1
CPU_P0V6_VTT 9 0 - 5 1
BB_P1V0_BCM82752 10 0 - 5 3
BB_P3V3_A 11 0 - 5 12
BB_P12V0 12 0 - 12 12
BB_P7V0 13 0 - 7 7
BB_P5V0 14 0 - 5 5
BB_P1V5 15 0 - 5 3
BB_P3V3 16 0 - 5 3
BB_P2V5 17 0 - 5 2
BB_P1V8 18 0 - 5 1
BB_P0V9_DP0_PLL 19 0 - 5 0
BB_P0V9_DP1_PLL 20 0 - 5 0
BB_P0V9_DP2_PLL 21 0 - 5 0
BB_P0V8_DP0_VDD 22 0 - 5 0
BB_P0V8_DP1_VDD 23 0 - 5 0
BB_P0V8_DP2_VDD 24 0 - 5 0
BB_P0V9_DP0_AVDD 25 0 - 5 0
BB_P0V9_DP1_AVDD 26 0 - 5 0
BB_P0V9_DP2_AVDD 27 0 - 5 1
BB_P1V1_HATH 28 0 - 5 1
BB_P1V1_DP0_AVDDH 29 0 - 5 1
BB_P1V2_HATH 30 0 - 5 3
BB_3V3_IRC 31 0 - 5 3
BB_P3V3_EUSB 32 0 - 5 0
--------------------------------------------------------------------------------
Sensor Value
Total Time of each Sensor
--------------------------------------------------------------------------------
value: 0
0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 61d, 94d, 577h, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 61d, 112d, 112d, 112d, 112d, 112d, 112d, 112d, 112d, 50d, 0s, 0s, 0s, 0s, 112d,
value: 1
0s, 0s, 0s, 112d, 112d, 112d, 112d, 112d, 50d, 426h, 645h, 0s, 0s, 0s, 61d, 50d, 0s, 61d, 50d, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 61d, 112d, 112d, 50d, 0s, 0s,
value: 2
0s, 0s, 112d, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 61d, 50d, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s,
value: 3
0s, 112d, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 61d, 50d, 0s, 0s, 0s, 61d, 50d, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 61d, 112d, 0s,
value: 4
900h, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 160d, 43d, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s, 0s,
value: 5
<output truncated>
Emergency Actions
The chassis can power down a single card, providing a detailed response to over-temperature conditions on line cards. However, the chassis cannot safely operate when the temperature of the supervisor module itself exceeds the critical threshold. The supervisor module turns off the chassis’ power supplies to protect itself from overheating. When this happens, you can recover the switch only by cycling the power on and off switches on the power supplies or by cycling the AC or DC inputs to the power supplies.
Shutdown temperature emergencies on a supervisor will trigger chassis shutdown. Shutdown temperature emergencies on a linecard will shut down the linecard but not the chassis. Critical temperature emergencies will trigger a warning message and the fan will be at its highest speed, but the chassis will not shut down. This applies to all slots.
The following table lists temperature emergencies but does not distinguish between critical and shutdown emergencies.
Case 1. Complete fan failure emergency. |
SYSLOG message displays and the chassis shuts down. |
Case 2. Temperature emergency on a line card. |
Power down the line card. |
Case 3. Temperature emergency on a power supply. When the shutdown alarm threshold is exceeded, all the power supplies will shut down. |
Power cycle the device to recover from power supply shut down. |
Case 4. Temperature emergency on the active supervisor module. |
Power down the chassis. |
System Alarms
Any system has two types of alarms: major and minor. A major alarm indicates a critical problem that could lead to system shutdown. A minor alarm is informational—it alerts you to a problem that could become critical if corrective action is not taken.
The following table lists the possible environment alarms.
A temperature sensor over its warning threshold |
minor |
||
A temperature sensor over its critical threshold |
major |
||
A temperature sensor over its shutdown threshold |
major |
||
A partial fan failure |
minor |
||
A complete fan failure
|
major |
Fan failure alarms are issued as soon as the fan failure condition is detected and are canceled when the fan failure condition clears. Temperature alarms are issued as soon as the temperature reaches the threshold temperature. An LED on the supervisor module indicates whether an alarm has been issued.
When the system issues a major alarm, it starts a timer whose duration depends on the alarm. If the alarm is not canceled before the timer expires, the system takes emergency action to protect itself from the effects of overheating. The timer values and the emergency actions depend on the type of supervisor module.
Note |
Refer to the Hardware Installation Guide for information on LEDs, including the startup behavior of the supervisor module system LED. |
Event |
Alarm Type |
Supervisor LED Color |
Description and Action |
---|---|---|---|
Card temperature exceeds the critical threshold. |
Major |
Red |
Syslog message displays when the alarm is issued. |
Card temperature exceeds the shutdown threshold. |
Major |
Red |
Syslog message displays when the alarm is issued. |
Chassis temperature exceeds the warning threshold. |
Minor |
Orange |
Syslog message displays when the alarm is issued. |
Chassis fan tray experiences partial failure. |
Minor |
Orange |
Syslog message displays when the alarm is issued. |
Chassis fan tray experiences complete failure. |
Major |
Red |
Syslog message displays when the alarm is issued. |