NetScaler -13.1 56.18 - Non-recoverable : potential damage: system hardware in jeopardy or damaged

NetScaler -13.1 56.18 - Non-recoverable : potential damage: system hardware in jeopardy or damaged

book

Article ID: CTX692891

calendar_today

Updated On:

Description

The Citrix NetScaler SDX 16000 appliance may exhibit the following symptoms:
 
The appliance reports critical voltage errors in the ns.log and system message logs, indicating potential hardware damage. 

Specific Log Entries: The following log entries, or similar variations, are observed: 

Mar  9 19:45:54 <local0.crit> hostname svm_event: xxx.xxx.xxx.xxx : EVENT VOLTAGEERROR : xxx.xxx.xxx.xxx:HealthMonitoring:Vcpu1VCCIN - Vcpu1VCCIN : Non-recoverable : (potential damage: system hardware in jeopardy or damaged)
Mar 10 21:41:29 <local0.crit> hostname svm_event: xxx.xxx.xxx.xxx : EVENT VOLTAGEERROR : xxx.xxx.xxx.xxx:HealthMonitoring:Vcpu1VCCIN - Vcpu1VCCIN : Non-recoverable : (potential damage: system hardware in jeopardy or damaged)
Mar 11 02:35:05 <local0.crit> hostname svm_event:xxx.xxx.xxx.xxx : EVENT VOLTAGEERROR : 169.254.0.1:HealthMonitoring:Vcpu1VCCIN - Vcpu1VCCIN : Non-recoverable : (potential damage: system hardware in jeopardy or damaged)
Mar 11 05:36:12 <local0.crit> hostname svm_event: xxx.xxx.xxx.xxx : EVENT VOLTAGEERROR : 169.254.0.1:HealthMonitoring:Vcpu2VDDQEFGH - Vcpu2VDDQEFGH : Non-recoverable : (potential damage: system hardware in jeopardy or damaged)
 
IPMI Log Information: IPMI tool logs show the following device information:
 
Device ID                 : 32
Device Revision           : 1
Firmware Revision         : 2.13   -------> LOM version
IPMI Version              : 2.0
Manufacturer ID           : 10876
Manufacturer Name         : Supermicro
Product ID                : 6983 (0x1b47)
Product Name              : Unknown (0x1B47)
Device Available          : yes
Provides Device SDRs      : no
Additional Device Support :
    Sensor Device
    SDR Repository Device
    SEL Device
    FRU Inventory Device
    IPMB Event Receiver
    IPMB Event Generator
    Chassis Device
Aux Firmware Rev Info     :
    0x12
    0x01
    0x00
    0x00
 
Stable Voltage Readings: Despite the reported voltage errors, sensor readings from the IPMI tool indicate that the actual voltage supplied to the appliance is within acceptable ranges.
 
Vcpu1VDDQABCD    | 1.192      | Volts      | ok    | 1.024     | 1.048     | 1.096     | 1.344     | 1.368     | 1.400
Vcpu1VDDQABCD    | 1.192      | Volts      | ok    | 1.024     | 1.048     | 1.096     | 1.344     | 1.368     | 1.400
Vcpu1VDDQABCD    | 1.192      | Volts      | ok    | 1.024     | 1.048     | 1.096     | 1.344     | 1.368     | 1.400
Vcpu1VDDQABCD    | 1.200      | Volts      | ok    | 1.024     | 1.048     | 1.096     | 1.344     | 1.368     | 1.400

Resolution

Follow below troubleshooting steps to resolve this:

1. Firmware Upgrade:
The primary solution is to upgrade the LOM firmware to version 3.11. This upgrade has been specifically designed to address the reported issue. Please follow your organization's standard procedures for firmware updates.

2. Power Cycle:
If the firmware upgrade does not resolve the issue, a power cycle may be necessary.
Carefully disconnect the power cable from the affected device.
Wait for a minimum of five (5) minutes.
Reconnect the power cable.

3. Contact Technical Support:
If the issue persists after both the firmware upgrade and power cycle, it indicates a potential hardware failure. Please contact technical support for further diagnosis and to initiate the RMA process.