NVIDIA NVOS User Manual for InfiniBand Switches

Health Monitoring Commands

nv show system health


nv show system health

Show system health status. 

Syntax Description

N/A


Default

N/A

History

25.02.2002 

Example

admin@nvos:~$ nv show system health
            operational  applied
----------  -----------  -------
status      Not OK
status-led  off



Health issues
================
    Component  Status information
    ---------  ------------------
    LEAKAGE-1  detected leakage
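
For scripted monitoring, the text output above can be reduced to a small data structure. The following is a minimal Python sketch, assuming the output has been captured as a string and keeps the layout shown in the example. The function name `parse_health` and the shape of the returned dictionary are illustrative and not part of NVOS. The sketch does not try to separate the operational and applied columns; it takes everything after the field name as the value.

```python
SAMPLE = """\
            operational  applied
----------  -----------  -------
status      Not OK
status-led  off

Health issues
================
    Component  Status information
    ---------  ------------------
    LEAKAGE-1  detected leakage
"""


def parse_health(output):
    """Parse `nv show system health` text into status fields and an issue list."""
    result = {"status": None, "status-led": None, "issues": []}
    in_issues = False   # True once the "Health issues" heading has been seen
    past_rule = False   # True once the dashed rule under the issue header is passed
    for line in output.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        if stripped.lower().startswith("health issues"):
            in_issues = True
            continue
        if in_issues:
            if set(stripped) <= set("-= "):
                # "====" underlines the heading; "----" closes the column header
                past_rule = past_rule or stripped.startswith("-")
                continue
            if past_rule:
                component, _, info = stripped.partition(" ")
                result["issues"].append((component, info.strip()))
            continue
        # Summary rows: the first token is the field name, the rest is its value
        first = stripped.split()[0]
        if first in ("status", "status-led"):
            result[first] = stripped[len(first):].strip()
    return result


health = parse_health(SAMPLE)
```

A caller could then raise an alert whenever `health["status"]` is not the healthy value and list `health["issues"]` in the alert body.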

REST API

GET https://<ip>/nvue_v1/system/health

Related Commands

nv show system health history

Notes


nv show system health history


nv show system health history [file-name]

Show system health history file.

Syntax Description

file-name

Show health history files in the system

History

25.02.2002 

Example

admin@nvos:~$ nv show system health history

admin@nvos:~$ nv show system health history files

health history reports  File path
----------------------  -------------------------
health_history          /var/log/health_history
health_history.1        /var/log/health_history.1

admin@nvos:~$ nv show system health history files health_history
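
The file listing in the example above maps report names to paths, which is convenient to have as a dictionary when a script needs to pick a rotated file. The following is a minimal Python sketch, assuming the listing has been captured as text in the two-column layout shown. The function name `parse_history_files` is hypothetical.

```python
SAMPLE = """\
health history reports  File path
----------------------  -------------------------
health_history          /var/log/health_history
health_history.1        /var/log/health_history.1
"""


def parse_history_files(output):
    """Map each health history report name to its file path."""
    files = {}
    past_rule = False  # rows only count after the dashed rule under the header
    for line in output.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        if set(stripped) <= set("- "):
            past_rule = True
            continue
        if past_rule:
            parts = stripped.split()
            # A data row is exactly "<name> <absolute path>"
            if len(parts) == 2 and parts[1].startswith("/"):
                files[parts[0]] = parts[1]
    return files


history_files = parse_history_files(SAMPLE)
```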

REST API

GET https://<ip>/nvue_v1/system/health/history

Related Commands

nv show system health

Notes

  • When the command is run via the CLI, the file opens in “less”.

  • When no file name is given, the default file that opens is “health_history”.

nv show system health component


nv show system health component

Display all system health components with their instances and health state.

Syntax Description

N/A


Default

N/A

History


Updated output and added note

Example

admin@nvos:~$ nv show system health component

Component       Instance   State    Last Unhealthy  Unhealthy Count
--------------  ---------  -------  --------------  ---------------
asic            ASIC1      HEALTHY                  0
                ASIC2      HEALTHY                  0
cpu             ALL        HEALTHY                  0
leakage-sensor  LEAKAGE-1  HEALTHY                  0
                LEAKAGE-2  HEALTHY                  0
software        ALL        HEALTHY                  0
switch          ALL        HEALTHY                  0

REST API

GET https://<ip>/nvue_v1/system/health/component

Related Commands

nv action clear system health component

Notes

Components with multiple hardware units (asic, fan, transceiver, leakage-sensor, psu) show individual instances. Single instance components (cpu, software, switch) show ALL as the instance name.
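
Because the component name is printed only on the first row of each group, a script reading this table has to carry the name forward to the following instance rows. The following is a minimal Python sketch of that idea, assuming the five-column layout shown in the example and using the dashed rule to locate column offsets, so that an empty Last Unhealthy cell does not shift the remaining fields. The function name `parse_component_table` and the dictionary keys are illustrative.

```python
import re

SAMPLE = """\
Component       Instance   State    Last Unhealthy  Unhealthy Count
--------------  ---------  -------  --------------  ---------------
asic            ASIC1      HEALTHY                  0
                ASIC2      HEALTHY                  0
cpu             ALL        HEALTHY                  0
"""


def parse_component_table(output):
    """Return one dict per instance row, filling in omitted component names."""
    lines = output.splitlines()
    # The dashed rule marks where each column starts
    rule_idx = next(i for i, text in enumerate(lines)
                    if text.strip() and set(text.strip()) <= set("- "))
    starts = [m.start() for m in re.finditer(r"-+", lines[rule_idx])]
    ends = starts[1:] + [None]  # the last column runs to the end of the line
    rows, component = [], ""
    for line in lines[rule_idx + 1:]:
        if not line.strip():
            break  # a blank line ends the table
        name, instance, state, last, count = (
            line[s:e].strip() for s, e in zip(starts, ends))
        component = name or component  # blank cell: same component as the row above
        rows.append({
            "component": component,
            "instance": instance,
            "state": state,
            "last_unhealthy": last or None,
            "unhealthy_count": int(count) if count else 0,
        })
    return rows


rows = parse_component_table(SAMPLE)
```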

nv show system health component id


nv show system health component <component-id>

Display system component health for a specific component.

Syntax Description

component-id

Component name (e.g., PSU, Fan, Software)

Default

N/A

History

25.02.7002

Example

admin@nvos:~$ nv show system health component Fan

-------------------------   ------------------------ 
state                       HEALTHY
last-unhealthy              2025-01-01 18:00:10
unhealthy-count             1

admin@nvos:~$ nv show system health component asic

instances
============
    Instance  State    Last Unhealthy  Unhealthy Count
    --------  -------  --------------  ---------------
    ASIC1     HEALTHY                  0
    ASIC2     HEALTHY                  0

REST API

GET https://<ip>/nvue_v1/system/health/component/<component-id>

Related Commands

nv action clear system health component

Notes


nv show system health component instance

 

nv show system health component <component-id> instance <instance-id>

Display detailed health information for a specific component instance.

Syntax Description

component-id

Component name (e.g., PSU, Fan, Software)

instance-id

Instance name as listed by nv show system health component (e.g., ASIC1, ALL)

Default

N/A

History

 

25.02.8008

Example

admin@nvos:~$ nv show system health component asic instance ASIC1


                 operational
---------------  -----------
state            HEALTHY
last-unhealthy
unhealthy-count  0 

admin@nvos:~$ nv show system health component cpu instance ALL
 
                 operational
---------------  -----------
state            HEALTHY
last-unhealthy
unhealthy-count  0
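
The per-instance output above is a simple field/value listing in which last-unhealthy may be empty. The following is a minimal Python sketch for reading it, assuming the captured text excludes the prompt line and keeps the layout shown. The function name `parse_fields` is hypothetical.

```python
SAMPLE = """\
                 operational
---------------  -----------
state            HEALTHY
last-unhealthy
unhealthy-count  0
"""


def parse_fields(output):
    """Parse a field/value listing into a dict; empty values become ''."""
    fields = {}
    for line in output.splitlines():
        stripped = line.strip()
        # Skip blank lines, dashed rules, and the indented column header
        if not stripped or set(stripped) <= set("- ") or line[0].isspace():
            continue
        key, _, value = stripped.partition(" ")
        fields[key] = value.strip()  # keeps spaces inside values such as timestamps
    return fields


instance = parse_fields(SAMPLE)
```

The same function also handles a populated timestamp such as `2025-01-01 18:00:10`, since only the first run of spaces separates the field name from its value.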


REST API

GET https://<ip>/nvue_v1/system/health/component/<component-id>/instance/<instance-id>
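
When the endpoint is called from a script, the path can be assembled from the component and instance names. The following is a minimal Python sketch that only builds the URL string; authentication, TLS settings, and the format of the response are not described in this section and are left out. The function name `instance_health_url` and the address `192.0.2.10` are placeholders.

```python
from urllib.parse import quote


def instance_health_url(host, component_id, instance_id):
    """Build the GET URL for one component instance, following the pattern above."""
    if not component_id or not instance_id:
        raise ValueError("component_id and instance_id are both required")
    # Percent-encode each name so unexpected characters cannot alter the path
    return (
        f"https://{host}/nvue_v1/system/health/component/"
        f"{quote(component_id, safe='')}/instance/{quote(instance_id, safe='')}"
    )


url = instance_health_url("192.0.2.10", "asic", "ASIC1")
```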

Related Commands

nv action clear system health component

Notes

 

nv action clear system health component


nv action clear system health component

Clear the unhealthy information of system components.

Syntax Description

N/A


Default

N/A

History

25.02.7002

Example

admin@nvos:~$ nv action clear system health component

Action executing ...
Cleared unhealthy information successfully
Action succeeded

REST API

POST https://<ip>/nvue_v1/system/health/component

Related Commands


Notes

The clear command resets only the last-unhealthy and unhealthy-count fields. The state field is not reset.
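
The effect described in this note can be illustrated with a small model. The following Python sketch is not the NVOS implementation; it only mirrors the documented behavior on an in-memory record. The class `InstanceHealth` and the function `clear_unhealthy_info` are hypothetical names, and the sample values are taken from the Fan example earlier in this chapter.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class InstanceHealth:
    state: str
    last_unhealthy: Optional[str] = None
    unhealthy_count: int = 0


def clear_unhealthy_info(records):
    """Reset the history fields of every record; leave the state as it is."""
    for record in records:
        record.last_unhealthy = None
        record.unhealthy_count = 0


fan = InstanceHealth(state="HEALTHY",
                     last_unhealthy="2025-01-01 18:00:10",
                     unhealthy_count=1)
clear_unhealthy_info([fan])
```

After the call, `fan.state` still holds its previous value, while `fan.last_unhealthy` is empty and `fan.unhealthy_count` is 0.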
