Provided by: rocm-smi_6.1.2-1_amd64 bug

NAME

       rocm-smi - rocm-smi - a tool to monitor AMD accelerators and GPUs

DESCRIPTION

       usage:  rocm-smi  [-h]  [-V] [-d DEVICE [DEVICE ...]] [--alldevices] [--showhw] [-a] [-i] [-v] [-e [EVENT
       ...]]

              [--showdriverversion]   [--showtempgraph]    [--showfwinfo    [BLOCK    ...]]    [--showmclkrange]
              [--showmemvendor]    [--showsclkrange]    [--showproductname]    [--showserial]   [--showuniqueid]
              [--showvoltagerange]  [--showbus]  [--showpagesinfo]   [--showpendingpages]   [--showretiredpages]
              [--showunreservablepages]  [-f]  [-P] [-t] [-u] [--showmemuse] [--showvoltage] [-b] [-c] [-g] [-l]
              [-M]  [-m]  [-o]  [-p]  [-S]  [-s]  [--showmeminfo  TYPE  [TYPE   ...]]   [--showpids   [VERBOSE]]
              [--showpidgpus   [SHOWPIDGPUS   ...]]    [--showreplaycount]   [--showrasinfo  [SHOWRASINFO  ...]]
              [--showvc] [--showxgmierr]  [--showtopo]  [--showtopoaccess]  [--showtopoweight]  [--showtopohops]
              [--showtopotype]  [--showtoponuma]  [--showenergycounter] [--shownodesbw] [--showcomputepartition]
              [--showmemorypartition]    [-r]     [--resetfans]     [--resetprofile]     [--resetpoweroverdrive]
              [--resetxgmierr]   [--resetperfdeterminism]   [--resetcomputepartition]   [--resetmemorypartition]
              [--setclock TYPE LEVEL] [--setsclk LEVEL [LEVEL ...]]  [--setmclk LEVEL  [LEVEL  ...]]  [--setpcie
              LEVEL  [LEVEL ...]] [--setslevel SCLKLEVEL SCLK SVOLT] [--setmlevel MCLKLEVEL MCLK MVOLT] [--setvc
              POINT SCLK SVOLT] [--setsrange SCLKMIN SCLKMAX] [--setextremum min|max sclk|mclk CLK] [--setmrange
              MCLKMIN MCLKMAX] [--setfan LEVEL] [--setperflevel LEVEL] [--setoverdrive %] [--setmemoverdrive  %]
              [--setpoweroverdrive     WATTS]     [--setprofile    SETPROFILE]    [--setperfdeterminism    SCLK]
              [--setcomputepartition      {CPX,SPX,DPX,TPX,QPX,cpx,spx,dpx,tpx,qpx}]       [--setmemorypartition
              {NPS1,NPS2,NPS4,NPS8,nps1,nps2,nps4,nps8}]   [--rasenable   BLOCK   ERRTYPE]  [--rasdisable  BLOCK
              ERRTYPE] [--rasinject BLOCK] [--gpureset] [--load FILE |  --save  FILE]  [--autorespond  RESPONSE]
              [--loglevel LEVEL] [--json] [--csv]

       AMD ROCm System Management Interface | ROCM-SMI version: 2.2.0

   options:
       -h, --help
              show this help message and exit

       --gpureset
              Reset specified GPU (One GPU must be specified)

       --load FILE
              Load Clock, Fan, Performance and Profile settings from FILE

       --save FILE
              Save Clock, Fan, Performance and Profile settings to FILE

       -V, --version
              Show version information

       -d DEVICE [DEVICE ...], --device DEVICE [DEVICE ...]
              Execute command on specified device

   Display Options:

       --alldevices

       --showhw
              Show Hardware details

       -a, --showallinfo
              Show Temperature, Fan and Clock values

   Topology:
       -i, --showid
              Show DEVICE IDs

       -v, --showvbios
              Show VBIOS version

       -e [EVENT ...], --showevents [EVENT ...]
              Show event list

       --showdriverversion
              Show kernel driver version

       --showtempgraph
              Show Temperature Graph

       --showfwinfo [BLOCK ...]
              Show FW information

       --showmclkrange
              Show mclk range

       --showmemvendor
              Show GPU memory vendor

       --showsclkrange
              Show sclk range

       --showproductname
              Show product details

       --showserial
              Show GPU's Serial Number

       --showuniqueid
              Show GPU's Unique ID

       --showvoltagerange
              Show voltage range

       --showbus
              Show PCI bus number

   Pages information:
       --showpagesinfo
              Show retired, pending and unreservable pages

       --showpendingpages
              Show pending retired pages

       --showretiredpages
              Show retired pages

       --showunreservablepages
              Show unreservable pages

   Hardware-related information:
       -f, --showfan
              Show current fan speed

       -P, --showpower
              Show current average or instant socket graphics package power consumption

       -t, --showtemp
              Show current temperature

       -u, --showuse
              Show current GPU use

       --showmemuse
              Show current GPU memory used

       --showvoltage
              Show current GPU voltage

   Software-related/controlled information:
       -b, --showbw
              Show estimated PCIe use

       -c, --showclocks
              Show current clock frequencies

       -g, --showgpuclocks
              Show current GPU clock frequencies

       -l, --showprofile
              Show Compute Profile attributes

       -M, --showmaxpower
              Show maximum graphics package power this GPU will consume

       -m, --showmemoverdrive
              Show current GPU Memory Clock OverDrive level

       -o, --showoverdrive
              Show current GPU Clock OverDrive level

       -p, --showperflevel
              Show current DPM Performance Level

       -S, --showclkvolt
              Show supported GPU and Memory Clocks and Voltages

       -s, --showclkfrq
              Show supported GPU and Memory Clock

       --showmeminfo TYPE [TYPE ...]
              Show Memory usage information for given block(s) TYPE

       --showpids [VERBOSE]
              Show current running KFD PIDs (pass details to VERBOSE for detailed information)

       --showpidgpus [SHOWPIDGPUS ...]
              Show GPUs used by specified KFD PIDs (all if no arg given)

       --showreplaycount
              Show PCIe Replay Count

       --showrasinfo [SHOWRASINFO ...]
              Show RAS enablement information and error counts for the specified block(s) (all if no arg given)

       --showvc
              Show voltage curve

       --showxgmierr
              Show XGMI error information since last read

       --showtopo
              Show hardware topology information

       --showtopoaccess
              Shows the link accessibility between GPUs

       --showtopoweight
              Shows the relative weight between GPUs

       --showtopohops
              Shows the number of hops between GPUs

       --showtopotype
              Shows the link type between GPUs

       --showtoponuma
              Shows the numa nodes

       --showenergycounter
              Energy accumulator that stores amount of energy consumed

       --shownodesbw
              Shows the numa nodes

       --showcomputepartition
              Shows current compute partitioning

       --showmemorypartition
              Shows current memory partition

   Set options:
       --setclock TYPE LEVEL
              Set Clock Frequency Level(s) for specified clock (requires manual Perf level)

       --setsclk LEVEL [LEVEL ...]
              Set GPU Clock Frequency Level(s) (requires manual Perf level)

       --setmclk LEVEL [LEVEL ...]
              Set GPU Memory Clock Frequency Level(s) (requires manual Perf level)

       --setpcie LEVEL [LEVEL ...]
              Set PCIE Clock Frequency Level(s) (requires manual Perf level)

       --setslevel SCLKLEVEL SCLK SVOLT
              Change GPU Clock frequency (MHz) and Voltage (mV) for a specific Level

       --setmlevel MCLKLEVEL MCLK MVOLT
              Change GPU Memory clock frequency (MHz) and Voltage for (mV) a specific Level

       --setvc POINT SCLK SVOLT
              Change SCLK Voltage Curve (MHz mV) for a specific point

       --setsrange SCLKMIN SCLKMAX
              Set min and max SCLK speed

       --setextremum min|max sclk|mclk CLK
              Set min/max of SCLK/MCLK speed

       --setmrange MCLKMIN MCLKMAX
              Set min and max MCLK speed

       --setfan LEVEL
              Set GPU Fan Speed (Level or %)

       --setperflevel LEVEL
              Set Performance Level

       --setoverdrive %
              Set GPU OverDrive level (requires manual|high Perf level)

       --setmemoverdrive %
              Set GPU Memory Overclock OverDrive level (requires manual|high Perf level)

       --setpoweroverdrive WATTS
              Set the maximum GPU power using Power OverDrive in Watts

       --setprofile SETPROFILE
              Specify  Power  Profile  level  (#)  or  a quoted string of CUSTOM Profile attributes "# # # #..."
              (requires manual Perf level)

       --setperfdeterminism SCLK
              Set clock frequency limit to get minimal performance variation

       --setcomputepartition {CPX,SPX,DPX,TPX,QPX,cpx,spx,dpx,tpx,qpx}
              Set compute partition

       --setmemorypartition {NPS1,NPS2,NPS4,NPS8,nps1,nps2,nps4,nps8}
              Set memory partition

       --rasenable BLOCK ERRTYPE
              Enable RAS for specified block and error type

       --rasdisable BLOCK ERRTYPE
              Disable RAS for specified block and error type

       --rasinject BLOCK
              Inject RAS poison for specified block (ONLY WORKS ON UNSECURED BOARDS)

   Reset options:
       -r, --resetclocks
              Reset clocks and OverDrive to default

       --resetfans
              Reset fans to automatic (driver) control

       --resetprofile
              Reset Power Profile back to default

       --resetpoweroverdrive
              Set the maximum GPU power back to the device default state

       --resetxgmierr
              Reset XGMI error count

       --resetperfdeterminism
              Disable performance determinism

       --resetcomputepartition
              Resets to boot compute partition state

       --resetmemorypartition
              Resets to boot memory partition state

   Auto-response options:
       --autorespond RESPONSE
              Response to automatically provide for all prompts (NOT RECOMMENDED)

   Output options:
       --loglevel LEVEL
              How   much   output   will    be    printed    for    what    program    is    doing,    one    of
              debug/info/warning/error/critical

       --json Print output in JSON format

       --csv  Print output in CSV format

SEE ALSO

       The full documentation for rocm-smi is maintained as a Texinfo manual.  If the info and rocm-smi programs
       are properly installed at your site, the command

              info rocm-smi

       should give you access to the complete manual.

rocm-smi 2.2.0                                    November 2024                                      ROCM-SMI(1)