1
0
Fork 0

Adding upstream version 1.34.4.

Signed-off-by: Daniel Baumann <daniel@debian.org>
This commit is contained in:
Daniel Baumann 2025-05-24 07:26:29 +02:00
parent e393c3af3f
commit 4978089aab
Signed by: daniel
GPG key ID: FBB4F0E80A80222F
4963 changed files with 677545 additions and 0 deletions

View file

@ -0,0 +1,110 @@
# AMD ROCm System Management Interface (SMI) Input Plugin
This plugin gathers statistics including memory and GPU usage, temperatures
etc from [AMD ROCm platform][amd_rocm] GPUs.
> [!IMPORTANT]
> The [`rocm-smi` binary][binary] is required and needs to be installed on the
> system.
⭐ Telegraf v1.20.0
🏷️ hardware, system
💻 all
[amd_rocm]: https://rocm.docs.amd.com/
[binary]: https://github.com/RadeonOpenCompute/rocm_smi_lib/tree/master/python_smi_tools
## Global configuration options <!-- @/docs/includes/plugin_config.md -->
In addition to the plugin-specific configuration settings, plugins support
additional global and plugin configuration settings. These settings are used to
modify metrics, tags, and field or create aliases and configure ordering, etc.
See the [CONFIGURATION.md][CONFIGURATION.md] for more details.
[CONFIGURATION.md]: ../../../docs/CONFIGURATION.md#plugins
## Startup error behavior options
In addition to the plugin-specific and global configuration settings the plugin
supports options for specifying the behavior when experiencing startup errors
using the `startup_error_behavior` setting. Available values are:
- `error`: Telegraf with stop and exit in case of startup errors. This is the
default behavior.
- `ignore`: Telegraf will ignore startup errors for this plugin and disables it
but continues processing for all other plugins.
- `retry`: NOT AVAILABLE
## Configuration
```toml @sample.conf
# Query statistics from AMD Graphics cards using rocm-smi binary
[[inputs.amd_rocm_smi]]
## Optional: path to rocm-smi binary, defaults to $PATH via exec.LookPath
# bin_path = "/opt/rocm/bin/rocm-smi"
## Optional: timeout for GPU polling
# timeout = "5s"
```
## Metrics
- measurement: `amd_rocm_smi`
- tags
- `name` (entry name assigned by rocm-smi executable)
- `gpu_id` (id of the GPU according to rocm-smi)
- `gpu_unique_id` (unique id of the GPU)
- fields
- `driver_version` (integer)
- `fan_speed` (integer)
- `memory_total` (integer, B)
- `memory_used` (integer, B)
- `memory_free` (integer, B)
- `temperature_sensor_edge` (float, Celsius)
- `temperature_sensor_junction` (float, Celsius)
- `temperature_sensor_memory` (float, Celsius)
- `utilization_gpu` (integer, percentage)
- `utilization_memory` (integer, percentage)
- `clocks_current_sm` (integer, Mhz)
- `clocks_current_memory` (integer, Mhz)
- `clocks_current_display` (integer, Mhz)
- `clocks_current_fabric` (integer, Mhz)
- `clocks_current_system` (integer, Mhz)
- `power_draw` (float, Watt)
- `card_series` (string)
- `card_model` (string)
- `card_vendor` (string)
## Troubleshooting
Check the full output by running `rocm-smi` binary manually.
Linux:
```sh
rocm-smi rocm-smi -o -l -m -M -g -c -t -u -i -f -p -P -s -S -v --showreplaycount --showpids --showdriverversion --showmemvendor --showfwinfo --showproductname --showserial --showuniqueid --showbus --showpendingpages --showpagesinfo --showretiredpages --showunreservablepages --showmemuse --showvoltage --showtopo --showtopoweight --showtopohops --showtopotype --showtoponuma --showmeminfo all --json
```
Please include the output of this command if opening a GitHub issue, together
with ROCm version.
## Example Output
```text
amd_rocm_smi,gpu_id=0x6861,gpu_unique_id=0x2150e7d042a1124,host=ali47xl,name=card0 clocks_current_memory=167i,clocks_current_sm=852i,driver_version=51114i,fan_speed=14i,memory_free=17145282560i,memory_total=17163091968i,memory_used=17809408i,power_draw=7,temperature_sensor_edge=28,temperature_sensor_junction=29,temperature_sensor_memory=92,utilization_gpu=0i 1630572551000000000
amd_rocm_smi,gpu_id=0x6861,gpu_unique_id=0x2150e7d042a1124,host=ali47xl,name=card0 clocks_current_memory=167i,clocks_current_sm=852i,driver_version=51114i,fan_speed=14i,memory_free=17145282560i,memory_total=17163091968i,memory_used=17809408i,power_draw=7,temperature_sensor_edge=29,temperature_sensor_junction=30,temperature_sensor_memory=91,utilization_gpu=0i 1630572701000000000
amd_rocm_smi,gpu_id=0x6861,gpu_unique_id=0x2150e7d042a1124,host=ali47xl,name=card0 clocks_current_memory=167i,clocks_current_sm=852i,driver_version=51114i,fan_speed=14i,memory_free=17145282560i,memory_total=17163091968i,memory_used=17809408i,power_draw=7,temperature_sensor_edge=29,temperature_sensor_junction=29,temperature_sensor_memory=92,utilization_gpu=0i 1630572749000000000
```
## Limitations and notices
Please notice that this plugin has been developed and tested on a limited number
of versions and small set of GPUs. Currently the latest ROCm version tested is
4.3.0. Notice that depending on the device and driver versions the amount of
information provided by `rocm-smi` can vary so that some fields would start/stop
appearing in the metrics upon updates. The `rocm-smi` JSON output is not
perfectly homogeneous and is possibly changing in the future, hence parsing and
unmarshalling can start failing upon updating ROCm.
Inspired by the current state of the art of the `nvidia-smi` plugin.

View file

@ -0,0 +1,305 @@
//go:generate ../../../tools/readme_config_includer/generator
package amd_rocm_smi
import (
_ "embed"
"encoding/json"
"fmt"
"os"
"os/exec"
"strconv"
"strings"
"time"
"github.com/influxdata/telegraf"
"github.com/influxdata/telegraf/config"
"github.com/influxdata/telegraf/internal"
"github.com/influxdata/telegraf/plugins/inputs"
)
//go:embed sample.conf
var sampleConfig string
const measurement = "amd_rocm_smi"
type ROCmSMI struct {
BinPath string `toml:"bin_path"`
Timeout config.Duration `toml:"timeout"`
Log telegraf.Logger `toml:"-"`
}
type gpu struct {
DeviceID string `json:"Device ID"`
GpuID string `json:"GPU ID"`
GpuUniqueID string `json:"Unique ID"`
GpuVBIOSVersion string `json:"VBIOS version"`
GpuTemperatureSensorEdge string `json:"Temperature (Sensor edge) (C)"`
GpuTemperatureSensorJunction string `json:"Temperature (Sensor junction) (C)"`
GpuTemperatureSensorMemory string `json:"Temperature (Sensor memory) (C)"`
GpuDcefClkClockSpeed string `json:"dcefclk clock speed:"`
GpuDcefClkClockLevel string `json:"dcefclk clock level:"`
GpuFclkClockSpeed string `json:"fclk clock speed:"`
GpuFclkClockLevel string `json:"fclk clock level:"`
GpuMclkClockSpeed string `json:"mclk clock speed:"`
GpuMclkClockLevel string `json:"mclk clock level:"`
GpuSclkClockSpeed string `json:"sclk clock speed:"`
GpuSclkClockLevel string `json:"sclk clock level:"`
GpuSocclkClockSpeed string `json:"socclk clock speed:"`
GpuSocclkClockLevel string `json:"socclk clock level:"`
GpuPcieClock string `json:"pcie clock level"`
GpuFanSpeedLevel string `json:"Fan speed (level)"`
GpuFanSpeedPercentage string `json:"Fan speed (%)"`
GpuFanRPM string `json:"Fan RPM"`
GpuPerformanceLevel string `json:"Performance Level"`
GpuOverdrive string `json:"GPU OverDrive value (%)"`
GpuMaxPower string `json:"Max Graphics Package Power (W)"`
GpuAveragePower string `json:"Average Graphics Package Power (W)"`
GpuUsePercentage string `json:"GPU use (%)"`
GpuMemoryAllocatedPercentage string `json:"GPU Memory Allocated (VRAM%)"`
GpuMemoryUsePercentage string `json:"GPU memory use (%)"`
GpuMemoryVendor string `json:"GPU memory vendor"`
GpuPCIeReplay string `json:"PCIe Replay Count"`
GpuSerialNumber string `json:"Serial Number"`
GpuVoltagemV string `json:"Voltage (mV)"`
GpuPCIBus string `json:"PCI Bus"`
GpuASDDirmware string `json:"ASD firmware version"`
GpuCEFirmware string `json:"CE firmware version"`
GpuDMCUFirmware string `json:"DMCU firmware version"`
GpuMCFirmware string `json:"MC firmware version"`
GpuMEFirmware string `json:"ME firmware version"`
GpuMECFirmware string `json:"MEC firmware version"`
GpuMEC2Firmware string `json:"MEC2 firmware version"`
GpuPFPFirmware string `json:"PFP firmware version"`
GpuRLCFirmware string `json:"RLC firmware version"`
GpuRLCSRLC string `json:"RLC SRLC firmware version"`
GpuRLCSRLG string `json:"RLC SRLG firmware version"`
GpuRLCSRLS string `json:"RLC SRLS firmware version"`
GpuSDMAFirmware string `json:"SDMA firmware version"`
GpuSDMA2Firmware string `json:"SDMA2 firmware version"`
GpuSMCFirmware string `json:"SMC firmware version"`
GpuSOSFirmware string `json:"SOS firmware version"`
GpuTARAS string `json:"TA RAS firmware version"`
GpuTAXGMI string `json:"TA XGMI firmware version"`
GpuUVDFirmware string `json:"UVD firmware version"`
GpuVCEFirmware string `json:"VCE firmware version"`
GpuVCNFirmware string `json:"VCN firmware version"`
GpuCardSeries string `json:"Card series"`
GpuCardModel string `json:"Card model"`
GpuCardVendor string `json:"Card vendor"`
GpuCardSKU string `json:"Card SKU"`
GpuNUMANode string `json:"(Topology) Numa Node"`
GpuNUMAAffinity string `json:"(Topology) Numa Affinity"`
GpuVisVRAMTotalMemory string `json:"VIS_VRAM Total Memory (B)"`
GpuVisVRAMTotalUsedMemory string `json:"VIS_VRAM Total Used Memory (B)"`
GpuVRAMTotalMemory string `json:"VRAM Total Memory (B)"`
GpuVRAMTotalUsedMemory string `json:"VRAM Total Used Memory (B)"`
GpuGTTTotalMemory string `json:"GTT Total Memory (B)"`
GpuGTTTotalUsedMemory string `json:"GTT Total Used Memory (B)"`
}
type sysInfo struct {
DriverVersion string `json:"Driver version"`
}
type metric struct {
tags map[string]string
fields map[string]interface{}
}
func (*ROCmSMI) SampleConfig() string {
return sampleConfig
}
func (rsmi *ROCmSMI) Start(telegraf.Accumulator) error {
if _, err := os.Stat(rsmi.BinPath); os.IsNotExist(err) {
binPath, err := exec.LookPath("rocm-smi")
if err != nil {
return &internal.StartupError{Err: err}
}
rsmi.BinPath = binPath
}
return nil
}
func (rsmi *ROCmSMI) Gather(acc telegraf.Accumulator) error {
data, err := rsmi.pollROCmSMI()
if err != nil {
return fmt.Errorf("failed to execute command in pollROCmSMI: %w", err)
}
return gatherROCmSMI(data, acc)
}
func (*ROCmSMI) Stop() {}
func (rsmi *ROCmSMI) pollROCmSMI() ([]byte, error) {
// Construct and execute metrics query, there currently exist (ROCm v4.3.x) a "-a" option
// that does not provide all the information, so each needed parameter is set manually
cmd := exec.Command(rsmi.BinPath,
"-o",
"-l",
"-m",
"-M",
"-g",
"-c",
"-t",
"-u",
"-i",
"-f",
"-p",
"-P",
"-s",
"-S",
"-v",
"--showreplaycount",
"--showpids",
"--showdriverversion",
"--showmemvendor",
"--showfwinfo",
"--showproductname",
"--showserial",
"--showuniqueid",
"--showbus",
"--showpendingpages",
"--showpagesinfo",
"--showmeminfo",
"all",
"--showretiredpages",
"--showunreservablepages",
"--showmemuse",
"--showvoltage",
"--showtopo",
"--showtopoweight",
"--showtopohops",
"--showtopotype",
"--showtoponuma",
"--json")
return internal.StdOutputTimeout(cmd, time.Duration(rsmi.Timeout))
}
func genTagsFields(gpus map[string]gpu, system map[string]sysInfo) []metric {
metrics := make([]metric, 0, len(gpus))
for cardID := range gpus {
if strings.Contains(cardID, "card") {
tags := map[string]string{
"name": cardID,
}
payload := gpus[cardID]
//nolint:errcheck // silently treat as zero if malformed
totVRAM, _ := strconv.ParseInt(payload.GpuVRAMTotalMemory, 10, 64)
//nolint:errcheck // silently treat as zero if malformed
usdVRAM, _ := strconv.ParseInt(payload.GpuVRAMTotalUsedMemory, 10, 64)
strFree := strconv.FormatInt(totVRAM-usdVRAM, 10)
// Try using value found in Device ID first. If not found, try GPU
// ID for backwards compatibility.
setTagIfUsed(tags, "gpu_id", payload.DeviceID)
setTagIfUsed(tags, "gpu_id", payload.GpuID)
setTagIfUsed(tags, "gpu_unique_id", payload.GpuUniqueID)
fields := make(map[string]interface{}, 20)
setIfUsed("int", fields, "driver_version", strings.ReplaceAll(system["system"].DriverVersion, ".", ""))
setIfUsed("int", fields, "fan_speed", payload.GpuFanSpeedPercentage)
setIfUsed("int64", fields, "memory_total", payload.GpuVRAMTotalMemory)
setIfUsed("int64", fields, "memory_used", payload.GpuVRAMTotalUsedMemory)
setIfUsed("int64", fields, "memory_free", strFree)
setIfUsed("float", fields, "temperature_sensor_edge", payload.GpuTemperatureSensorEdge)
setIfUsed("float", fields, "temperature_sensor_junction", payload.GpuTemperatureSensorJunction)
setIfUsed("float", fields, "temperature_sensor_memory", payload.GpuTemperatureSensorMemory)
setIfUsed("int", fields, "utilization_gpu", payload.GpuUsePercentage)
// Try using allocated percentage first.
setIfUsed("int", fields, "utilization_memory", payload.GpuMemoryAllocatedPercentage)
setIfUsed("int", fields, "utilization_memory", payload.GpuMemoryUsePercentage)
setIfUsed("int", fields, "clocks_current_sm", strings.Trim(payload.GpuSclkClockSpeed, "(Mhz)"))
setIfUsed("int", fields, "clocks_current_memory", strings.Trim(payload.GpuMclkClockSpeed, "(Mhz)"))
setIfUsed("int", fields, "clocks_current_display", strings.Trim(payload.GpuDcefClkClockSpeed, "(Mhz)"))
setIfUsed("int", fields, "clocks_current_fabric", strings.Trim(payload.GpuFclkClockSpeed, "(Mhz)"))
setIfUsed("int", fields, "clocks_current_system", strings.Trim(payload.GpuSocclkClockSpeed, "(Mhz)"))
setIfUsed("float", fields, "power_draw", payload.GpuAveragePower)
setIfUsed("str", fields, "card_series", payload.GpuCardSeries)
setIfUsed("str", fields, "card_model", payload.GpuCardModel)
setIfUsed("str", fields, "card_vendor", payload.GpuCardVendor)
metrics = append(metrics, metric{tags, fields})
}
}
return metrics
}
func gatherROCmSMI(ret []byte, acc telegraf.Accumulator) error {
var gpus map[string]gpu
var sys map[string]sysInfo
err1 := json.Unmarshal(ret, &gpus)
if err1 != nil {
return err1
}
err2 := json.Unmarshal(ret, &sys)
if err2 != nil {
return err2
}
metrics := genTagsFields(gpus, sys)
for _, metric := range metrics {
acc.AddFields(measurement, metric.fields, metric.tags)
}
return nil
}
func setTagIfUsed(m map[string]string, k, v string) {
if v != "" {
m[k] = v
}
}
func setIfUsed(t string, m map[string]interface{}, k, v string) {
vals := strings.Fields(v)
if len(vals) < 1 {
return
}
val := vals[0]
switch t {
case "float":
if val != "" {
f, err := strconv.ParseFloat(val, 64)
if err == nil {
m[k] = f
}
}
case "int":
if val != "" {
i, err := strconv.Atoi(val)
if err == nil {
m[k] = i
}
}
case "int64":
if val != "" {
i, err := strconv.ParseInt(val, 10, 64)
if err == nil {
m[k] = i
}
}
case "str":
if val != "" {
m[k] = val
}
}
}
func init() {
inputs.Add("amd_rocm_smi", func() telegraf.Input {
return &ROCmSMI{
BinPath: "/opt/rocm/bin/rocm-smi",
Timeout: config.Duration(5 * time.Second),
}
})
}

View file

@ -0,0 +1,610 @@
package amd_rocm_smi
import (
"os"
"path/filepath"
"testing"
"time"
"github.com/stretchr/testify/require"
"github.com/influxdata/telegraf"
"github.com/influxdata/telegraf/internal"
"github.com/influxdata/telegraf/models"
"github.com/influxdata/telegraf/testutil"
)
func TestErrorBehaviorDefault(t *testing.T) {
// make sure we can't find rocm-smi in $PATH somewhere
os.Unsetenv("PATH")
plugin := &ROCmSMI{
BinPath: "/random/non-existent/path",
Log: &testutil.Logger{},
}
model := models.NewRunningInput(plugin, &models.InputConfig{
Name: "amd_rocm_smi",
})
require.NoError(t, model.Init())
var acc testutil.Accumulator
var ferr *internal.FatalError
require.NotErrorAs(t, model.Start(&acc), &ferr)
require.ErrorIs(t, model.Gather(&acc), internal.ErrNotConnected)
}
func TestErrorBehaviorError(t *testing.T) {
// make sure we can't find rocm-smi in $PATH somewhere
os.Unsetenv("PATH")
plugin := &ROCmSMI{
BinPath: "/random/non-existent/path",
Log: &testutil.Logger{},
}
model := models.NewRunningInput(plugin, &models.InputConfig{
Name: "amd_rocm_smi",
StartupErrorBehavior: "error",
})
require.NoError(t, model.Init())
var acc testutil.Accumulator
var ferr *internal.FatalError
require.NotErrorAs(t, model.Start(&acc), &ferr)
require.ErrorIs(t, model.Gather(&acc), internal.ErrNotConnected)
}
func TestErrorBehaviorRetry(t *testing.T) {
// make sure we can't find nvidia-smi in $PATH somewhere
os.Unsetenv("PATH")
plugin := &ROCmSMI{
BinPath: "/random/non-existent/path",
Log: &testutil.Logger{},
}
model := models.NewRunningInput(plugin, &models.InputConfig{
Name: "amd_rocm_smi",
StartupErrorBehavior: "retry",
})
require.NoError(t, model.Init())
var acc testutil.Accumulator
var ferr *internal.FatalError
require.NotErrorAs(t, model.Start(&acc), &ferr)
require.ErrorIs(t, model.Gather(&acc), internal.ErrNotConnected)
}
func TestErrorBehaviorIgnore(t *testing.T) {
// make sure we can't find nvidia-smi in $PATH somewhere
os.Unsetenv("PATH")
plugin := &ROCmSMI{
BinPath: "/random/non-existent/path",
Log: &testutil.Logger{},
}
model := models.NewRunningInput(plugin, &models.InputConfig{
Name: "amd_rocm_smi",
StartupErrorBehavior: "ignore",
})
require.NoError(t, model.Init())
var acc testutil.Accumulator
var ferr *internal.FatalError
require.ErrorAs(t, model.Start(&acc), &ferr)
require.ErrorIs(t, model.Gather(&acc), internal.ErrNotConnected)
}
func TestGatherValidJSON(t *testing.T) {
tests := []struct {
name string
filename string
expected []telegraf.Metric
}{
{
name: "Vega 10 XT",
filename: "vega-10-XT.json",
expected: []telegraf.Metric{
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x6861",
"gpu_unique_id": "0x2150e7d042a1124",
"name": "card0",
},
map[string]interface{}{
"card_model": "0xc1e",
"card_vendor": "Advanced",
"driver_version": 5925,
"fan_speed": 13,
"memory_total": int64(17163091968),
"memory_used": int64(17776640),
"memory_free": int64(17145315328),
"temperature_sensor_edge": 39.0,
"temperature_sensor_junction": 40.0,
"temperature_sensor_memory": 92.0,
"utilization_gpu": 0,
"clocks_current_display": 600,
"clocks_current_sm": 1269,
"clocks_current_memory": 167,
"clocks_current_system": 960,
"power_draw": 15.0,
},
time.Unix(0, 0)),
},
},
{
name: "Vega 20 WKS GL-XE [Radeon Pro VII]",
filename: "vega-20-WKS-GL-XE.json",
expected: []telegraf.Metric{
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x66a1",
"gpu_unique_id": "0x2f048617326b1ea",
"name": "card0",
},
map[string]interface{}{
"card_model": "0x834",
"card_series": "Radeon",
"card_vendor": "Advanced",
"driver_version": 5917,
"fan_speed": 0,
"memory_total": int64(34342961152),
"memory_used": int64(10850304),
"memory_free": int64(34332110848),
"temperature_sensor_edge": 36.0,
"temperature_sensor_junction": 38.0,
"temperature_sensor_memory": 35.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_display": 357,
"clocks_current_fabric": 1080,
"clocks_current_sm": 1725,
"clocks_current_memory": 1000,
"clocks_current_system": 971,
"power_draw": 26.0,
},
time.Unix(0, 0),
),
},
},
{
name: "mi100 + ROCm 571",
filename: "mi100_rocm571.json",
expected: []telegraf.Metric{
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "N/A",
"name": "card0",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 624,
"memory_total": int64(34342961152),
"memory_used": int64(6782976),
"memory_free": int64(34336178176),
"temperature_sensor_edge": 31.0,
"temperature_sensor_junction": 34.0,
"temperature_sensor_memory": 30.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 39.0,
},
time.Unix(0, 0),
),
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "N/A",
"name": "card1",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 624,
"memory_total": int64(34342961152),
"memory_used": int64(6782976),
"memory_free": int64(34336178176),
"temperature_sensor_edge": 30.0,
"temperature_sensor_junction": 33.0,
"temperature_sensor_memory": 38.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 37.0,
},
time.Unix(0, 0),
),
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "N/A",
"name": "card2",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 624,
"memory_total": int64(34342961152),
"memory_used": int64(6782976),
"memory_free": int64(34336178176),
"temperature_sensor_edge": 31.0,
"temperature_sensor_junction": 34.0,
"temperature_sensor_memory": 31.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 35.0,
},
time.Unix(0, 0),
),
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "N/A",
"name": "card3",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 624,
"memory_total": int64(34342961152),
"memory_used": int64(6782976),
"memory_free": int64(34336178176),
"temperature_sensor_edge": 33.0,
"temperature_sensor_junction": 35.0,
"temperature_sensor_memory": 36.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 39.0,
},
time.Unix(0, 0),
),
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "N/A",
"name": "card4",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 624,
"memory_total": int64(34342961152),
"memory_used": int64(6782976),
"memory_free": int64(34336178176),
"temperature_sensor_edge": 32.0,
"temperature_sensor_junction": 34.0,
"temperature_sensor_memory": 38.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 39.0,
},
time.Unix(0, 0),
),
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "N/A",
"name": "card5",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 624,
"memory_total": int64(34342961152),
"memory_used": int64(6782976),
"memory_free": int64(34336178176),
"temperature_sensor_edge": 33.0,
"temperature_sensor_junction": 35.0,
"temperature_sensor_memory": 38.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 40.0,
},
time.Unix(0, 0),
),
},
},
{
name: "mi100 + ROCm 602",
filename: "mi100_rocm602.json",
expected: []telegraf.Metric{
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "0x79ccd55167a2124a",
"name": "card0",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 636,
"memory_total": int64(34342961152),
"memory_used": int64(6750208),
"memory_free": int64(34336210944),
"temperature_sensor_edge": 53.0,
"temperature_sensor_junction": 55.0,
"temperature_sensor_memory": 53.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 36.0,
},
time.Unix(0, 0),
),
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "0x4edfb117a17a07d",
"name": "card1",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 636,
"memory_total": int64(34342961152),
"memory_used": int64(6750208),
"memory_free": int64(34336210944),
"temperature_sensor_edge": 55.0,
"temperature_sensor_junction": 58.0,
"temperature_sensor_memory": 54.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 44.0,
},
time.Unix(0, 0),
),
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "0xd4a9ec48d03d261d",
"name": "card2",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 636,
"memory_total": int64(34342961152),
"memory_used": int64(6750208),
"memory_free": int64(34336210944),
"temperature_sensor_edge": 54.0,
"temperature_sensor_junction": 57.0,
"temperature_sensor_memory": 55.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 43.0,
},
time.Unix(0, 0),
),
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x738c",
"gpu_unique_id": "0x1b9dd972253c3736",
"name": "card3",
},
map[string]interface{}{
"card_model": "0x0c34",
"card_series": "Arcturus",
"card_vendor": "Advanced",
"driver_version": 636,
"memory_total": int64(34342961152),
"memory_used": int64(6750208),
"memory_free": int64(34336210944),
"temperature_sensor_edge": 51.0,
"temperature_sensor_junction": 53.0,
"temperature_sensor_memory": 50.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_fabric": 1402,
"clocks_current_sm": 300,
"clocks_current_memory": 1200,
"clocks_current_system": 1000,
"power_draw": 39.0,
},
time.Unix(0, 0),
),
},
},
{
name: "rx6700xt + ROCm 430",
filename: "rx6700xt_rocm430.json",
expected: []telegraf.Metric{
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x73df",
"gpu_unique_id": "N/A",
"name": "card0",
},
map[string]interface{}{
"card_model": "0x1002",
"card_series": "0x1002",
"card_vendor": "0x1002",
"driver_version": 636,
"memory_total": int64(12868124672),
"memory_used": int64(1622728704),
"memory_free": int64(11245395968),
"temperature_sensor_edge": 45.0,
"temperature_sensor_junction": 47.0,
"temperature_sensor_memory": 46.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_display": 480,
"clocks_current_fabric": 1051,
"clocks_current_sm": 500,
"clocks_current_memory": 96,
"clocks_current_system": 685,
"power_draw": 6.0,
},
time.Unix(0, 0),
),
},
},
{
name: "rx6700xt + ROCm 571",
filename: "rx6700xt_rocm571.json",
expected: []telegraf.Metric{
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x73df",
"gpu_unique_id": "N/A",
"name": "card0",
},
map[string]interface{}{
"card_model": "0x6601",
"card_series": "Navi",
"card_vendor": "Advanced",
"driver_version": 636,
"memory_total": int64(12868124672),
"memory_used": int64(1564491776),
"memory_free": int64(11303632896),
"temperature_sensor_edge": 45.0,
"temperature_sensor_junction": 47.0,
"temperature_sensor_memory": 46.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_display": 480,
"clocks_current_fabric": 1051,
"clocks_current_sm": 500,
"clocks_current_memory": 96,
"clocks_current_system": 685,
"power_draw": 6.0,
},
time.Unix(0, 0),
),
},
},
{
name: "rx6700xt + ROCm 602",
filename: "rx6700xt_rocm602.json",
expected: []telegraf.Metric{
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x73df",
"gpu_unique_id": "N/A",
"name": "card0",
},
map[string]interface{}{
"card_model": "0x6601",
"card_series": "Navi",
"card_vendor": "Advanced",
"driver_version": 636,
"memory_total": int64(12868124672),
"memory_used": int64(1572757504),
"memory_free": int64(11295367168),
"temperature_sensor_edge": 45.0,
"temperature_sensor_junction": 47.0,
"temperature_sensor_memory": 46.0,
"utilization_gpu": 0,
"utilization_memory": 0,
"clocks_current_display": 480,
"clocks_current_fabric": 1051,
"clocks_current_sm": 500,
"clocks_current_memory": 96,
"clocks_current_system": 685,
},
time.Unix(0, 0),
),
},
},
{
name: "rx6700xt + ROCm 612",
filename: "rx6700xt_rocm612.json",
expected: []telegraf.Metric{
testutil.MustMetric(
"amd_rocm_smi",
map[string]string{
"gpu_id": "0x73df",
"gpu_unique_id": "N/A",
"name": "card0",
},
map[string]interface{}{
"card_model": "0x73df",
"card_series": "Navi",
"card_vendor": "Advanced",
"driver_version": 636,
"memory_total": int64(12868124672),
"memory_used": int64(1572745216),
"memory_free": int64(11295379456),
"temperature_sensor_edge": 45.0,
"temperature_sensor_junction": 47.0,
"temperature_sensor_memory": 46.0,
"utilization_gpu": 0,
"utilization_memory": 12,
"clocks_current_display": 480,
"clocks_current_fabric": 1051,
"clocks_current_sm": 0,
"clocks_current_memory": 96,
"clocks_current_system": 685,
"power_draw": 6.0,
},
time.Unix(0, 0),
),
},
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
var acc testutil.Accumulator
octets, err := os.ReadFile(filepath.Join("testdata", tt.filename))
require.NoError(t, err)
err = gatherROCmSMI(octets, &acc)
require.NoError(t, err)
testutil.RequireMetricsEqual(t, tt.expected, acc.GetTelegrafMetrics(), testutil.IgnoreTime(), testutil.SortMetrics())
})
}
}

View file

@ -0,0 +1,7 @@
# Query statistics from AMD Graphics cards using rocm-smi binary
[[inputs.amd_rocm_smi]]
## Optional: path to rocm-smi binary, defaults to $PATH via exec.LookPath
# bin_path = "/opt/rocm/bin/rocm-smi"
## Optional: timeout for GPU polling
# timeout = "5s"

View file

@ -0,0 +1,532 @@
{
"card0": {
"GPU ID": "0x738c",
"Unique ID": "N/A",
"VBIOS version": "113-D3430400-037",
"Temperature (Sensor edge) (C)": "31.0",
"Temperature (Sensor junction) (C)": "34.0",
"Temperature (Sensor memory) (C)": "30.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (8.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "39.0",
"0": "8.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "0",
"Voltage (mV)": "662",
"PCI Bus": "0000:1E:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6782976",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6782976",
"GTT Total Memory (B)": "135048462336",
"GTT Total Used Memory (B)": "11399168",
"ASD firmware version": "0x21000059",
"CE firmware version": "0",
"DMCU firmware version": "0",
"MC firmware version": "0",
"ME firmware version": "0",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"PFP firmware version": "0",
"RLC firmware version": "24",
"RLC SRLC firmware version": "0",
"RLC SRLG firmware version": "0",
"RLC SRLS firmware version": "0",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.15",
"UVD firmware version": "0x00000000",
"VCE firmware version": "0x00000000",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3430400",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "0"
},
"card1": {
"GPU ID": "0x738c",
"Unique ID": "N/A",
"VBIOS version": "113-D3430400-037",
"Temperature (Sensor edge) (C)": "30.0",
"Temperature (Sensor junction) (C)": "33.0",
"Temperature (Sensor memory) (C)": "38.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (8.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "37.0",
"0": "8.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"GPU memory vendor": "hynix",
"PCIe Replay Count": "0",
"Serial Number": "0",
"Voltage (mV)": "662",
"PCI Bus": "0000:22:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6782976",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6782976",
"GTT Total Memory (B)": "135048462336",
"GTT Total Used Memory (B)": "11366400",
"ASD firmware version": "0x21000059",
"CE firmware version": "0",
"DMCU firmware version": "0",
"MC firmware version": "0",
"ME firmware version": "0",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"PFP firmware version": "0",
"RLC firmware version": "24",
"RLC SRLC firmware version": "0",
"RLC SRLG firmware version": "0",
"RLC SRLS firmware version": "0",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.15",
"UVD firmware version": "0x00000000",
"VCE firmware version": "0x00000000",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3430400",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "0"
},
"card2": {
"GPU ID": "0x738c",
"Unique ID": "N/A",
"VBIOS version": "113-D3430400-037",
"Temperature (Sensor edge) (C)": "31.0",
"Temperature (Sensor junction) (C)": "34.0",
"Temperature (Sensor memory) (C)": "31.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (8.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "35.0",
"0": "8.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "0",
"Voltage (mV)": "656",
"PCI Bus": "0000:3F:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6782976",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6782976",
"GTT Total Memory (B)": "135048462336",
"GTT Total Used Memory (B)": "11366400",
"ASD firmware version": "0x21000059",
"CE firmware version": "0",
"DMCU firmware version": "0",
"MC firmware version": "0",
"ME firmware version": "0",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"PFP firmware version": "0",
"RLC firmware version": "24",
"RLC SRLC firmware version": "0",
"RLC SRLG firmware version": "0",
"RLC SRLS firmware version": "0",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.15",
"UVD firmware version": "0x00000000",
"VCE firmware version": "0x00000000",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3430400",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "0"
},
"card3": {
"GPU ID": "0x738c",
"Unique ID": "N/A",
"VBIOS version": "113-D3430400-037",
"Temperature (Sensor edge) (C)": "33.0",
"Temperature (Sensor junction) (C)": "35.0",
"Temperature (Sensor memory) (C)": "36.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (8.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "39.0",
"0": "8.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"GPU memory vendor": "hynix",
"PCIe Replay Count": "0",
"Serial Number": "0",
"Voltage (mV)": "656",
"PCI Bus": "0000:43:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6782976",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6782976",
"GTT Total Memory (B)": "135048462336",
"GTT Total Used Memory (B)": "11366400",
"ASD firmware version": "0x21000059",
"CE firmware version": "0",
"DMCU firmware version": "0",
"MC firmware version": "0",
"ME firmware version": "0",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"PFP firmware version": "0",
"RLC firmware version": "24",
"RLC SRLC firmware version": "0",
"RLC SRLG firmware version": "0",
"RLC SRLS firmware version": "0",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.15",
"UVD firmware version": "0x00000000",
"VCE firmware version": "0x00000000",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3430400",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "0"
},
"card4": {
"GPU ID": "0x738c",
"Unique ID": "N/A",
"VBIOS version": "113-D3430400-037",
"Temperature (Sensor edge) (C)": "32.0",
"Temperature (Sensor junction) (C)": "34.0",
"Temperature (Sensor memory) (C)": "38.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (8.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "39.0",
"0": "8.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"GPU memory vendor": "hynix",
"PCIe Replay Count": "0",
"Serial Number": "0",
"Voltage (mV)": "656",
"PCI Bus": "0000:46:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6782976",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6782976",
"GTT Total Memory (B)": "135048462336",
"GTT Total Used Memory (B)": "11366400",
"ASD firmware version": "0x21000059",
"CE firmware version": "0",
"DMCU firmware version": "0",
"MC firmware version": "0",
"ME firmware version": "0",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"PFP firmware version": "0",
"RLC firmware version": "24",
"RLC SRLC firmware version": "0",
"RLC SRLG firmware version": "0",
"RLC SRLS firmware version": "0",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.15",
"UVD firmware version": "0x00000000",
"VCE firmware version": "0x00000000",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3430400",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "0"
},
"card5": {
"GPU ID": "0x738c",
"Unique ID": "N/A",
"VBIOS version": "113-D3430400-037",
"Temperature (Sensor edge) (C)": "33.0",
"Temperature (Sensor junction) (C)": "35.0",
"Temperature (Sensor memory) (C)": "38.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (8.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "40.0",
"0": "8.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"GPU memory vendor": "hynix",
"PCIe Replay Count": "0",
"Serial Number": "0",
"Voltage (mV)": "662",
"PCI Bus": "0000:49:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6782976",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6782976",
"GTT Total Memory (B)": "135048462336",
"GTT Total Used Memory (B)": "11366400",
"ASD firmware version": "0x21000059",
"CE firmware version": "0",
"DMCU firmware version": "0",
"MC firmware version": "0",
"ME firmware version": "0",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"PFP firmware version": "0",
"RLC firmware version": "24",
"RLC SRLC firmware version": "0",
"RLC SRLG firmware version": "0",
"RLC SRLS firmware version": "0",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.15",
"UVD firmware version": "0x00000000",
"VCE firmware version": "0x00000000",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3430400",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "0"
},
"system": {
"Driver version": "6.2.4",
"PID104225": "mlir-cpu-runner, 0, 0, 0, 0",
"(Topology) Weight between DRM devices 0 and 1": "40",
"(Topology) Weight between DRM devices 0 and 2": "40",
"(Topology) Weight between DRM devices 0 and 3": "40",
"(Topology) Weight between DRM devices 0 and 4": "40",
"(Topology) Weight between DRM devices 0 and 5": "40",
"(Topology) Weight between DRM devices 1 and 2": "40",
"(Topology) Weight between DRM devices 1 and 3": "40",
"(Topology) Weight between DRM devices 1 and 4": "40",
"(Topology) Weight between DRM devices 1 and 5": "40",
"(Topology) Weight between DRM devices 2 and 3": "40",
"(Topology) Weight between DRM devices 2 and 4": "40",
"(Topology) Weight between DRM devices 2 and 5": "40",
"(Topology) Weight between DRM devices 3 and 4": "40",
"(Topology) Weight between DRM devices 3 and 5": "40",
"(Topology) Weight between DRM devices 4 and 5": "40",
"(Topology) Hops between DRM devices 0 and 1": "2",
"(Topology) Hops between DRM devices 0 and 2": "2",
"(Topology) Hops between DRM devices 0 and 3": "2",
"(Topology) Hops between DRM devices 0 and 4": "2",
"(Topology) Hops between DRM devices 0 and 5": "2",
"(Topology) Hops between DRM devices 1 and 2": "2",
"(Topology) Hops between DRM devices 1 and 3": "2",
"(Topology) Hops between DRM devices 1 and 4": "2",
"(Topology) Hops between DRM devices 1 and 5": "2",
"(Topology) Hops between DRM devices 2 and 3": "2",
"(Topology) Hops between DRM devices 2 and 4": "2",
"(Topology) Hops between DRM devices 2 and 5": "2",
"(Topology) Hops between DRM devices 3 and 4": "2",
"(Topology) Hops between DRM devices 3 and 5": "2",
"(Topology) Hops between DRM devices 4 and 5": "2",
"(Topology) Link type between DRM devices 0 and 1": "PCIE",
"(Topology) Link type between DRM devices 0 and 2": "PCIE",
"(Topology) Link type between DRM devices 0 and 3": "PCIE",
"(Topology) Link type between DRM devices 0 and 4": "PCIE",
"(Topology) Link type between DRM devices 0 and 5": "PCIE",
"(Topology) Link type between DRM devices 1 and 2": "PCIE",
"(Topology) Link type between DRM devices 1 and 3": "PCIE",
"(Topology) Link type between DRM devices 1 and 4": "PCIE",
"(Topology) Link type between DRM devices 1 and 5": "PCIE",
"(Topology) Link type between DRM devices 2 and 3": "PCIE",
"(Topology) Link type between DRM devices 2 and 4": "PCIE",
"(Topology) Link type between DRM devices 2 and 5": "PCIE",
"(Topology) Link type between DRM devices 3 and 4": "PCIE",
"(Topology) Link type between DRM devices 3 and 5": "PCIE",
"(Topology) Link type between DRM devices 4 and 5": "PCIE"
}
}

View file

@ -0,0 +1,311 @@
{
"card0": {
"Device ID": "0x738c",
"Device Rev": "0x1",
"Unique ID": "0x79ccd55167a2124a",
"VBIOS version": "113-D3431401-101",
"Temperature (Sensor edge) (C)": "53.0",
"Temperature (Sensor junction) (C)": "55.0",
"Temperature (Sensor memory) (C)": "53.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (16.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "36.0",
"0": "16.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"Avg. Memory Bandwidth": "0",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "N/A",
"Voltage (mV)": "662",
"PCI Bus": "0000:83:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6750208",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6750208",
"GTT Total Memory (B)": "135138402304",
"GTT Total Used Memory (B)": "11354112",
"ASD firmware version": "0x21000059",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"RLC firmware version": "24",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.17",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3431401",
"(Topology) Numa Node": "1",
"(Topology) Numa Affinity": "1"
},
"card1": {
"Device ID": "0x738c",
"Device Rev": "0x1",
"Unique ID": "0x4edfb117a17a07d",
"VBIOS version": "113-D3431401-101",
"Temperature (Sensor edge) (C)": "55.0",
"Temperature (Sensor junction) (C)": "58.0",
"Temperature (Sensor memory) (C)": "54.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (16.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "44.0",
"0": "16.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"Avg. Memory Bandwidth": "0",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "N/A",
"Voltage (mV)": "668",
"PCI Bus": "0000:A3:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6750208",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6750208",
"GTT Total Memory (B)": "135138402304",
"GTT Total Used Memory (B)": "11354112",
"ASD firmware version": "0x21000059",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"RLC firmware version": "24",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.17",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3431401",
"(Topology) Numa Node": "1",
"(Topology) Numa Affinity": "1"
},
"card2": {
"Device ID": "0x738c",
"Device Rev": "0x1",
"Unique ID": "0xd4a9ec48d03d261d",
"VBIOS version": "113-D3431401-101",
"Temperature (Sensor edge) (C)": "54.0",
"Temperature (Sensor junction) (C)": "57.0",
"Temperature (Sensor memory) (C)": "55.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (16.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "43.0",
"0": "16.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"Avg. Memory Bandwidth": "0",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "N/A",
"Voltage (mV)": "668",
"PCI Bus": "0000:C3:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6750208",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6750208",
"GTT Total Memory (B)": "135138402304",
"GTT Total Used Memory (B)": "11354112",
"ASD firmware version": "0x21000059",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"RLC firmware version": "24",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.17",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3431401",
"(Topology) Numa Node": "1",
"(Topology) Numa Affinity": "1"
},
"card3": {
"Device ID": "0x738c",
"Device Rev": "0x1",
"Unique ID": "0x1b9dd972253c3736",
"VBIOS version": "113-D3431401-101",
"Temperature (Sensor edge) (C)": "51.0",
"Temperature (Sensor junction) (C)": "53.0",
"Temperature (Sensor memory) (C)": "50.0",
"fclk clock speed:": "(1402Mhz)",
"fclk clock level:": "0",
"mclk clock speed:": "(1200Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(300Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(1000Mhz)",
"socclk clock level:": "0",
"pcie clock level": "0 (16.0GT/s x16)",
"sclk clock level": "0 (300Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "290.0",
"Average Graphics Package Power (W)": "39.0",
"0": "16.0GT/s x16 *",
"1": "495Mhz",
"2": "731Mhz",
"3": "962Mhz",
"4": "1029Mhz",
"5": "1087Mhz",
"6": "1147Mhz",
"7": "1189Mhz",
"8": "1235Mhz",
"9": "1283Mhz",
"10": "1319Mhz",
"11": "1363Mhz",
"12": "1404Mhz",
"13": "1430Mhz",
"14": "1472Mhz",
"15": "1502Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"Avg. Memory Bandwidth": "0",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "N/A",
"Voltage (mV)": "662",
"PCI Bus": "0000:E3:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "6750208",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "6750208",
"GTT Total Memory (B)": "135138402304",
"GTT Total Used Memory (B)": "11354112",
"ASD firmware version": "0x21000059",
"MEC firmware version": "65",
"MEC2 firmware version": "65",
"RLC firmware version": "24",
"SDMA firmware version": "18",
"SDMA2 firmware version": "18",
"SMC firmware version": "00.54.29.00",
"SOS firmware version": "0x0017004f",
"TA RAS firmware version": "27.00.01.62",
"TA XGMI firmware version": "32.00.00.17",
"VCN firmware version": "0x01101015",
"Card series": "Arcturus GL-XL [Instinct MI100]",
"Card model": "0x0c34",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D3431401",
"(Topology) Numa Node": "1",
"(Topology) Numa Affinity": "1"
},
"system": {
"Driver version": "6.3.6",
"(Topology) Weight between DRM devices 0 and 1": "40",
"(Topology) Weight between DRM devices 0 and 2": "40",
"(Topology) Weight between DRM devices 0 and 3": "40",
"(Topology) Weight between DRM devices 1 and 2": "40",
"(Topology) Weight between DRM devices 1 and 3": "40",
"(Topology) Weight between DRM devices 2 and 3": "40",
"(Topology) Hops between DRM devices 0 and 1": "2",
"(Topology) Hops between DRM devices 0 and 2": "2",
"(Topology) Hops between DRM devices 0 and 3": "2",
"(Topology) Hops between DRM devices 1 and 2": "2",
"(Topology) Hops between DRM devices 1 and 3": "2",
"(Topology) Hops between DRM devices 2 and 3": "2",
"(Topology) Link type between DRM devices 0 and 1": "PCIE",
"(Topology) Link type between DRM devices 0 and 2": "PCIE",
"(Topology) Link type between DRM devices 0 and 3": "PCIE",
"(Topology) Link type between DRM devices 1 and 2": "PCIE",
"(Topology) Link type between DRM devices 1 and 3": "PCIE",
"(Topology) Link type between DRM devices 2 and 3": "PCIE"
}
}

View file

@ -0,0 +1,69 @@
{
"card0": {
"GPU ID": "0x73df",
"Unique ID": "N/A",
"VBIOS version": "113-67XA6SSB1-D01",
"Temperature (Sensor edge) (C)": "45.0",
"Temperature (Sensor junction) (C)": "47.0",
"Temperature (Sensor memory) (C)": "46.0",
"Temperature (Sensor HBM 0) (C)": "N/A",
"Temperature (Sensor HBM 1) (C)": "N/A",
"Temperature (Sensor HBM 2) (C)": "N/A",
"Temperature (Sensor HBM 3) (C)": "N/A",
"dcefclk clock speed:": "(480Mhz)",
"dcefclk clock level:": "1",
"fclk clock speed:": "(1051Mhz)",
"fclk clock level:": "1",
"mclk clock speed:": "(96Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(500Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(685Mhz)",
"socclk clock level:": "1",
"pcie clock level": "1 (16.0GT/s x16)",
"sclk clock level": "0 (500Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "211.0",
"Average Graphics Package Power (W)": "6.0",
"0": "2.5GT/s x1",
"1": "16.0GT/s x16 *",
"2": "1200Mhz",
"3": "1000Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "N/A",
"Voltage (mV)": "6",
"PCI Bus": "0000:07:00.0",
"VRAM Total Memory (B)": "12868124672",
"VRAM Total Used Memory (B)": "1622728704",
"VIS_VRAM Total Memory (B)": "268435456",
"VIS_VRAM Total Used Memory (B)": "70930432",
"GTT Total Memory (B)": "16792432640",
"GTT Total Used Memory (B)": "244150272",
"ASD firmware version": "553648315",
"CE firmware version": "37",
"ME firmware version": "64",
"MEC firmware version": "116",
"MEC2 firmware version": "116",
"PFP firmware version": "97",
"RLC firmware version": "74",
"SDMA firmware version": "80",
"SDMA2 firmware version": "80",
"SMC firmware version": "00.65.60.00",
"SOS firmware version": "0x00220b0c",
"VCN firmware version": "0x0211e006",
"Card series": "0x1002",
"Card model": "0x1002",
"Card vendor": "0x1002",
"Card SKU": "67XA6S",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "4294967295"
},
"system": {
"Driver version": "6.3.6"
}
}

View file

@ -0,0 +1,66 @@
{
"card0": {
"GPU ID": "0x73df",
"Unique ID": "N/A",
"VBIOS version": "113-67XA6SSB1-D01",
"Temperature (Sensor edge) (C)": "45.0",
"Temperature (Sensor junction) (C)": "47.0",
"Temperature (Sensor memory) (C)": "46.0",
"dcefclk clock speed:": "(480Mhz)",
"dcefclk clock level:": "1",
"fclk clock speed:": "(1051Mhz)",
"fclk clock level:": "1",
"mclk clock speed:": "(96Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(500Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(685Mhz)",
"socclk clock level:": "1",
"pcie clock level": "1 (16.0GT/s x16)",
"sclk clock level": "0 (500Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "211.0",
"Average Graphics Package Power (W)": "6.0",
"0": "2.5GT/s x1",
"1": "16.0GT/s x16 *",
"2": "1200Mhz",
"3": "1000Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "N/A",
"Voltage (mV)": "862",
"PCI Bus": "0000:07:00.0",
"VRAM Total Memory (B)": "12868124672",
"VRAM Total Used Memory (B)": "1564491776",
"VIS_VRAM Total Memory (B)": "268435456",
"VIS_VRAM Total Used Memory (B)": "70930432",
"GTT Total Memory (B)": "16792432640",
"GTT Total Used Memory (B)": "248328192",
"ASD firmware version": "0x210000bb",
"CE firmware version": "37",
"ME firmware version": "64",
"MEC firmware version": "116",
"MEC2 firmware version": "116",
"PFP firmware version": "97",
"RLC firmware version": "74",
"SDMA firmware version": "80",
"SDMA2 firmware version": "80",
"SMC firmware version": "00.65.60.00",
"SOS firmware version": "0x00220b0c",
"VCN firmware version": "0x0211e006",
"Card series": "Navi 22 [Radeon RX 6700/6700 XT / 6800M]",
"Card model": "0x6601",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "67XA6SSB1",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "4294967295"
},
"system": {
"Driver version": "6.3.6"
}
}

View file

@ -0,0 +1,68 @@
{
"card0": {
"Device ID": "0x73df",
"Device Rev": "0xc1",
"Unique ID": "N/A",
"VBIOS version": "113-67XA6SSB1-D01",
"Temperature (Sensor edge) (C)": "45.0",
"Temperature (Sensor junction) (C)": "47.0",
"Temperature (Sensor memory) (C)": "46.0",
"dcefclk clock speed:": "(480Mhz)",
"dcefclk clock level:": "1",
"fclk clock speed:": "(1051Mhz)",
"fclk clock level:": "1",
"mclk clock speed:": "(96Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(500Mhz)",
"sclk clock level:": "1",
"socclk clock speed:": "(685Mhz)",
"socclk clock level:": "1",
"pcie clock level": "1 (16.0GT/s x16)",
"sclk clock level": "1 (500Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "211.0",
"Average Graphics Package Power (W)": "N/A (Secondary die)",
"0": "2.5GT/s x1",
"1": "16.0GT/s x16 *",
"2": "1200Mhz",
"3": "1000Mhz",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"Memory Activity": "N/A",
"Avg. Memory Bandwidth": "0",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "N/A",
"Voltage (mV)": "6",
"PCI Bus": "0000:07:00.0",
"VRAM Total Memory (B)": "12868124672",
"VRAM Total Used Memory (B)": "1572757504",
"VIS_VRAM Total Memory (B)": "268435456",
"VIS_VRAM Total Used Memory (B)": "70930432",
"GTT Total Memory (B)": "16792432640",
"GTT Total Used Memory (B)": "242049024",
"ASD firmware version": "0x210000bb",
"CE firmware version": "37",
"ME firmware version": "64",
"MEC firmware version": "116",
"MEC2 firmware version": "116",
"PFP firmware version": "97",
"RLC firmware version": "74",
"SDMA firmware version": "80",
"SDMA2 firmware version": "80",
"SMC firmware version": "00.65.60.00",
"SOS firmware version": "0x00220b0c",
"VCN firmware version": "0x0211e006",
"Card series": "Navi 22 [Radeon RX 6700/6700 XT / 6800M]",
"Card model": "0x6601",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "67XA6SSB1",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "-1"
},
"system": {
"Driver version": "6.3.6"
}
}

View file

@ -0,0 +1,75 @@
{
"card0": {
"Device Name": "Navi 22 [Radeon RX 6700/6700 XT / 6800M]",
"Device ID": "0x73df",
"Device Rev": "0xc1",
"Subsystem ID": "0x6601",
"GUID": "55069",
"Unique ID": "N/A",
"VBIOS version": "113-67XA6SSB1-D01",
"Temperature (Sensor edge) (C)": "45.0",
"Temperature (Sensor junction) (C)": "47.0",
"Temperature (Sensor memory) (C)": "46.0",
"dcefclk clock speed:": "(480Mhz)",
"dcefclk clock level:": "1",
"fclk clock speed:": "(1051Mhz)",
"fclk clock level:": "1",
"mclk clock speed:": "(96Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(0Mhz)",
"sclk clock level:": "0",
"socclk clock speed:": "(685Mhz)",
"socclk clock level:": "1",
"pcie clock level": "1 (16.0GT/s x16)",
"sclk clock level": "0 (0Mhz)",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "211.0",
"Average Graphics Package Power (W)": "6.0",
"0": "2.5GT/s x1",
"1": "16.0GT/s x16 *",
"2": "1200Mhz",
"3": "1000Mhz",
"GPU use (%)": "0",
"GPU Memory Allocated (VRAM%)": "12",
"GPU Memory Read/Write Activity (%)": "0",
"Memory Activity": "N/A",
"Avg. Memory Bandwidth": "0",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "N/A",
"Voltage (mV)": "6",
"PCI Bus": "0000:07:00.0",
"VRAM Total Memory (B)": "12868124672",
"VRAM Total Used Memory (B)": "1572745216",
"VIS_VRAM Total Memory (B)": "268435456",
"VIS_VRAM Total Used Memory (B)": "70930432",
"GTT Total Memory (B)": "16792432640",
"GTT Total Used Memory (B)": "242036736",
"ASD firmware version": "0x210000bb",
"CE firmware version": "37",
"ME firmware version": "64",
"MEC firmware version": "116",
"MEC2 firmware version": "116",
"PFP firmware version": "97",
"RLC firmware version": "74",
"SDMA firmware version": "80",
"SDMA2 firmware version": "80",
"SMC firmware version": "00.65.60.00",
"SOS firmware version": "0x00220b0c",
"VCN firmware version": "0x0211e006",
"Card Series": "Navi 22 [Radeon RX 6700/6700 XT / 6800M]",
"Card Model": "0x73df",
"Card Vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "67XA6SSB1",
"Node ID": "1",
"GFX Version": "gfx1031",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "-1"
},
"system": {
"Driver version": "6.3.6"
}
}

View file

@ -0,0 +1,77 @@
{
"card0": {
"GPU ID": "0x6861",
"Unique ID": "0x2150e7d042a1124",
"VBIOS version": "113-D0510100-106",
"Temperature (Sensor edge) (C)": "39.0",
"Temperature (Sensor junction) (C)": "40.0",
"Temperature (Sensor memory) (C)": "92.0",
"dcefclk clock speed:": "(600Mhz)",
"dcefclk clock level:": "0",
"mclk clock speed:": "(167Mhz)",
"mclk clock level:": "0",
"sclk clock speed:": "(1269Mhz)",
"sclk clock level:": "3",
"socclk clock speed:": "(960Mhz)",
"socclk clock level:": "3",
"pcie clock level": "1 (8.0GT/s x16)",
"sclk clock level": "3 (1269Mhz)",
"Fan speed (level)": "33",
"Fan speed (%)": "13",
"Fan RPM": "682",
"Performance Level": "auto",
"GPU OverDrive value (%)": "0",
"GPU Memory OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "170.0",
"Average Graphics Package Power (W)": "15.0",
"0": "8.0GT/s x16",
"1": "8.0GT/s x16 *",
"2": "847Mhz",
"3": "960Mhz *",
"4": "1028Mhz",
"5": "1107Mhz",
"6": "1440Mhz",
"7": "1500Mhz",
"GPU use (%)": "0",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "N/A",
"Voltage (mV)": "906",
"PCI Bus": "0000:04:00.0",
"VRAM Total Memory (B)": "17163091968",
"VRAM Total Used Memory (B)": "17776640",
"VIS_VRAM Total Memory (B)": "268435456",
"VIS_VRAM Total Used Memory (B)": "13557760",
"GTT Total Memory (B)": "17163091968",
"GTT Total Used Memory (B)": "25608192",
"ASD firmware version": "553648152",
"CE firmware version": "79",
"DMCU firmware version": "0",
"MC firmware version": "0",
"ME firmware version": "163",
"MEC firmware version": "432",
"MEC2 firmware version": "432",
"PFP firmware version": "186",
"RLC firmware version": "93",
"RLC SRLC firmware version": "0",
"RLC SRLG firmware version": "0",
"RLC SRLS firmware version": "0",
"SDMA firmware version": "430",
"SDMA2 firmware version": "430",
"SMC firmware version": "00.28.54.00",
"SOS firmware version": "0x0008015d",
"TA RAS firmware version": "00.00.00.00",
"TA XGMI firmware version": "00.00.00.00",
"UVD firmware version": "0x422b1100",
"VCE firmware version": "0x39060400",
"VCN firmware version": "0x00000000",
"Card model": "0xc1e",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D05101",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "0"
},
"system": {
"Driver version": "5.9.25"
}
}

View file

@ -0,0 +1,165 @@
{
"card0": {
"GPU ID": "0x66a1",
"Unique ID": "0x2f048617326b1ea",
"VBIOS version": "113-D1631700-111",
"Temperature (Sensor edge) (C)": "36.0",
"Temperature (Sensor junction) (C)": "38.0",
"Temperature (Sensor memory) (C)": "35.0",
"dcefclk clock speed:": "(357Mhz)",
"dcefclk clock level:": "0",
"fclk clock speed:": "(1080Mhz)",
"fclk clock level:": "6",
"mclk clock speed:": "(1000Mhz)",
"mclk clock level:": "2",
"sclk clock speed:": "(1725Mhz)",
"sclk clock level:": "8",
"socclk clock speed:": "(971Mhz)",
"socclk clock level:": "7",
"pcie clock level": "1 (16.0GT/s x16)",
"sclk clock level": "8 (1725Mhz)",
"Fan speed (level)": "0",
"Fan speed (%)": "0",
"Fan RPM": "0",
"Performance Level": "high",
"GPU OverDrive value (%)": "0",
"Max Graphics Package Power (W)": "225.0",
"Average Graphics Package Power (W)": "26.0",
"0": "2.5GT/s x16",
"1": "16.0GT/s x16 *",
"2": "566Mhz",
"3": "618Mhz",
"4": "680Mhz",
"5": "755Mhz",
"6": "850Mhz",
"7": "971Mhz *",
"8": "1725Mhz *",
"GPU use (%)": "0",
"GPU memory use (%)": "0",
"GPU memory vendor": "samsung",
"PCIe Replay Count": "0",
"Serial Number": "692024000810",
"Voltage (mV)": "1000",
"PCI Bus": "0000:63:00.0",
"VRAM Total Memory (B)": "34342961152",
"VRAM Total Used Memory (B)": "10850304",
"VIS_VRAM Total Memory (B)": "34342961152",
"VIS_VRAM Total Used Memory (B)": "10850304",
"GTT Total Memory (B)": "54974742528",
"GTT Total Used Memory (B)": "11591680",
"ASD firmware version": "553648199",
"CE firmware version": "79",
"DMCU firmware version": "0",
"MC firmware version": "0",
"ME firmware version": "164",
"MEC firmware version": "448",
"MEC2 firmware version": "448",
"PFP firmware version": "188",
"RLC firmware version": "50",
"RLC SRLC firmware version": "1",
"RLC SRLG firmware version": "1",
"RLC SRLS firmware version": "1",
"SDMA firmware version": "144",
"SDMA2 firmware version": "144",
"SMC firmware version": "00.40.59.00",
"SOS firmware version": "0x00080b67",
"TA RAS firmware version": "27.00.01.36",
"TA XGMI firmware version": "32.00.00.02",
"UVD firmware version": "0x42002b13",
"VCE firmware version": "0x39060400",
"VCN firmware version": "0x00000000",
"Card series": "Radeon Instinct MI50 32GB",
"Card model": "0x834",
"Card vendor": "Advanced Micro Devices, Inc. [AMD/ATI]",
"Card SKU": "D16317",
"(Topology) Numa Node": "0",
"(Topology) Numa Affinity": "0"
},
"system": {
"Driver version": "5.9.17",
"(Topology) Weight between DRM devices 0 and 1": "40",
"(Topology) Weight between DRM devices 0 and 2": "40",
"(Topology) Weight between DRM devices 0 and 3": "40",
"(Topology) Weight between DRM devices 0 and 4": "72",
"(Topology) Weight between DRM devices 0 and 5": "72",
"(Topology) Weight between DRM devices 0 and 6": "72",
"(Topology) Weight between DRM devices 0 and 7": "72",
"(Topology) Weight between DRM devices 1 and 2": "40",
"(Topology) Weight between DRM devices 1 and 3": "40",
"(Topology) Weight between DRM devices 1 and 4": "72",
"(Topology) Weight between DRM devices 1 and 5": "72",
"(Topology) Weight between DRM devices 1 and 6": "72",
"(Topology) Weight between DRM devices 1 and 7": "72",
"(Topology) Weight between DRM devices 2 and 3": "40",
"(Topology) Weight between DRM devices 2 and 4": "72",
"(Topology) Weight between DRM devices 2 and 5": "72",
"(Topology) Weight between DRM devices 2 and 6": "72",
"(Topology) Weight between DRM devices 2 and 7": "72",
"(Topology) Weight between DRM devices 3 and 4": "72",
"(Topology) Weight between DRM devices 3 and 5": "72",
"(Topology) Weight between DRM devices 3 and 6": "72",
"(Topology) Weight between DRM devices 3 and 7": "72",
"(Topology) Weight between DRM devices 4 and 5": "40",
"(Topology) Weight between DRM devices 4 and 6": "40",
"(Topology) Weight between DRM devices 4 and 7": "40",
"(Topology) Weight between DRM devices 5 and 6": "40",
"(Topology) Weight between DRM devices 5 and 7": "40",
"(Topology) Weight between DRM devices 6 and 7": "40",
"(Topology) Hops between DRM devices 0 and 1": "2",
"(Topology) Hops between DRM devices 0 and 2": "2",
"(Topology) Hops between DRM devices 0 and 3": "2",
"(Topology) Hops between DRM devices 0 and 4": "3",
"(Topology) Hops between DRM devices 0 and 5": "3",
"(Topology) Hops between DRM devices 0 and 6": "3",
"(Topology) Hops between DRM devices 0 and 7": "3",
"(Topology) Hops between DRM devices 1 and 2": "2",
"(Topology) Hops between DRM devices 1 and 3": "2",
"(Topology) Hops between DRM devices 1 and 4": "3",
"(Topology) Hops between DRM devices 1 and 5": "3",
"(Topology) Hops between DRM devices 1 and 6": "3",
"(Topology) Hops between DRM devices 1 and 7": "3",
"(Topology) Hops between DRM devices 2 and 3": "2",
"(Topology) Hops between DRM devices 2 and 4": "3",
"(Topology) Hops between DRM devices 2 and 5": "3",
"(Topology) Hops between DRM devices 2 and 6": "3",
"(Topology) Hops between DRM devices 2 and 7": "3",
"(Topology) Hops between DRM devices 3 and 4": "3",
"(Topology) Hops between DRM devices 3 and 5": "3",
"(Topology) Hops between DRM devices 3 and 6": "3",
"(Topology) Hops between DRM devices 3 and 7": "3",
"(Topology) Hops between DRM devices 4 and 5": "2",
"(Topology) Hops between DRM devices 4 and 6": "2",
"(Topology) Hops between DRM devices 4 and 7": "2",
"(Topology) Hops between DRM devices 5 and 6": "2",
"(Topology) Hops between DRM devices 5 and 7": "2",
"(Topology) Hops between DRM devices 6 and 7": "2",
"(Topology) Link type between DRM devices 0 and 1": "PCIE",
"(Topology) Link type between DRM devices 0 and 2": "PCIE",
"(Topology) Link type between DRM devices 0 and 3": "PCIE",
"(Topology) Link type between DRM devices 0 and 4": "PCIE",
"(Topology) Link type between DRM devices 0 and 5": "PCIE",
"(Topology) Link type between DRM devices 0 and 6": "PCIE",
"(Topology) Link type between DRM devices 0 and 7": "PCIE",
"(Topology) Link type between DRM devices 1 and 2": "PCIE",
"(Topology) Link type between DRM devices 1 and 3": "PCIE",
"(Topology) Link type between DRM devices 1 and 4": "PCIE",
"(Topology) Link type between DRM devices 1 and 5": "PCIE",
"(Topology) Link type between DRM devices 1 and 6": "PCIE",
"(Topology) Link type between DRM devices 1 and 7": "PCIE",
"(Topology) Link type between DRM devices 2 and 3": "PCIE",
"(Topology) Link type between DRM devices 2 and 4": "PCIE",
"(Topology) Link type between DRM devices 2 and 5": "PCIE",
"(Topology) Link type between DRM devices 2 and 6": "PCIE",
"(Topology) Link type between DRM devices 2 and 7": "PCIE",
"(Topology) Link type between DRM devices 3 and 4": "PCIE",
"(Topology) Link type between DRM devices 3 and 5": "PCIE",
"(Topology) Link type between DRM devices 3 and 6": "PCIE",
"(Topology) Link type between DRM devices 3 and 7": "PCIE",
"(Topology) Link type between DRM devices 4 and 5": "PCIE",
"(Topology) Link type between DRM devices 4 and 6": "PCIE",
"(Topology) Link type between DRM devices 4 and 7": "PCIE",
"(Topology) Link type between DRM devices 5 and 6": "PCIE",
"(Topology) Link type between DRM devices 5 and 7": "PCIE",
"(Topology) Link type between DRM devices 6 and 7": "PCIE"
}
}