Summary: | [NVIDIA] Missing sensor information after driver update | ||
---|---|---|---|
Product: | [Applications] plasma-systemmonitor | Reporter: | Marcelo Bossoni <mmbossoni> |
Component: | general | Assignee: | KSysGuard Developers <ksysguard-bugs> |
Status: | RESOLVED FIXED | ||
Severity: | normal | CC: | ahiemstra, anubhavde2002, b.wilkinson93, betlogbetlog, dashonwwIII, giubbe, jon9097, julien.dlq, kde, luochen1990, Mathias.Homann, maxiberta, me+kde, michele.mazza, MrRessiPiyent125, nate, plasma-bugs, rasmus, rockingcat, sephiroth_pk, starrah, wildreiser, yizel7, zawertun |
Priority: | VHI | ||
Version: | 5.26.4 | ||
Target Milestone: | --- | ||
Platform: | Other | ||
OS: | Linux | ||
Latest Commit: | https://invent.kde.org/plasma/ksystemstats/commit/63375432b8dbba59746cbffd4a74a96764239373 | Version Fixed In: | 5.27.1 |
Sentry Crash Report: | |||
Attachments: | nvidia smi output comparing 520 vs 525 |
Description
Marcelo Bossoni
2022-12-01 17:03:13 UTC
Created attachment 154330 [details]
nvidia smi output comparing 520 vs 525
A bit more info
I've compared the output of "/usr/bin/nvidia-smi dmon -d 2 -s pucm" between driver versions and newer driver add some more spaces it seems... Maybe parser is broken?
*** Bug 462687 has been marked as a duplicate of this bug. *** *** Bug 462689 has been marked as a duplicate of this bug. *** *** Bug 462624 has been marked as a duplicate of this bug. *** *** Bug 462024 has been marked as a duplicate of this bug. *** This might actually be a duplicate of bug 461389, but I do not know enough to decide myself, so intervention by a developer would be welcome! :) I don't think it is - the GPU sensors are there, they just return nothing but zeros. *** Bug 462870 has been marked as a duplicate of this bug. *** *** Bug 463094 has been marked as a duplicate of this bug. *** *** Bug 461389 has been marked as a duplicate of this bug. *** This is still issue with latest nvidia driver 525.78.01 (In reply to yizel7 from comment #11) > This is still issue with latest nvidia driver 525.78.01 Confirmed. I am still having the same issue. Git commit d42ab99437830c1c8aa1b6cd6a616d51fcd42916 by David Redondo. Committed on 09/01/2023 at 12:48. Pushed by davidre into branch 'master'. plugins/gpu: Don't let a newline confuse the nvidia-smi output parsing Newer versions of nvidia-smi add more spaces to the output. While it doesn't matter for the middle parts, there is also an additional space before the new line. Use trimmmed to remove whitespace at the end to cope with both. FIXED-IN:5.27 M +2 -2 plugins/gpu/NvidiaSmiProcess.cpp https://invent.kde.org/plasma/ksystemstats/commit/d42ab99437830c1c8aa1b6cd6a616d51fcd42916 (In reply to David Redondo from comment #13) > Git commit d42ab99437830c1c8aa1b6cd6a616d51fcd42916 by David Redondo. > Committed on 09/01/2023 at 12:48. > Pushed by davidre into branch 'master'. > > plugins/gpu: Don't let a newline confuse the nvidia-smi output parsing > > Newer versions of nvidia-smi add more spaces to the output. While > it doesn't matter for the middle parts, there is also an additional > space before the new line. > Use trimmmed to remove whitespace at the end to cope with both. > FIXED-IN:5.27 > > M +2 -2 plugins/gpu/NvidiaSmiProcess.cpp > > https://invent.kde.org/plasma/ksystemstats/commit/ > d42ab99437830c1c8aa1b6cd6a616d51fcd42916 Tyvm David! Thanks a lot David! I'm on the beta version (5.26.90) and it still doesn't seem to be getting any readings on my GPU. (In reply to Benjamin W. from comment #16) > I'm on the beta version (5.26.90) and it still doesn't seem to be getting > any readings on my GPU. Using Nvidia driver version 525.85.05 *** Bug 464792 has been marked as a duplicate of this bug. *** *** Bug 462541 has been marked as a duplicate of this bug. *** I'm still not seeing this fix being pushed to kubuntu 22.04 repo. Should we expect this to happen any time soon? Is there a manual process to regain this functionality that doesnt involve compiling source? (In reply to betlogbetlog from comment #20) > I'm still not seeing this fix being pushed to kubuntu 22.04 repo. > Should we expect this to happen any time soon? > Is there a manual process to regain this functionality that doesnt involve > compiling source? I'm on 5.26.90 on Arch using the `kde-unstable` repo and I haven't seen the fix yet either. I assume you mean it still doesn't work, as the commit is part of 5.26.90. Please attach the output of - kstatsviewer --list | grep gpu - if the above contained output, check with 'kstastviewer gpu/gpu0/name' it's the nvidia card, adjust gpu0 to the output from above - nvidia-smi --query - nvidia-smi dmon -s pucm (kill after a few lines) (In reply to David Redondo from comment #22) > I assume you mean it still doesn't work, as the commit is part of 5.26.90. Yes. It still does not work. > Please attach the output of > - kstatsviewer --list | grep gpu > - if the above contained output, check with 'kstastviewer gpu/gpu0/name' > it's the nvidia card, adjust gpu0 to the output from above ```~ ❯ kstatsviewer --list | grep gpu gpu/gpu0/power GPU 1 Power gpu/all/usedVram All GPUs Used Memory gpu/all/totalVram All GPUs Total Memory gpu GPU gpu/gpu0/coreFrequency GPU 1 Frequency gpu/all All GPUs gpu/gpu0/temperature GPU 1 Temperature gpu/all/usage All GPUs Usage gpu/gpu0/name GPU 1 Name gpu/gpu0/usage GPU 1 Usage gpu/gpu0/usedVram GPU 1 Video Memory Used gpu/gpu0/memoryFrequency GPU 1 Memory Frequency gpu/gpu0 GPU 1 gpu/gpu0/totalVram GPU 1 Total Video Memory ~ ❯ kstatsviewer gpu/gpu0/usage gpu/gpu0/usage 0``` ❯ kstatsviewer --list | grep gpu gpu/gpu0/power GPU 1 Power gpu/all/usedVram All GPUs Used Memory gpu/all/totalVram All GPUs Total Memory gpu GPU gpu/gpu0/coreFrequency GPU 1 Frequency gpu/all All GPUs gpu/gpu0/temperature GPU 1 Temperature gpu/all/usage All GPUs Usage gpu/gpu0/name GPU 1 Name gpu/gpu0/usage GPU 1 Usage gpu/gpu0/usedVram GPU 1 Video Memory Used gpu/gpu0/memoryFrequency GPU 1 Memory Frequency gpu/gpu0 GPU 1 gpu/gpu0/totalVram GPU 1 Total Video Memory ~ ❯ kstatsviewer gpu/gpu0/usage gpu/gpu0/usage 0 > - nvidia-smi --query ```~ ❯ nvidia-smi --query ==============NVSMI LOG============== Timestamp : Mon Feb 13 11:04:24 2023 Driver Version : 525.89.02 CUDA Version : 12.0 Attached GPUs : 1 GPU 00000000:2B:00.0 Product Name : NVIDIA GeForce RTX 3060 Product Brand : GeForce Product Architecture : Ampere Display Mode : Enabled Display Active : Enabled Persistence Mode : Disabled MIG Mode Current : N/A Pending : N/A Accounting Mode : Disabled Accounting Mode Buffer Size : 4000 Driver Model Current : N/A Pending : N/A Serial Number : N/A GPU UUID : GPU-6fb6ec98-c3cd-94db-a6e4-6d623ebc56e5 Minor Number : 0 VBIOS Version : 94.06.14.40.45 MultiGPU Board : No Board ID : 0x2b00 Board Part Number : N/A GPU Part Number : 2503-300-A1 Module ID : 1 Inforom Version Image Version : G001.0000.03.03 OEM Object : 2.0 ECC Object : N/A Power Management Object : N/A GPU Operation Mode Current : N/A Pending : N/A GSP Firmware Version : N/A GPU Virtualization Mode Virtualization Mode : None Host VGPU Mode : N/A IBMNPU Relaxed Ordering Mode : N/A PCI Bus : 0x2B Device : 0x00 Domain : 0x0000 Device Id : 0x250310DE Bus Id : 00000000:2B:00.0 Sub System Id : 0x36573842 GPU Link Info PCIe Generation Max : 4 Current : 4 Device Current : 4 Device Max : 4 Host Max : 4 Link Width Max : 16x Current : 16x Bridge Chip Type : N/A Firmware : N/A Replays Since Reset : 0 Replay Number Rollovers : 0 Tx Throughput : 7000 KB/s Rx Throughput : 628000 KB/s Atomic Caps Inbound : N/A Atomic Caps Outbound : N/A Fan Speed : 0 % Performance State : P3 Clocks Throttle Reasons Idle : Active Applications Clocks Setting : Not Active SW Power Cap : Not Active HW Slowdown : Not Active HW Thermal Slowdown : Not Active HW Power Brake Slowdown : Not Active Sync Boost : Not Active SW Thermal Slowdown : Not Active Display Clock Setting : Not Active FB Memory Usage Total : 12288 MiB Reserved : 251 MiB Used : 1681 MiB Free : 10354 MiB BAR1 Memory Usage Total : 256 MiB Used : 68 MiB Free : 188 MiB Compute Mode : Default Utilization Gpu : 34 % Memory : 7 % Encoder : 0 % Decoder : 0 % Encoder Stats Active Sessions : 0 Average FPS : 0 Average Latency : 0 FBC Stats Active Sessions : 0 Average FPS : 0 Average Latency : 0 Ecc Mode Current : N/A Pending : N/A ECC Errors Volatile SRAM Correctable : N/A SRAM Uncorrectable : N/A DRAM Correctable : N/A DRAM Uncorrectable : N/A Aggregate SRAM Correctable : N/A SRAM Uncorrectable : N/A DRAM Correctable : N/A DRAM Uncorrectable : N/A Retired Pages Single Bit ECC : N/A Double Bit ECC : N/A Pending Page Blacklist : N/A Remapped Rows : N/A Temperature GPU Current Temp : 52 C GPU T.Limit Temp : N/A GPU Shutdown Temp : 98 C GPU Slowdown Temp : 95 C GPU Max Operating Temp : 93 C GPU Target Temperature : 83 C Memory Current Temp : N/A Memory Max Operating Temp : N/A Power Readings Power Management : Supported Power Draw : 31.90 W Power Limit : 170.00 W Default Power Limit : 170.00 W Enforced Power Limit : 170.00 W Min Power Limit : 100.00 W Max Power Limit : 190.00 W Clocks Graphics : 847 MHz SM : 847 MHz Memory : 5000 MHz Video : 780 MHz Applications Clocks Graphics : N/A Memory : N/A Default Applications Clocks Graphics : N/A Memory : N/A Deferred Clocks Memory : N/A Max Clocks Graphics : 2145 MHz SM : 2145 MHz Memory : 7501 MHz Video : 1950 MHz Max Customer Boost Clocks Graphics : N/A Clock Policy Auto Boost : N/A Auto Boost Default : N/A Voltage Graphics : 643.750 mV Fabric State : N/A Status : N/A Processes GPU instance ID : N/A Compute instance ID : N/A Process ID : 937 Type : G Name : /usr/lib/Xorg Used GPU Memory : 315 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1098 Type : G Name : /usr/bin/kwalletd5 Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1184 Type : G Name : /usr/bin/ksmserver Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1186 Type : G Name : /usr/bin/kded5 Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1187 Type : G Name : /usr/bin/kwin_x11 Used GPU Memory : 65 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1223 Type : G Name : /usr/bin/plasmashell Used GPU Memory : 46 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1271 Type : G Name : /usr/lib/polkit-kde-authentication-agent-1 Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1273 Type : G Name : /usr/lib/xdg-desktop-portal-kde Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1375 Type : G Name : /usr/bin/koi Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1379 Type : G Name : /usr/lib/kdeconnectd Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1385 Type : G Name : /usr/bin/kaccess Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1389 Type : G Name : /usr/bin/kalendarac Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1463 Type : G Name : /usr/bin/akonadi_control Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1532 Type : G Name : /usr/bin/akonadi_akonotes_resource Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1533 Type : G Name : /usr/bin/akonadi_birthdays_resource Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1534 Type : G Name : /usr/bin/akonadi_contacts_resource Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1535 Type : G Name : /usr/bin/akonadi_davgroupware_resource Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1536 Type : G Name : /usr/bin/akonadi_ical_resource Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1537 Type : G Name : /usr/bin/akonadi_indexing_agent Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1539 Type : G Name : /usr/bin/akonadi_maildir_resource Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1545 Type : G Name : /usr/bin/akonadi_maildispatcher_agent Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1547 Type : G Name : /usr/bin/akonadi_migration_agent Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1548 Type : G Name : /usr/bin/akonadi_newmailnotifier_agent Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 1655 Type : G Name : /usr/lib/kf5/kiod5 Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 2763 Type : G Name : /home/mossy/.local/share/Steam/ubuntu12_32/steam Used GPU Memory : 106 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 2791 Type : G Name : /home/mossy/.local/share/Steam/ubuntu12_64/steamwebhelper --type=gpu-process --field-trial-handle=8188677406891853571,3726179146442060566,131072 --enable-features=CastMediaRouteProvider --disable-features=SameSiteByDefaultCookies --disable-seccomp-filter-sandbox --use-cmd-decoder=passthrough --use-angle=gl --log-file=/home/mossy/.local/share/Steam/logs/cef_log.txt --product-version=Valve Steam Client --lang=en_US.UTF-8 --gpu-preferences=MAAAAAAAAAAgAAAQAAAAAAAAAAAAAAAAAABgAAAAAAAQAAAAAAAAAAAAAAAAAAAACAAAAAAAAAA= --log-file=/home/mossy/.local/share/Steam/logs/cef_log.txt --shared-files Used GPU Memory : 127 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 4074 Type : C+G Name : Z:\mnt\overflow\Games\Steam Library\steamapps\common\Cookie Clicker\Cookie Clicker.exe Used GPU Memory : 672 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 4496 Type : G Name : /usr/lib/firefox/firefox Used GPU Memory : 117 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 4757 Type : G Name : /usr/bin/plasma-browser-integration-host Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 7163 Type : G Name : /opt/discord/Discord --type=gpu-process --field-trial-handle=10103583813706683152,18265629675230651765,131072 --disable-features=CookiesWithoutSameSiteMustBeSecure,HardwareMediaKeyHandling,MediaSessionService,SameSiteByDefaultCookies,SpareRendererForSitePerProcess,WinRetrieveSuggestionsOnlyOnDemand --enable-crash-reporter=4a74e4c1-4496-497c-a85c-4b73007afc63,no_channel --global-crash-keys=4a74e4c1-4496-497c-a85c-4b73007afc63,no_channel,_companyName=Discord Inc.,_productName=Discord,_version=0.0.24 --gpu-preferences=UAAAAAAAAAAgAAAQAAAAAAAAAAAAAAAAAABgAAAAAAAwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAgAAAAAAAAACAAAAAAAAAAAAAAAAAAAAAIAAAAAAAAAAgAAAAAAAAACAAAAAAAAAA= --shared-files Used GPU Memory : 23 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 8265 Type : G Name : /opt/SchildiChat/schildichat-desktop --type=gpu-process --enable-crash-reporter=51980db1-0abc-4434-b228-7ff0f6dbc976,no_channel --user-data-dir=/home/mossy/.config/SchildiChat --gpu-preferences=WAAAAAAAAAAgAAAIAAAAAAAAAAAAAAAAAABgAAAAAAA4AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAAAAAAAAAIAAAAAAAAAABAAAAAAAAAAgAAAAAAAAACAAAAAAAAAAIAAAAAAAAAA== --shared-files --field-trial-handle=0,i,16603587272151528517,15370999515174459645,131072 --enable-features=WebRTCPipeWireCapturer --disable-features=HardwareMediaKeyHandling,MediaSessionService,SpareRendererForSitePerProcess Used GPU Memory : 31 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 9257 Type : G Name : /usr/lib/baloorunner Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 9961 Type : G Name : /usr/bin/alacritty Used GPU Memory : 8 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 15006 Type : G Name : /usr/lib/thunderbird/thunderbird Used GPU Memory : 83 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 19959 Type : G Name : /usr/bin/kate Used GPU Memory : 2 MiB GPU instance ID : N/A Compute instance ID : N/A Process ID : 19995 Type : G Name : /usr/lib/kf5/kioslave5 Used GPU Memory : 2 MiB``` > - nvidia-smi dmon -s pucm (kill after a few lines) ```~ ❯ nvidia-smi dmon -s pucm # gpu pwr gtemp mtemp sm mem enc dec mclk pclk fb bar1 # Idx W C C % % % % MHz MHz MB MB 0 33 53 - 25 6 0 0 5000 1140 1609 68 0 33 53 - 25 6 0 0 5000 1132 1611 68 0 33 53 - 37 7 0 0 5000 1132 1611 68 0 33 53 - 25 7 0 0 5000 1162 1611 68 0 33 53 - 24 6 0 0 810 825 1611 68 ^C⏎``` I have a EVGA GeForce RTX 3060 I verified that parsing of your pasted output works. If you run 'ksystemstats' are there any warnings about gpus printed? (In reply to David Redondo from comment #24) > I verified that parsing of your pasted output works. > > If you run 'ksystemstats' are there any warnings about gpus printed? Yes, it says 'Could not retrieve information for NVidia GPU "0000:2b:00.0"' Thanks I expected something like that, even though it doesn't make sense to me since from the nvidia-smi output it should get 0000:2b:00.0... Will investigate further A possibly relevant merge request was started @ https://invent.kde.org/plasma/ksystemstats/-/merge_requests/50 Git commit d58d03ecf14b1f879a982c9a431384a30033f230 by David Redondo. Committed on 14/02/2023 at 14:38. Pushed by davidre into branch 'master'. gpu/nvidia: lower pci id nvidia-smi uses upper case letters, udev lower case FIXED-IN:5.27.1 M +1 -1 plugins/gpu/NvidiaSmiProcess.cpp https://invent.kde.org/plasma/ksystemstats/commit/d58d03ecf14b1f879a982c9a431384a30033f230 Git commit 63375432b8dbba59746cbffd4a74a96764239373 by David Redondo. Committed on 14/02/2023 at 14:46. Pushed by davidre into branch 'cherry-pick-d58d03ec'. gpu/nvidia: lower pci id nvidia-smi uses upper case letters, udev lower case FIXED-IN:5.27.1 (cherry picked from commit d58d03ecf14b1f879a982c9a431384a30033f230) M +1 -1 plugins/gpu/NvidiaSmiProcess.cpp https://invent.kde.org/plasma/ksystemstats/commit/63375432b8dbba59746cbffd4a74a96764239373 I'll have to try this out. Will this hit the [kde-unstable] repository or will I need to check plasma-systemmonitor-git on the AUR? It will not be part of the initial release but 27.1 which happens in a week. I wish to inform you all that I have updated to 5.27.1 and it is working as expected now. Also working for me. Thanks, everyone. *** Bug 469298 has been marked as a duplicate of this bug. *** Still have this issue: - GPU 1 Frequency: 0 HHz - GPU 1 Temperature: 0 - GPU 1 Usage: 0% - GPU 1 Video Memory Used: "" (Nothing displayed) - GPU1 Total Video Memory: 56.0 GiB $ neofetch --stdout OS: NixOS 23.11.20230706.3c74875 (Tapir) x86_64 Host: ASUSTeK COMPUTER INC. TUF GAMING X670E-PLUS Kernel: 6.4.1 Uptime: 52 mins Packages: 3368 (nix-system), 217 (nix-user) Shell: zsh 5.9 Resolution: 3840x2160 DE: Plasma 5.27.6 WM: KWin Icons: Papirus-Dark [GTK2/3] Terminal: .konsole-wrappe CPU: AMD Ryzen 9 7950X (32) @ 4.500GHz GPU: NVIDIA GeForce RTX 4090 Memory: 8596MiB / 64033MiB See https://bugs.kde.org/show_bug.cgi?id=470474 nvidia changed format again, fixed in 5.27.7 |