Bug 452119 - With an Intel iGPU, animations aren't as smooth on Wayland versus Xorg
Summary: With an Intel iGPU, animations aren't as smooth on Wayland versus Xorg
Status: RESOLVED FIXED
Alias: None
Product: kwin
Classification: Plasma
Component: performance (show other bugs)
Version: 5.24.4
Platform: Arch Linux Linux
: NOR minor
Target Milestone: ---
Assignee: KWin default assignee
URL:
Keywords:
: 416200 439511 445070 467283 (view as bug list)
Depends on:
Blocks:
 
Reported: 2022-03-31 16:02 UTC by Antonio Orefice
Modified: 2024-06-12 14:07 UTC (History)
37 users (show)

See Also:
Latest Commit:
Version Fixed In: 6.1.0
Sentry Crash Report:


Attachments
sudo intel_gpu_top > x11-idle.log (6.01 KB, text/x-log)
2022-05-07 13:38 UTC, kde.org
Details
sudo intel_gpu_top > x11-moving-window.log (2.00 KB, text/x-log)
2022-05-07 13:39 UTC, kde.org
Details
sudo intel_gpu_top > wayland-idle.log (8.91 KB, text/x-log)
2022-05-07 13:40 UTC, kde.org
Details
sudo intel_gpu_top > wayland-moving-window.log (3.56 KB, text/x-log)
2022-05-07 13:40 UTC, kde.org
Details
output of drm_info (34.37 KB, text/plain)
2022-11-02 01:15 UTC, Patrick Silva
Details
output of drm_info (Haswell-ULT Integrated Graphics Controller (rev 0b)) (37.10 KB, text/plain)
2022-11-02 07:16 UTC, Lemmiwinks
Details
drm_info (HD Intel® 4600) (24.97 KB, application/octet-stream)
2022-11-02 13:17 UTC, Podagric
Details
drm_info_output of celeron g1820 haswell (35.24 KB, text/plain)
2022-11-02 14:49 UTC, Patrick Silva
Details
drm_info output with the KWIN_DRM_USE_MODIFIERS=0 env var set (35.24 KB, text/plain)
2022-11-02 15:15 UTC, Patrick Silva
Details
output of drm_info on wayland (Haswell-ULT Integrated Graphics Controller (rev 0b)) (37.12 KB, text/plain)
2022-11-05 17:55 UTC, Lemmiwinks
Details
settings (18.83 KB, image/png)
2023-06-27 13:00 UTC, Lyubomir
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Antonio Orefice 2022-03-31 16:02:47 UTC
Today i switched from x11 to wayland, but unfortunately windows keep stuttering when doing basic movement/maximizing.

I'd say the effect is more pronunced when there are more windows on screen.
If i go for low latency, even dragging a window with wobbly effect on stutters.
I may not have a fast gpu, granted, it is an haswell igp, but under xorg, it is butter smooth in driving 2 monitors at 1920x1080.

To have smooth animations with not so much frames dropped, i've to use the highest latency setting.

While intel_gpu_top reports no significant gpu usage (20%), even when windows are stuttering, one thing that helps is setting the igp min frequency to an higher value; that way i can even use the balanced preset.

But what REALLY helps, is enabling the Show FPS effect.
With just that active,  i can even force the lowest latency settings, and everything is magically smooth like under Xorg.

SOFTWARE/OS VERSIONS
KDE Plasma Version: 5.24.4
KDE Frameworks Version: 5.92.0
Qt Version: 5.15.3
Comment 1 Patrick Silva 2022-04-01 14:39:54 UTC
I also use the iGPU of an haswell cpu (celeron g1820) and can confirm worse performance on Wayland compared to X11.
Even video playback is laggy with several different players despite VAAPI acceleration is working according to vainfo command. Disabling blur effect improved the situation a bit on my system at least.
Comment 2 Patrick Silva 2022-04-01 15:17:04 UTC
My versions:

Operating System: Arch Linux
KDE Plasma Version: 5.24.4
KDE Frameworks Version: 5.92.0
Qt Version: 5.15.3
Kernel Version: 5.17.1-arch1-1 (64-bit)
Graphics Platform: Wayland
Processors: 2 × Intel® Celeron® CPU G1820 @ 2.70GHz
Memory: 7,6 GiB of RAM
Graphics Processor: Mesa Intel® HD Graphics
Comment 3 Zamundaaa 2022-04-02 01:07:08 UTC
> But what REALLY helps, is enabling the Show FPS effect

Does the GPU or memory clock go up if you enable it?
Comment 4 Antonio Orefice 2022-04-02 06:13:53 UTC
(In reply to Zamundaaa from comment #3)
> > But what REALLY helps, is enabling the Show FPS effect
> 
> Does the GPU or memory clock go up if you enable it?

If I remember right, it goes from 2mhz to about 200mhz.
I'll check it by monday.
Comment 5 Antonio Orefice 2022-04-04 12:38:05 UTC
(In reply to Zamundaaa from comment #3)
> > But what REALLY helps, is enabling the Show FPS effect
> 
> Does the GPU or memory clock go up if you enable it?

Enabling show fps effects bump the gpu use to about 150mhz.
Which is a weird value, because the lowest non-idle setting should be 200mhz, so i suspect it is averaging results.
By the way,  the desktop is almost smooth everytime with show fps effeca and with lowest latency.

However if i turn off "show fps" and artificially put the gpu mhz to a higher value state with glxgears to 200mhz, then while it is a bit better than without glxgears, it is definitely not smooth as with the show fps effect running, so i suspect is not just a matter of gpu idling too much.
Comment 6 Zamundaaa 2022-04-04 13:50:18 UTC
What does the memory clock do with the fps effect? GPU core clock is not the single deciding factor on performance
Comment 7 Antonio Orefice 2022-04-04 14:24:05 UTC
(In reply to Zamundaaa from comment #6)
> What does the memory clock do with the fps effect? GPU core clock is not the
> single deciding factor on performance

If you are asking me, I've no idea.
I answered to your previous question: "Does the GPU or memory clock go up if you enable it?"
I don't know which go up when i enable the Show FPS effect, because intel gpu utils refers to a generic "gpu frequency".
However, since you asked about one of those frequencies, i answered Yes, because intel_gpu_top reports an higher frequency when i enable FPS effect.
Note that the actual minimum frequency for non idle state is 200mhz; in idle state the gpu reaches 1mhz.
If when i enable FPS effect i get something lower, it probabily means that it is just waking up the gpu from time to time.

Could you please  be more specific? I'm having hard times trying to understand your point.
Comment 8 Antonio Orefice 2022-04-04 14:26:59 UTC
Aye, i've read better, sorry.
I don't know how to read current memory clock speed. Do you have advices?
Comment 9 Zamundaaa 2022-04-04 17:04:51 UTC
I don't know how to check the memory clock on Intel systems, searching for it doesn't yield results for the dynamic memory speed either.

We can just check for it differently though: If you start glxgears and leave that running instead of the fps effect, does that make KWin run smooth?
Comment 10 Antonio Orefice 2022-04-04 17:13:25 UTC
(In reply to Zamundaaa from comment #9)
> I don't know how to check the memory clock on Intel systems, searching for
> it doesn't yield results for the dynamic memory speed either.
> 
> We can just check for it differently though: If you start glxgears and leave
> that running instead of the fps effect, does that make KWin run smooth?

Already did, please see end of comment 5.
It does, but marginally and frequency is even higher.
Comment 11 kde.org 2022-05-07 13:38:11 UTC
Created attachment 148636 [details]
sudo intel_gpu_top > x11-idle.log

Output of intel_gpu_top for an idle x11 desktop
Comment 12 kde.org 2022-05-07 13:39:08 UTC
Created attachment 148637 [details]
sudo intel_gpu_top > x11-moving-window.log

Output of intel_gpu_top for a window being moved on an X11 desktop
Comment 13 kde.org 2022-05-07 13:40:03 UTC
Created attachment 148638 [details]
sudo intel_gpu_top > wayland-idle.log

Output of intel_gpu_top for an idle Wayland desktop
Comment 14 kde.org 2022-05-07 13:40:50 UTC
Created attachment 148640 [details]
sudo intel_gpu_top > wayland-moving-window.log

Output of intel_gpu_top for a window being moved on a Wayland desktop
Comment 15 kde.org 2022-05-07 14:18:31 UTC
I also noticed a significant slowdown and increased choppiness when switching from X11 to Wayland.
I ran intel_cpu_info on both desktops and attached the output for idle desktops and while moving/dragging a chrome browser window around with the mouse. When run interactively, intel_gpu_top shows values of 1 Mhz and about 220 Mhz for the idle and moving window szenario on X11, respctively.  For Wayland, the numbers are 5-10 Mhz and about 400 Mhz.

When the output is logged to a file, the numbers look differently for idle case, though. It looks as if the frequency actually drops a lot more often to single digit values on wayland in idle, whereas it seems to stay in the 100-200 Mhz range on an idle desktop in X11.
When moving the window around, we see much higher gpu frequencies on Wayland (around 400Mhz) vs around 250Mhz on X11.
So it seems that wayland requires higher gpu frequencies and while still not achieving the smooth result of X11.

I did enable the show fps effect, but noticed only a small improvement. I did notice, however, that dragging a window around with this effect active causes much higher gpu frequencies (around 700 Mhz).

Please note, that the numbers should only be taken as an indication, as the tests were conducted using manual input/mouse movements. while other applications have been running. They do show, however, that wayland currently puts a much higher burden on this intel GPU.

I don't know much about gfx drives on linux. Is the same driver stack used for wayland vs X11? Just wondering if the likes of mesa might have an influence on this, or if this is definitely a kwin-wayland vs kwin-x11 issue.

Please let me know if there is anything else that I can do to provide more information to help get to the cause of this issue.
Comment 16 kde.org 2022-05-07 14:28:40 UTC
(In reply to Zamundaaa from comment #9)
> I don't know how to check the memory clock on Intel systems, searching for
> it doesn't yield results for the dynamic memory speed either.
> 
> We can just check for it differently though: If you start glxgears and leave
> that running instead of the fps effect, does that make KWin run smooth?

As this is a built-in GPU without dedicated memory, I don't think that the memory clock actually changes depending on the GPU load. The  memory is shared with the CPU, so the memory clock should not change.
Leaving glxgears running, it achieves about 60fps. When I start moving a window the FPS drop to about 30-40 and the window movement is still choppy and not smooth.
Comment 17 Zamundaaa 2022-05-16 21:53:34 UTC
Integrated GPUs do very much influence the used memory frequency, it has a large effect on performance and battery life.
But glxgears running should already put memory frequency to the maximum, it certainly does that on my AMD laptop.
Comment 18 Lemmiwinks 2022-06-20 19:51:20 UTC
I'm having the same problem. Using the following setup:
Operating System: Kubuntu 22.04
KDE Plasma Version: 5.24.5
KDE Frameworks Version: 5.94.0
Qt Version: 5.15.3
Kernel Version: 5.15.0-33-generic (64-bit)
Processors: 4 × Intel® Core™ i7-4600U CPU @ 2.10GHz
Memory: 15,1 GiB of RAM
Graphics Processor: Mesa Intel® HD Graphics 4400

Wayland performance was alright with Plasma 5.18.
Comment 19 Nate Graham 2022-08-09 15:26:10 UTC
I was experiencing this issue and found that the cause was actually Firefox forcing kwin to constantly repaint the screen; see https://bugzilla.mozilla.org/show_bug.cgi?id=1754810. Setting `widget.wayland.vsync.enabled` to `false` on Firefox's about:config page fixed the issue by making it no longer constantly repaint.

For people who are affected, by this issue, can you turn on the "Show Paint" KWin effect, set a keyboard shortcut for it, and hit that keyboard shortcut? If you see that the screen constantly flickers, then you're having the same issue of *something* forcing the screen to constantly repaint.

There might be opportunities for KWin itself to handle this better, but apps also need to cooperate.
Comment 20 Lemmiwinks 2022-08-10 16:25:27 UTC
(In reply to Nate Graham from comment #19)
> I was experiencing this issue and found that the cause was actually Firefox
> forcing kwin to constantly repaint the screen; see
> https://bugzilla.mozilla.org/show_bug.cgi?id=1754810. Setting
> `widget.wayland.vsync.enabled` to `false` on Firefox's about:config page
> fixed the issue by making it no longer constantly repaint.
> 
> For people who are affected, by this issue, can you turn on the "Show Paint"
> KWin effect, set a keyboard shortcut for it, and hit that keyboard shortcut?
> If you see that the screen constantly flickers, then you're having the same
> issue of *something* forcing the screen to constantly repaint.
> 
> There might be opportunities for KWin itself to handle this better, but apps
> also need to cooperate.

I tried what you suggested and turned on the "Show Paint" effect. The only difference between wayland an xorg seems that on wayland the screen also flickers when just moving the cursor. But the screen does not flicker constantly, so there is nothing causing constant repaints. 
I also switched to Crocus driver by setting export MESA_LOADER_DRIVER_OVERRIDE=crocus in ~/.profile but it does not help.
The same issue exists in Opensuse Leap 15.4.
Comment 21 Zamundaaa 2022-11-02 01:00:18 UTC
Can everyone that's affected attach the output of drm_info (https://gitlab.freedesktop.org/emersion/drm_info) on their system?
Comment 22 Patrick Silva 2022-11-02 01:15:08 UTC
Created attachment 153392 [details]
output of drm_info
Comment 23 Zamundaaa 2022-11-02 01:57:49 UTC
Ok. Can you check if any of these environment variables (only one at a time) make a difference?
KWIN_DRM_NO_AMS=1
KWIN_DRM_USE_MODIFIERS=0
KWIN_DRM_PREFER_COLOR_DEPTH=24
Comment 24 Lemmiwinks 2022-11-02 07:16:29 UTC
Created attachment 153395 [details]
output of drm_info (Haswell-ULT Integrated Graphics Controller (rev 0b))

Here is my output of drm_info. Going to check your suggestions soon.
Thanks!
Comment 25 Podagric 2022-11-02 13:17:50 UTC
Created attachment 153399 [details]
drm_info (HD Intel® 4600)
Comment 26 Podagric 2022-11-02 13:28:11 UTC
(In reply to Zamundaaa from comment #23)
> Ok. Can you check if any of these environment variables (only one at a time)

no difference
Comment 27 Zamundaaa 2022-11-02 13:46:33 UTC
I'm afraid this will need to be brute forced then; if performance was fine in 5.18 but not now then a bisect would be useful. As that's pretty much impossible to do over such a large range of versions, can you boot a few old ISOs on your hardware and see with what the last Plasma version is where performance was good?
Comment 28 Patrick Silva 2022-11-02 14:22:25 UTC
I have tested the env variables by adding them to /env/environment file. Plasma animations are more fluid when I use the first two options, but video playback with mpv, vlc and gstreamer-based players is still stuttering. There is no difference with the third option.

Operating System: Arch Linux
KDE Plasma Version: 5.26.2
KDE Frameworks Version: 5.99.0
Qt Version: 5.15.7
Kernel Version: 6.0.6-arch1-1 (64-bit)
Graphics Platform: Wayland
Processors: 2 × Intel® Celeron® CPU G1820 @ 2.70GHz
Memory: 7,6 GiB of RAM
Graphics Processor: Mesa Intel® HD Graphics
Comment 29 Patrick Silva 2022-11-02 14:35:21 UTC
And kwin_wayland crashed once when I minimized a window with click on the task manager while using KWIN_DRM_USE_MODIFIERS=0



Thread 12 (Thread 0x7f15968976c0 (LWP 3945)):
#0  0x00007f15ed921096 in epoll_wait (epfd=89, events=events@entry=0x7f1596896770, maxevents=32, timeout=-1) at ../sysdeps/unix/sysv/linux/epoll_wait.c:30
#1  0x00007f15ab290669 in impl_pollfd_wait (object=<optimized out>, pfd=<optimized out>, ev=0x7f1596896940, n_ev=<optimized out>, timeout=<optimized out>) at ../pipewire/spa/plugins/support/system.c:157
#2  0x00007f15ab282b7b in loop_iterate (object=0x560fa85a7ae8, timeout=-1) at ../pipewire/spa/plugins/support/loop.c:401
#3  0x00007f15ee560317 in do_loop (user_data=0x560fa851df20) at ../pipewire/src/pipewire/data-loop.c:81
#4  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#5  0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 11 (Thread 0x7f15cafff6c0 (LWP 3550)):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x7f15caffea60, op=137, expected=0, futex_word=0x560fa70ecee4) at futex-internal.c:57
#1  __futex_abstimed_wait_common (futex_word=futex_word@entry=0x560fa70ecee4, expected=expected@entry=0, clockid=clockid@entry=1, abstime=abstime@entry=0x7f15caffea60, private=private@entry=0, cancel=cancel@entry=true) at futex-internal.c:87
#2  0x00007f15ed89c51f in __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x560fa70ecee4, expected=expected@entry=0, clockid=clockid@entry=1, abstime=abstime@entry=0x7f15caffea60, private=private@entry=0) at futex-internal.c:139
#3  0x00007f15ed89efd4 in __pthread_cond_wait_common (abstime=0x7f15caffea60, clockid=1, mutex=0x560fa70ece90, cond=0x560fa70eceb8) at pthread_cond_wait.c:503
#4  ___pthread_cond_timedwait64 (cond=0x560fa70eceb8, mutex=0x560fa70ece90, abstime=0x7f15caffea60) at pthread_cond_wait.c:643
#5  0x00007f15ee6eb714 in QWaitConditionPrivate::wait_relative(QDeadlineTimer) (deadline=..., this=0x560fa70ece90) at thread/qwaitcondition_unix.cpp:136
#6  QWaitConditionPrivate::wait(QDeadlineTimer) (deadline=..., this=0x560fa70ece90) at thread/qwaitcondition_unix.cpp:144
#7  QWaitCondition::wait(QMutex*, QDeadlineTimer) (this=<optimized out>, mutex=0x560fa70e7228, deadline=...) at thread/qwaitcondition_unix.cpp:225
#8  0x00007f15ee6e82b7 in QThreadPoolThread::run() (this=0x560fa70f2c30) at thread/qthreadpool.cpp:140
#9  0x00007f15ee6e42ea in QThreadPrivate::start(void*) (arg=0x560fa70f2c30) at thread/qthread_unix.cpp:330
#10 0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#11 0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 10 (Thread 0x7f159e3316c0 (LWP 3929)):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f159ebcb550) at futex-internal.c:57
#1  __futex_abstimed_wait_common (futex_word=futex_word@entry=0x7f159ebcb550, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0, cancel=cancel@entry=true) at futex-internal.c:87
#2  0x00007f15ed89c51f in __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f159ebcb550, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at futex-internal.c:139
#3  0x00007f15ed89ecd0 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f159ebcb500, cond=0x7f159ebcb528) at pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f159ebcb528, mutex=0x7f159ebcb500) at pthread_cond_wait.c:618
#5  0x00007f15e250799e in cnd_wait () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:135
#6  0x00007f15e24baf8c in util_queue_thread_func () at ../mesa-22.2.1/src/util/u_queue.c:287
#7  0x00007f15e25078cc in impl_thrd_routine () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:67
#8  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#9  0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 9 (Thread 0x7f15ca6606c0 (LWP 3551)):
#0  0x00007f15ed9140bf in __GI___poll (fds=0x7f15b80029e0, nfds=1, timeout=-1) at ../sysdeps/unix/sysv/linux/poll.c:29
#1  0x00007f15ec6c41df in g_main_context_poll (priority=<optimized out>, n_fds=1, fds=0x7f15b80029e0, timeout=<optimized out>, context=0x7f15b8000c30) at ../glib/glib/gmain.c:4543
#2  g_main_context_iterate.constprop.0 (context=0x7f15b8000c30, block=1, dispatch=1, self=<optimized out>) at ../glib/glib/gmain.c:4233
#3  0x00007f15ec66c132 in g_main_context_iteration (context=0x7f15b8000c30, may_block=1) at ../glib/glib/gmain.c:4303
#4  0x00007f15ee8d7c4c in QEventDispatcherGlib::processEvents(QFlags<QEventLoop::ProcessEventsFlag>) (this=0x7f15b8000b70, flags=...) at kernel/qeventdispatcher_glib.cpp:423
#5  0x00007f15ee88573c in QEventLoop::exec(QFlags<QEventLoop::ProcessEventsFlag>) (this=0x7f15ca65fae0, flags=...) at ../../include/QtCore/../../src/corelib/global/qflags.h:69
#6  0x00007f15ee6e721f in QThread::exec() (this=this@entry=0x560fa7248520) at ../../include/QtCore/../../src/corelib/global/qflags.h:121
#7  0x00007f15ef6c0370 in QQmlThreadPrivate::run() (this=0x560fa7248520) at /usr/src/debug/qtdeclarative/src/qml/qml/ftw/qqmlthread.cpp:155
#8  0x00007f15ee6e42ea in QThreadPrivate::start(void*) (arg=0x560fa7248520) at thread/qthread_unix.cpp:330
#9  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#10 0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 8 (Thread 0x7f15e89ff6c0 (LWP 3544)):
#0  0x00007f15ed9140bf in __GI___poll (fds=0x7f15e40053e0, nfds=3, timeout=2018) at ../sysdeps/unix/sysv/linux/poll.c:29
#1  0x00007f15ec6c41df in g_main_context_poll (priority=<optimized out>, n_fds=3, fds=0x7f15e40053e0, timeout=<optimized out>, context=0x7f15e4001cf0) at ../glib/glib/gmain.c:4543
#2  g_main_context_iterate.constprop.0 (context=0x7f15e4001cf0, block=1, dispatch=1, self=<optimized out>) at ../glib/glib/gmain.c:4233
#3  0x00007f15ec66c132 in g_main_context_iteration (context=0x7f15e4001cf0, may_block=1) at ../glib/glib/gmain.c:4303
#4  0x00007f15ee8d7c4c in QEventDispatcherGlib::processEvents(QFlags<QEventLoop::ProcessEventsFlag>) (this=0x7f15e4000b70, flags=...) at kernel/qeventdispatcher_glib.cpp:423
#5  0x00007f15ee88573c in QEventLoop::exec(QFlags<QEventLoop::ProcessEventsFlag>) (this=0x7f15e89fead0, flags=...) at ../../include/QtCore/../../src/corelib/global/qflags.h:69
#6  0x00007f15ee6e721f in QThread::exec() (this=this@entry=0x7f15f073f560 <(anonymous namespace)::Q_QGS__q_manager::innerFunction()::holder>) at ../../include/QtCore/../../src/corelib/global/qflags.h:121
#7  0x00007f15f06e3cba in QDBusConnectionManager::run() (this=0x7f15f073f560 <(anonymous namespace)::Q_QGS__q_manager::innerFunction()::holder>) at /usr/src/debug/qtbase/src/dbus/qdbusconnection.cpp:179
#8  0x00007f15ee6e42ea in QThreadPrivate::start(void*) (arg=0x7f15f073f560 <(anonymous namespace)::Q_QGS__q_manager::innerFunction()::holder>) at thread/qthread_unix.cpp:330
#9  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#10 0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 7 (Thread 0x7f15c8a456c0 (LWP 3554)):
#0  0x00007f15ed9140bf in __GI___poll (fds=0x7f15bc0029e0, nfds=1, timeout=-1) at ../sysdeps/unix/sysv/linux/poll.c:29
#1  0x00007f15ec6c41df in g_main_context_poll (priority=<optimized out>, n_fds=1, fds=0x7f15bc0029e0, timeout=<optimized out>, context=0x7f15bc000c30) at ../glib/glib/gmain.c:4543
#2  g_main_context_iterate.constprop.0 (context=0x7f15bc000c30, block=1, dispatch=1, self=<optimized out>) at ../glib/glib/gmain.c:4233
#3  0x00007f15ec66c132 in g_main_context_iteration (context=0x7f15bc000c30, may_block=1) at ../glib/glib/gmain.c:4303
#4  0x00007f15ee8d7c4c in QEventDispatcherGlib::processEvents(QFlags<QEventLoop::ProcessEventsFlag>) (this=0x7f15bc000b70, flags=...) at kernel/qeventdispatcher_glib.cpp:423
#5  0x00007f15ee88573c in QEventLoop::exec(QFlags<QEventLoop::ProcessEventsFlag>) (this=0x7f15c8a44ae0, flags=...) at ../../include/QtCore/../../src/corelib/global/qflags.h:69
#6  0x00007f15ee6e721f in QThread::exec() (this=this@entry=0x560fa7b80150) at ../../include/QtCore/../../src/corelib/global/qflags.h:121
#7  0x00007f15ef6c0370 in QQmlThreadPrivate::run() (this=0x560fa7b80150) at /usr/src/debug/qtdeclarative/src/qml/qml/ftw/qqmlthread.cpp:155
#8  0x00007f15ee6e42ea in QThreadPrivate::start(void*) (arg=0x560fa7b80150) at thread/qthread_unix.cpp:330
#9  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#10 0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 6 (Thread 0x7f15c929c6c0 (LWP 3553)):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f15c929d550) at futex-internal.c:57
#1  __futex_abstimed_wait_common (futex_word=futex_word@entry=0x7f15c929d550, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0, cancel=cancel@entry=true) at futex-internal.c:87
#2  0x00007f15ed89c51f in __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f15c929d550, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at futex-internal.c:139
#3  0x00007f15ed89ecd0 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f15c929d500, cond=0x7f15c929d528) at pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f15c929d528, mutex=0x7f15c929d500) at pthread_cond_wait.c:618
#5  0x00007f15e250799e in cnd_wait () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:135
#6  0x00007f15e24baf8c in util_queue_thread_func () at ../mesa-22.2.1/src/util/u_queue.c:287
#7  0x00007f15e25078cc in impl_thrd_routine () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:67
#8  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#9  0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 5 (Thread 0x7f15d8bfe6c0 (LWP 3549)):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f15d9e9b550) at futex-internal.c:57
#1  __futex_abstimed_wait_common (futex_word=futex_word@entry=0x7f15d9e9b550, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0, cancel=cancel@entry=true) at futex-internal.c:87
#2  0x00007f15ed89c51f in __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f15d9e9b550, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at futex-internal.c:139
#3  0x00007f15ed89ecd0 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f15d9e9b500, cond=0x7f15d9e9b528) at pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f15d9e9b528, mutex=0x7f15d9e9b500) at pthread_cond_wait.c:618
#5  0x00007f15e250799e in cnd_wait () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:135
#6  0x00007f15e24baf8c in util_queue_thread_func () at ../mesa-22.2.1/src/util/u_queue.c:287
#7  0x00007f15e25078cc in impl_thrd_routine () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:67
#8  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#9  0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 4 (Thread 0x7f15d93ff6c0 (LWP 3548)):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f15e22eb550) at futex-internal.c:57
#1  __futex_abstimed_wait_common (futex_word=futex_word@entry=0x7f15e22eb550, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0, cancel=cancel@entry=true) at futex-internal.c:87
#2  0x00007f15ed89c51f in __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f15e22eb550, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at futex-internal.c:139
#3  0x00007f15ed89ecd0 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f15e22eb500, cond=0x7f15e22eb528) at pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f15e22eb528, mutex=0x7f15e22eb500) at pthread_cond_wait.c:618
#5  0x00007f15e250799e in cnd_wait () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:135
#6  0x00007f15e24baf8c in util_queue_thread_func () at ../mesa-22.2.1/src/util/u_queue.c:287
#7  0x00007f15e25078cc in impl_thrd_routine () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:67
#8  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#9  0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 3 (Thread 0x7f15daf566c0 (LWP 3546)):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x560fa6e0bac8) at futex-internal.c:57
#1  __futex_abstimed_wait_common (futex_word=futex_word@entry=0x560fa6e0bac8, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0, cancel=cancel@entry=true) at futex-internal.c:87
#2  0x00007f15ed89c51f in __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x560fa6e0bac8, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at futex-internal.c:139
#3  0x00007f15ed89ecd0 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x560fa6e0ba78, cond=0x560fa6e0baa0) at pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x560fa6e0baa0, mutex=0x560fa6e0ba78) at pthread_cond_wait.c:618
#5  0x00007f15e250799e in cnd_wait () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:135
#6  0x00007f15e24baf8c in util_queue_thread_func () at ../mesa-22.2.1/src/util/u_queue.c:287
#7  0x00007f15e25078cc in impl_thrd_routine () at ../mesa-22.2.1/src/c11/impl/threads_posix.c:67
#8  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#9  0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 2 (Thread 0x7f15da7556c0 (LWP 3547)):
#0  0x00007f15ed9140bf in __GI___poll (fds=0x7f15cc0029e0, nfds=2, timeout=-1) at ../sysdeps/unix/sysv/linux/poll.c:29
#1  0x00007f15ec6c41df in g_main_context_poll (priority=<optimized out>, n_fds=2, fds=0x7f15cc0029e0, timeout=<optimized out>, context=0x7f15cc000c30) at ../glib/glib/gmain.c:4543
#2  g_main_context_iterate.constprop.0 (context=0x7f15cc000c30, block=1, dispatch=1, self=<optimized out>) at ../glib/glib/gmain.c:4233
#3  0x00007f15ec66c132 in g_main_context_iteration (context=0x7f15cc000c30, may_block=1) at ../glib/glib/gmain.c:4303
#4  0x00007f15ee8d7c4c in QEventDispatcherGlib::processEvents(QFlags<QEventLoop::ProcessEventsFlag>) (this=0x7f15cc000b70, flags=...) at kernel/qeventdispatcher_glib.cpp:423
#5  0x00007f15ee88573c in QEventLoop::exec(QFlags<QEventLoop::ProcessEventsFlag>) (this=0x7f15da754b00, flags=...) at ../../include/QtCore/../../src/corelib/global/qflags.h:69
#6  0x00007f15ee6e721f in QThread::exec() (this=<optimized out>) at ../../include/QtCore/../../src/corelib/global/qflags.h:121
#7  0x00007f15ee6e42ea in QThreadPrivate::start(void*) (arg=0x560fa6ed5a28) at thread/qthread_unix.cpp:330
#8  0x00007f15ed89f8fd in start_thread (arg=<optimized out>) at pthread_create.c:442
#9  0x00007f15ed921a60 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

Thread 1 (Thread 0x7f15e94120c0 (LWP 3537)):
#0  __pthread_kill_implementation (threadid=<optimized out>, signo=signo@entry=6, no_tid=no_tid@entry=0) at pthread_kill.c:44
#1  0x00007f15ed8a16b3 in __pthread_kill_internal (signo=6, threadid=<optimized out>) at pthread_kill.c:78
#2  0x00007f15ed851958 in __GI_raise (sig=sig@entry=6) at ../sysdeps/posix/raise.c:26
#3  0x00007f15ed83b53d in __GI_abort () at abort.c:79
#4  0x00007f15ef98bc8e in egl_provider_resolver (name=name@entry=0x7f15ef9b2008 <entrypoint_strings+2184> "eglQueryDmaBufModifiersEXT", providers=providers@entry=0x7ffed1394186, entrypoints=entrypoints@entry=0x7ffed139417c) at src/egl_generated_dispatch.c:3911
#5  0x00007f15ef98bcd0 in egl_single_resolver (provider=provider@entry=PROVIDER_EGL_EXT_image_dma_buf_import_modifiers, entrypoint_offset=<optimized out>, entrypoint_offset@entry=2184) at src/egl_generated_dispatch.c:3924
#6  0x00007f15ef98dd42 in epoxy_eglQueryDmaBufModifiersEXT_resolver () at src/egl_generated_dispatch.c:4711
#7  epoxy_eglQueryDmaBufModifiersEXT_global_rewrite_ptr (dpy=0x560fa6de8cb0, format=875713089, max_modifiers=0, modifiers=0x0, external_only=0x0, num_modifiers=0x7ffed1394230) at src/egl_generated_dispatch.c:5114
#8  0x0000560fa5c0bf01 in KWin::querySupportedModifiers (format=875713089, eglDisplay=0x560fa6de8cb0) at /usr/src/debug/kwin-5.26.2.1/src/plugins/screencast/screencaststream.cpp:323
#9  KWin::ScreenCastStream::createStream() (this=0x560fa82b9030) at /usr/src/debug/kwin-5.26.2.1/src/plugins/screencast/screencaststream.cpp:344
#10 0x0000560fa5c81d64 in KWin::ScreenCastStream::init() (this=0x560fa82b9030) at /usr/src/debug/kwin-5.26.2.1/src/plugins/screencast/screencaststream.cpp:295
#11 KWin::ScreencastManager::integrateStreams(KWaylandServer::ScreencastStreamV1Interface*, KWin::ScreenCastStream*) [clone .constprop.0] (waylandStream=0x560fa80b9ae0, stream=0x560fa82b9030, this=<optimized out>) at /usr/src/debug/kwin-5.26.2.1/src/plugins/screencast/screencastmanager.cpp:201
#12 0x00007f15ee8bda51 in QtPrivate::QSlotObjectBase::call(QObject*, void**) (a=0x7ffed1394c30, r=<optimized out>, this=0x560fa7af4b00, this=<optimized out>, r=<optimized out>, a=<optimized out>) at ../../include/QtCore/../../src/corelib/kernel/qobjectdefs_impl.h:398
#13 doActivate<false>(QObject*, int, void**) (sender=0x560fa6d90450, signal_index=5, argv=0x7ffed1394c30) at kernel/qobject.cpp:3919
#14 0x00007f15f018a0ba in KWaylandServer::ScreencastV1Interface::windowScreencastRequested(KWaylandServer::ScreencastStreamV1Interface*, QString const&, KWaylandServer::ScreencastV1Interface::CursorMode) (this=<optimized out>, _t1=<optimized out>, _t2=<optimized out>, _t3=<optimized out>) at /usr/src/debug/build/src/kwin_autogen/IEXH3JLKNG/moc_screencast_v1_interface.cpp:349
#15 0x00007f15f04365f6 in QtWaylandServer::zkde_screencast_unstable_v1::handle_stream_window(wl_client*, wl_resource*, unsigned int, char const*, unsigned int) (client=<optimized out>, resource=<optimized out>, stream=136, window_uuid=0x560fa80b9aa0 "{b5280121-2d48-41b8-83d2-d1bc916b13ca}", pointer=1) at /usr/src/debug/build/src/wayland/qwayland-server-zkde-screencast-unstable-v1.cpp:262
#16 0x00007f15ebc424f6 in ffi_call_unix64 () at ../src/x86/unix64.S:104
#17 0x00007f15ebc3ef5e in ffi_call_int (cif=<optimized out>, fn=<optimized out>, rvalue=<optimized out>, avalue=<optimized out>, closure=<optimized out>) at ../src/x86/ffi64.c:673
#18 0x00007f15ebc41b73 in ffi_call (cif=cif@entry=0x7ffed1394e80, fn=<optimized out>, rvalue=rvalue@entry=0x0, avalue=avalue@entry=0x7ffed1394f50) at ../src/x86/ffi64.c:710
#19 0x00007f15ed282ada in wl_closure_invoke (closure=closure@entry=0x560fa80b99c0, target=<optimized out>, target@entry=0x560fa817bc40, opcode=opcode@entry=1, data=<optimized out>, data@entry=0x560fa7c2b100, flags=2) at ../wayland-1.21.0/src/connection.c:1025
#20 0x00007f15ed287010 in wl_client_connection_data (fd=<optimized out>, mask=<optimized out>, data=<optimized out>) at ../wayland-1.21.0/src/wayland-server.c:437
#21 0x00007f15ed2859e2 in wl_event_loop_dispatch (loop=0x560fa6df8eb0, timeout=<optimized out>) at ../wayland-1.21.0/src/event-loop.c:1027
#22 0x00007f15f039756a in KWaylandServer::Display::dispatchEvents() (this=<optimized out>) at /usr/src/debug/kwin-5.26.2.1/src/wayland/display.cpp:114
#23 0x00007f15ee8bda51 in QtPrivate::QSlotObjectBase::call(QObject*, void**) (a=0x7ffed1395560, r=<optimized out>, this=0x560fa6dfd090, this=<optimized out>, r=<optimized out>, a=<optimized out>) at ../../include/QtCore/../../src/corelib/kernel/qobjectdefs_impl.h:398
#24 doActivate<false>(QObject*, int, void**) (sender=0x560fa7380d80, signal_index=3, argv=0x7ffed1395560) at kernel/qobject.cpp:3919
#25 0x00007f15ee8bf904 in QSocketNotifier::activated(QSocketDescriptor, QSocketNotifier::Type, QSocketNotifier::QPrivateSignal) (this=this@entry=0x560fa7380d80, _t1=..., _t2=<optimized out>, _t3=...) at .moc/moc_qsocketnotifier.cpp:178
#26 0x00007f15ee8bfa48 in QSocketNotifier::event(QEvent*) (this=0x560fa7380d80, e=<optimized out>) at kernel/qsocketnotifier.cpp:302
#27 0x00007f15edf78b1c in QApplicationPrivate::notify_helper(QObject*, QEvent*) (this=<optimized out>, receiver=0x560fa7380d80, e=0x7ffed1395680) at kernel/qapplication.cpp:3637
#28 0x00007f15ee88cf98 in QCoreApplication::notifyInternal2(QObject*, QEvent*) (receiver=0x560fa7380d80, event=0x7ffed1395680) at kernel/qcoreapplication.cpp:1064
#29 0x00007f15ee8d690c in QEventDispatcherUNIXPrivate::activateSocketNotifiers() (this=0x560fa6dd0910) at kernel/qeventdispatcher_unix.cpp:304
#30 0x00007f15ee8d7a01 in QEventDispatcherUNIX::processEvents(QFlags<QEventLoop::ProcessEventsFlag>) (this=<optimized out>, flags=...) at kernel/qeventdispatcher_unix.cpp:511
#31 0x0000560fa5cba3a2 in QUnixEventDispatcherQPA::processEvents(QFlags<QEventLoop::ProcessEventsFlag>) ()
#32 0x00007f15ee88573c in QEventLoop::exec(QFlags<QEventLoop::ProcessEventsFlag>) (this=0x7ffed1395810, flags=...) at ../../include/QtCore/../../src/corelib/global/qflags.h:69
#33 0x00007f15ee890269 in QCoreApplication::exec() () at ../../include/QtCore/../../src/corelib/global/qflags.h:121
#34 0x00007f15eed3a112 in QGuiApplication::exec() () at kernel/qguiapplication.cpp:1870
#35 0x00007f15edf76f2a in QApplication::exec() () at kernel/qapplication.cpp:2829
#36 0x0000560fa5bdb611 in main(int, char**) (argc=<optimized out>, argv=<optimized out>) at /usr/src/debug/kwin-5.26.2.1/src/main_wayland.cpp:613
Comment 30 Zamundaaa 2022-11-02 14:40:03 UTC
That's the same backtrace as bug 460563, it shouldn't be related to the performance problem though.

(In reply to Patrick Silva from comment #28)
> I have tested the env variables by adding them to /env/environment file.
> Plasma animations are more fluid when I use the first two options, but video
> playback with mpv, vlc and gstreamer-based players is still stuttering.
> There is no difference with the third option.
That might be something to look further into. Was the drm_info output you attached created in a Wayland session? There's some information missing in it that could be useful
Comment 31 Patrick Silva 2022-11-02 14:49:02 UTC
Created attachment 153404 [details]
drm_info_output of celeron g1820 haswell

(In reply to Zamundaaa from comment #30)
> That's the same backtrace as bug 460563, it shouldn't be related to the
> performance problem though.
> 
> (In reply to Patrick Silva from comment #28)
> > I have tested the env variables by adding them to /env/environment file.
> > Plasma animations are more fluid when I use the first two options, but video
> > playback with mpv, vlc and gstreamer-based players is still stuttering.
> > There is no difference with the third option.
> That might be something to look further into. Was the drm_info output you
> attached created in a Wayland session? There's some information missing in
> it that could be useful

I got the previously attached output while logged in with another machine via ssh. Now I'm attaching a new output gotten locally.
Comment 32 Zamundaaa 2022-11-02 15:05:33 UTC
Thanks. Can you also attach the output of drm_info with the KWIN_DRM_USE_MODIFIERS=0 env var set?
Comment 33 Patrick Silva 2022-11-02 15:15:42 UTC
Created attachment 153405 [details]
drm_info output with the KWIN_DRM_USE_MODIFIERS=0 env var set
Comment 34 Zamundaaa 2022-11-02 15:36:04 UTC
On your hardware Mesa chooses the same buffer format and layout with and without modifiers. That's quite unexpected, there's nothing here that could cause a difference in performance
Comment 35 Podagric 2022-11-04 20:16:58 UTC
I tested the wayland section of gnome on my hardware and had performance problems just like with plasma.

Searching a bit more I saw that there is a version of mutter with a patch to enable the tripple buffer, and after installing it, the experience was much better.

As I understand it, kwin does not have this in wayland, correct? only in xorg.
Comment 36 Lemmiwinks 2022-11-05 17:55:46 UTC
Created attachment 153497 [details]
output of drm_info on wayland (Haswell-ULT Integrated Graphics Controller (rev 0b))

This is my drm_info output on Wayland, the first one was X11...
Comment 37 Joshua T 2022-11-15 15:42:31 UTC
I just tried Fedora KDE Rawhide with Wayland, and I didn't have any issues on my Intel UHD 600. Animations were very close to 60 FPS. I have definitely seen this problem in the past. Or is Fedora doing something differently?
Comment 38 Antonio Orefice 2022-11-15 16:06:28 UTC
(In reply to Joshua T from comment #37)
> I just tried Fedora KDE Rawhide with Wayland, and I didn't have any issues
> on my Intel UHD 600. Animations were very close to 60 FPS. I have definitely
> seen this problem in the past. Or is Fedora doing something differently?

Are you unable to spot any performance degradation versus Xorg then?
Comment 39 Lemmiwinks 2022-11-15 21:42:34 UTC
(In reply to Joshua T from comment #37)
> I just tried Fedora KDE Rawhide with Wayland, and I didn't have any issues
> on my Intel UHD 600. Animations were very close to 60 FPS. I have definitely
> seen this problem in the past. Or is Fedora doing something differently?

Intel UHD 600 should not be affected as it is newer than Haswell generation graphics. So I'm not surprised that you don't have any issues.
Could you try with a different distro, like Ubuntu 20.04?
Comment 40 Joshua T 2022-11-15 21:50:14 UTC
(In reply to Antonio Orefice from comment #38)
> (In reply to Joshua T from comment #37)
> > I just tried Fedora KDE Rawhide with Wayland, and I didn't have any issues
> > on my Intel UHD 600. Animations were very close to 60 FPS. I have definitely
> > seen this problem in the past. Or is Fedora doing something differently?
> 
> Are you unable to spot any performance degradation versus Xorg then?

I used the Show Kwin FPS desktop effect, and Xorg was only slightly smoother than Wayland.
Comment 41 Joshua T 2022-11-15 21:51:53 UTC
(In reply to Lemmiwinks from comment #39)
> (In reply to Joshua T from comment #37)
> > I just tried Fedora KDE Rawhide with Wayland, and I didn't have any issues
> > on my Intel UHD 600. Animations were very close to 60 FPS. I have definitely
> > seen this problem in the past. Or is Fedora doing something differently?
> 
> Intel UHD 600 should not be affected as it is newer than Haswell generation
> graphics. So I'm not surprised that you don't have any issues.
> Could you try with a different distro, like Ubuntu 20.04?

I have a Celeron N4020 Gemini Lake cpu. I can confirm I had stutters and laggy animations on both Kubuntu 22.04 and 22.10.
Comment 42 Antonio Orefice 2022-11-16 12:18:39 UTC
(In reply to Joshua T from comment #40)
> (In reply to Antonio Orefice from comment #38)
> > (In reply to Joshua T from comment #37)
> > > I just tried Fedora KDE Rawhide with Wayland, and I didn't have any issues
> > > on my Intel UHD 600. Animations were very close to 60 FPS. I have definitely
> > > seen this problem in the past. Or is Fedora doing something differently?
> > 
> > Are you unable to spot any performance degradation versus Xorg then?
> 
> I used the Show Kwin FPS desktop effect, and Xorg was only slightly smoother
> than Wayland.

If you take a look at the first post, you'll read that the fps effect is not a good meter; indeed using it workarounds the issues under Wayland.
Comment 43 Lemmiwinks 2022-11-16 17:09:01 UTC
I can confirm that using the Show FPS effect makes the whole situation a lot better.
Could it be that there is some kind of CPU/GPU frequency scaling error?
Comment 44 Joshua T 2022-11-16 17:15:12 UTC
I used the show FPS effect to get evidence of the performance of wayland vs xorg. The effect didn’t improve the performance in any way for me. Wayland on Fedora 38 KDE was still smooth before enabling the effect.
Comment 45 Rean 2023-01-01 17:27:00 UTC
I'm going to add my experience with Wayland and X11 performance. I hope this can be addressed in the future. My Plasma 5.26.4 is powered by an Intel Pentium G2030T (2) 2.60GHz. I tested both X11 and Wayland with blur and contrast enabled as well as the FPS effect. I did the same tests on both X11 and Wayland, and I noticed that any blur effect that fills the entire screen makes the framerate drop as low as 40 in Wayland, while on X11 I get 60 fps with the same blurred window filling the entire screen. Also, I get higher render usage with animations and moving windows around in Wayland vs. what I get in X11. I don't think Wayland's performance should be this bad.
Comment 46 Lemmiwinks 2023-01-08 22:57:41 UTC
I think I made an observation: under System Settings -> Compositor, setting the latency to "force smoothest animations", seems to almost fix the problem for me. Maybe someone could try this...
Comment 47 Antonio Orefice 2023-01-09 05:06:06 UTC
(In reply to Lemmiwinks from comment #46)
> I think I made an observation: under System Settings -> Compositor, setting
> the latency to "force smoothest animations", seems to almost fix the problem
> for me. Maybe someone could try this...

The very same workaround is pointed out by the reporter in the very first post.
Comment 48 Nate Graham 2023-01-17 04:10:36 UTC
Git commit cf583aa3673cfca275b64d25c457d9e4a298c46c by Nate Graham, on behalf of Vlad Zahorodnii.
Committed on 17/01/2023 at 04:10.
Pushed by ngraham into branch 'master'.

Change default latency policy to "Force smoothest animations"

There are some performance differences between X11 and Wayland. Desktop
systems are mostly unaffected by them, but laptops suffer a bit.

On Wayland, kwin always does double buffering. This is great for
reducing latency and avoiding tearing, but if the gpu can't keep up with
the work, you're going to see stuttering.

Another issue is that in order to reduce latency, we need to have very
good frame stats. At the moment, kwin records only cpu render time, but
we also need to record the gpu time. We've already done some work in
this area, but it's most likely Plasma 6 material. (plasma/kwin!1163)

In the meantime, let's change the default latency policy to "Force
smoother animations." It's going to improve frame rate. If people care
about latency, they can change latency policy in system settings; the
option is still there.

M  +1    -1    src/kcms/compositing/kwincompositing_setting.kcfg
M  +1    -1    src/kwin.kcfg
M  +1    -1    src/options.h

https://invent.kde.org/plasma/kwin/commit/cf583aa3673cfca275b64d25c457d9e4a298c46c
Comment 49 Vlad Zahorodnii 2023-01-17 13:57:14 UTC
*** Bug 439511 has been marked as a duplicate of this bug. ***
Comment 50 Vlad Zahorodnii 2023-01-18 14:10:36 UTC
*** Bug 416200 has been marked as a duplicate of this bug. ***
Comment 51 Vlad Zahorodnii 2023-01-20 12:09:00 UTC
*** Bug 464520 has been marked as a duplicate of this bug. ***
Comment 52 Nate Graham 2023-02-22 23:14:15 UTC
We're getting reports from Intel iGPU users that the Wayland session is and smoother with that change in Plasma 5.27. And this is my experience as well!

I was feeling curious so I decided to do some investigation and measuring. From what I can tell using intel_gpu_top, here's what's going on with my Intel HD 620 GPU:

On both X11 and Wayland, the GPU's clock speed is always below 20 MHz at idle (and frequently 0MHz), which mean the GPU is idling. In this state it reports using about 1.3 watts of power.

On X11, when I open the Overview Effect with Meta+W, the GPU immediately comes out of idle and the clock speed jumps to 300+ MHz. The animation is always smooth.

On Wayland, when I do this the GPU comes out of idle and the clock speed jumps to 600+ MHz, and even when using the new default setting of "Force smoothest animations", it's not quite as smooth as on X11. Sometimes it's close, sometimes it's less close. It's better than it was before, but still not quiiiiite as good as on X11.

When I turn on the Show FPS effect, an animation is constantly played and minimum GPU clock speed jumps to about 80 MHz. The power consumption is slightly higher at about 1.55 watts. The GPU now constantly has work to do and doesn't idle anymore; in this state, newly-played animations are as smooth as on X11.

So it seems like there is a difference on X11 vs Wayland regarding how the GPU handles coming out of an idle state. On X11 it does this quickly enough that a newly requested animation is smooth. On Wayland, that's not the case, and to replicate the same results, you have to force it to not go idle. The cost in power consumption to do so is small, but measurable. And on a laptop, it would be impactful.

I don't have enough information to speculate about whether this is a GPU driver bug, and/or if it's something KWin can do better about.

Regardless, the differences between X11 and Wayland in smoothness are now much smaller than they were before, even if we're not 100% of the way there I'm removing this bug from https://community.kde.org/Plasma/Wayland_Showstoppers since such a small difference in performance is not a showstopper.
Comment 53 kde-yyds 2023-04-05 14:10:44 UTC
I'm using kwin 5.27.2, but the performance on wayland is still much worse than kwin-x11, especially when blur effect is enabled.
Comment 54 medin 2023-05-09 23:10:32 UTC
Wayland is slow and laggy compared to X11 on my two laptops with i3-5005u and i3-1005G1.
Comment 55 Nate Graham 2023-05-10 08:16:57 UTC
There's a plan to try triple buffering on kwin_wayland; see https://invent.kde.org/plasma/kwin/-/issues/124. That should help, as it's what KWin does on X11 which does help with smoothness.
Comment 56 postix 2023-06-25 16:09:29 UTC
*** Bug 445070 has been marked as a duplicate of this bug. ***
Comment 57 David Edmundson 2023-06-25 20:54:44 UTC
Can reporters here confirm if performance improves if the effect "background contrast" is disabled?
Comment 58 Nate Graham 2023-06-26 19:30:48 UTC
...And if the issue does go away with that effect disabled, then the issue was probably Bug 469151 which has just been fixed!
Comment 59 kde.org 2023-06-26 21:11:16 UTC
(In reply to David Edmundson from comment #57)
> Can reporters here confirm if performance improves if the effect "background
> contrast" is disabled?

Unfortunately, I'm currently away for a longer time from my computer. Therefore I cannot check, if enabling or disabling the effect makes any difference. Generally, I don't have any effects active in order to put as little strain on the GPU (and CPU) as possible, so I __THINK__ that I am experiencing the slower and more sluggish performance on Wayland vs Xorg without having this effect enabled. I cannot comment, however, if the performance would be even worse if I had enabled this effect.  I will check as soon as I get back to the computer.
And while I am at it, a big THANK YOU for looking into this issue :-)
Comment 60 medin 2023-06-26 22:05:20 UTC
(In reply to Nate Graham from comment #58)
> ...And if the issue does go away with that effect disabled, then the issue
> was probably Bug 469151 which has just been fixed!

It doesn't differ too much with Background contrast effect, on the other hand it seems disabling Blur makes really animations much smoother, especially with Task Manager thumbnails.
And one thing to note is that mouse lagging seems to be somehow improved without needing to add KWIN_DRM_NO_AMS=1 as env var.

The problem is that Wayland when idle looks good and smooth but starts to greatly slow down animations when you open some apps, and some apps are randomly frozen (grayed effect) and you need to wait for them to finish their works, I don't know if it's coming from my weak CPU, but on X11 the balance between heavy loading and animations seems much better handled than Wayland.
Comment 61 Lyubomir 2023-06-27 13:00:17 UTC
Created attachment 159927 [details]
settings

What do these three dots mean? Is it activated or not?
Comment 62 David Edmundson 2023-06-27 13:43:53 UTC
There's a tri-state based on whether it's enabled in user settings but not enabled by the compositor or something.

It's very confusing, just click it again to get to a solid checkmark or blank.
Comment 63 Patrick Silva 2023-06-27 15:25:12 UTC
I'm using this env variable on my system:

KWIN_DRM_USE_MODIFIERS=0

Performance is the same on both Wayland and X11 sessions regardless the background contrast and blur effects are enabled or not.

Operating System: Arch Linux 
KDE Plasma Version: 5.27.6
KDE Frameworks Version: 5.107.0
Qt Version: 5.15.10
Kernel Version: 6.3.9-arch1-1 (64-bit)
Graphics Platform: Wayland
Processors: 4 × Intel® Core™ i5-4670K CPU @ 3.40GHz
Memory: 15,5 GiB of RAM
Graphics Processor: Mesa Intel® HD Graphics 4600
Manufacturer: Gigabyte Technology Co., Ltd.
Product Name: B85M-D3PH
Comment 64 Patrick Silva 2023-06-27 15:41:39 UTC
glxgears test slightly stutters on Wayland.

My results on Wayland, with "blur" and "background contrast" effects disabled:

$ glxgears
Running synchronized to the vertical refresh.  The framerate should be
approximately the same as the monitor refresh rate.
295 frames in 5.0 seconds = 58.996 FPS
290 frames in 5.0 seconds = 57.807 FPS
290 frames in 5.0 seconds = 57.808 FPS
289 frames in 5.0 seconds = 57.799 FPS
291 frames in 5.0 seconds = 58.007 FPS
289 frames in 5.0 seconds = 57.800 FPS
290 frames in 5.0 seconds = 57.808 FPS
290 frames in 5.0 seconds = 57.807 FPS
290 frames in 5.0 seconds = 57.996 FPS
291 frames in 5.0 seconds = 58.006 FPS
289 frames in 5.0 seconds = 57.796 FPS
291 frames in 5.0 seconds = 58.006 FPS
289 frames in 5.0 seconds = 57.794 FPS
290 frames in 5.0 seconds = 57.954 FPS
291 frames in 5.0 seconds = 57.866 FPS
290 frames in 5.0 seconds = 57.997 FPS
290 frames in 5.0 seconds = 57.809 FPS
291 frames in 5.0 seconds = 58.008 FPS
290 frames in 5.0 seconds = 57.999 FPS
290 frames in 5.0 seconds = 57.995 FPS
288 frames in 5.0 seconds = 57.599 FPS
291 frames in 5.0 seconds = 58.004 FPS
290 frames in 5.0 seconds = 57.997 FPS
290 frames in 5.0 seconds = 57.812 FPS
288 frames in 5.0 seconds = 57.599 FPS

My results on X11, also with "blur" and "background contrast" effects disabled:

$ glxgears
Running synchronized to the vertical refresh.  The framerate should be
approximately the same as the monitor refresh rate.
294 frames in 5.0 seconds = 58.792 FPS
301 frames in 5.0 seconds = 59.998 FPS
300 frames in 5.0 seconds = 59.969 FPS
301 frames in 5.0 seconds = 60.027 FPS
301 frames in 5.0 seconds = 60.000 FPS
300 frames in 5.0 seconds = 59.999 FPS
300 frames in 5.0 seconds = 59.999 FPS
300 frames in 5.0 seconds = 59.999 FPS
301 frames in 5.0 seconds = 60.000 FPS
300 frames in 5.0 seconds = 59.998 FPS
301 frames in 5.0 seconds = 60.004 FPS
300 frames in 5.0 seconds = 59.997 FPS
Comment 65 Linus Kardell 2023-06-28 17:31:12 UTC
(In reply to David Edmundson from comment #57)
> Can reporters here confirm if performance improves if the effect "background
> contrast" is disabled?

It helps a bit with my poweroff screen performance, but it's not enough.

(In reply to Nate Graham from comment #55)
> There's a plan to try triple buffering on kwin_wayland; see
> https://invent.kde.org/plasma/kwin/-/issues/124. That should help, as it's
> what KWin does on X11 which does help with smoothness.

Though that's only used on intel according to that page, so it doesn't explain the difference on AMD.
Comment 66 David Edmundson 2023-07-07 10:21:37 UTC
*** Bug 467283 has been marked as a duplicate of this bug. ***
Comment 67 Linus Kardell 2023-07-26 13:22:00 UTC
It's even worse on llvmpipe. My laptop fell back to llvmpipe after an update now for some reason, and it runs at about 1-2 FPS.
Comment 68 Zamundaaa 2023-07-26 13:33:38 UTC
Let's keep this bug report about Intel iGPUs. If performance is worse than expected with AMD or llvmpipe, please file a separate bug report.
Comment 69 Bug Janitor Service 2023-07-26 16:36:51 UTC
A possibly relevant merge request was started @ https://invent.kde.org/plasma/kwin/-/merge_requests/4268
Comment 70 Linus Kardell 2023-07-26 16:48:08 UTC
(In reply to Zamundaaa from comment #68)
> Let's keep this bug report about Intel iGPUs. If performance is worse than
> expected with AMD or llvmpipe, please file a separate bug report.

I have made a report about AMD at https://bugs.kde.org/show_bug.cgi?id=464520, and it was marked as a duplicate of this one
Comment 71 Zamundaaa 2023-12-07 13:26:22 UTC
Git commit 4ad5670ddfcd7400c8b84c12cbf8bd97a0590f43 by Xaver Hugl.
Committed on 07/12/2023 at 14:17.
Pushed by zamundaaa into branch 'master'.

backends/drm: set dma-fence deadlines if available

This tells the kernel when a buffer should be done rendering, which allows
it to for example increase GPU clocks in order to hopefully hit our deadlines.
That in turn should reduce the amount of dropped frames

M  +27   -10   src/backends/drm/drm_buffer.cpp
M  +3    -0    src/backends/drm/drm_buffer.h
M  +9    -0    src/backends/drm/drm_commit.cpp
M  +1    -0    src/backends/drm/drm_commit.h
M  +1    -0    src/backends/drm/drm_commit_thread.cpp

https://invent.kde.org/plasma/kwin/-/commit/4ad5670ddfcd7400c8b84c12cbf8bd97a0590f43
Comment 72 Syntist 2023-12-08 23:53:30 UTC
(In reply to Zamundaaa from comment #71)
> Git commit 4ad5670ddfcd7400c8b84c12cbf8bd97a0590f43 by Xaver Hugl.
> Committed on 07/12/2023 at 14:17.
> Pushed by zamundaaa into branch 'master'.
> 
> backends/drm: set dma-fence deadlines if available
> 
> This tells the kernel when a buffer should be done rendering, which allows
> it to for example increase GPU clocks in order to hopefully hit our
> deadlines.
> That in turn should reduce the amount of dropped frames
> 
> M  +27   -10   src/backends/drm/drm_buffer.cpp
> M  +3    -0    src/backends/drm/drm_buffer.h
> M  +9    -0    src/backends/drm/drm_commit.cpp
> M  +1    -0    src/backends/drm/drm_commit.h
> M  +1    -0    src/backends/drm/drm_commit_thread.cpp
> 
> https://invent.kde.org/plasma/kwin/-/commit/
> 4ad5670ddfcd7400c8b84c12cbf8bd97a0590f43

Is it possible we can get this fix integrated in the current version of KWin? 
I think this is the major show stopper for Intel iGPU user on KDE Wayland.
Comment 73 Ben Daines 2023-12-10 05:36:55 UTC
(In reply to syed.talha.khalid from comment #72)
> (In reply to Zamundaaa from comment #71)
> > Git commit 4ad5670ddfcd7400c8b84c12cbf8bd97a0590f43 by Xaver Hugl.
> > Committed on 07/12/2023 at 14:17.
> > Pushed by zamundaaa into branch 'master'.
> > 
> > backends/drm: set dma-fence deadlines if available
> > 
> > This tells the kernel when a buffer should be done rendering, which allows
> > it to for example increase GPU clocks in order to hopefully hit our
> > deadlines.
> > That in turn should reduce the amount of dropped frames
> > 
> > M  +27   -10   src/backends/drm/drm_buffer.cpp
> > M  +3    -0    src/backends/drm/drm_buffer.h
> > M  +9    -0    src/backends/drm/drm_commit.cpp
> > M  +1    -0    src/backends/drm/drm_commit.h
> > M  +1    -0    src/backends/drm/drm_commit_thread.cpp
> > 
> > https://invent.kde.org/plasma/kwin/-/commit/
> > 4ad5670ddfcd7400c8b84c12cbf8bd97a0590f43
> 
> Is it possible we can get this fix integrated in the current version of
> KWin? 
> I think this is the major show stopper for Intel iGPU user on KDE Wayland.

That would be pretty sweet.  Plasma 6 release is still months away and distros will take even longer to include Plasma 6 on their own schedules.
Comment 74 Zamundaaa 2023-12-15 14:58:29 UTC
Afaik 5.27 wouldn't really benefit from this due to how KWin submits frames to the kernel there.
Comment 75 Syntist 2023-12-15 20:33:22 UTC
(In reply to Zamundaaa from comment #74)
> Afaik 5.27 wouldn't really benefit from this due to how KWin submits frames
> to the kernel there.

Oh make sense, so is the fix Available in Beta version Plasma 6.0? Or we have to wait for it to land on next Beta Release?
Comment 76 Bug Janitor Service 2023-12-26 02:19:33 UTC
A possibly relevant merge request was started @ https://invent.kde.org/plasma/kwin/-/merge_requests/4833
Comment 77 Antonio Orefice 2024-02-29 19:13:19 UTC
Just tried with an i5-1035g1 under wayland with its gpu that should be far more performant than my haswell, but still on the logout screen mouse cursor was stuttering (a lot) under wayland on the newer cpu and butter smooth on X11 on the aging haswell.
Comment 78 Bug Janitor Service 2024-03-08 13:53:27 UTC
A possibly relevant merge request was started @ https://invent.kde.org/plasma/kwin/-/merge_requests/5390
Comment 79 Zamundaaa 2024-03-26 16:15:01 UTC
Git commit d59451adcf8f7a4d90659a60bfbe73230091b0e3 by Xaver Hugl.
Committed on 26/03/2024 at 16:06.
Pushed by zamundaaa into branch 'master'.

effects/quickeffect: render qtquick scenes in the normal render pass

If they're rendered in prePaintScreen, the time needed to render them is not
accounted for in compositing scheduling, which may make KWin miss vblank
Related: bug 482861, bug 482034

M  +5    -0    src/effect/offscreenquickview.cpp
M  +9    -10   src/effect/quickeffect.cpp

https://invent.kde.org/plasma/kwin/-/commit/d59451adcf8f7a4d90659a60bfbe73230091b0e3
Comment 80 David Edmundson 2024-05-14 21:22:41 UTC
Performance is something we're constantly improving, there is nothing left in this bug report that's specific enough to keep this ticket open
Comment 81 Nate Graham 2024-05-14 21:30:22 UTC
FWIW, on my hardware (2020 10th gen Intel i7 with HD610 iGPU driving a 4K screen) the issue is completely resolved for me, at least until the system is under heavy load--when I think some stuttering it expected.
Comment 82 Zamundaaa 2024-05-24 15:50:53 UTC
Git commit 40a9f08dd34948264ba5b533da5373a0a51ad3f1 by Xaver Hugl.
Committed on 24/05/2024 at 15:32.
Pushed by zamundaaa into branch 'master'.

backends/drm: allow up to two composited frames to be pending at the same time

This should improve responsiveness on setups where rendering each frame takes longer
than the refresh cycle of the display.
Related: bug 454098

M  +4    -0    src/backends/drm/drm_output.cpp

https://invent.kde.org/plasma/kwin/-/commit/40a9f08dd34948264ba5b533da5373a0a51ad3f1
Comment 83 Zamundaaa 2024-05-24 15:59:50 UTC
Git commit 192232e8d1736307f93792ea77de80e61f746bd3 by Xaver Hugl.
Committed on 24/05/2024 at 15:59.
Pushed by zamundaaa into branch 'Plasma/6.1'.

backends/drm: allow up to two composited frames to be pending at the same time

This should improve responsiveness on setups where rendering each frame takes longer
than the refresh cycle of the display.
Related: bug 454098

M  +4    -0    src/backends/drm/drm_output.cpp

https://invent.kde.org/plasma/kwin/-/commit/192232e8d1736307f93792ea77de80e61f746bd3
Comment 84 Nate Graham 2024-06-12 14:07:14 UTC
This issue is completely fixed for me with Plasma 6.1, FWIW! It's super awesome.