Bug 122020

Summary: doesn't start kdesvnd fast enough
Product: [Frameworks and Libraries] kdelibs Reporter: Andreas Pakulat <apaku>
Component: kdedAssignee: David Faure <faure>
Status: RESOLVED FIXED    
Severity: normal CC: ana, rdieter
Priority: NOR    
Version: unspecified   
Target Milestone: ---   
Platform: Debian testing   
OS: Linux   
Latest Commit: Version Fixed In:
Attachments: A testcase demostrating this bug
A path to kdelibs/kio/kio/kmimetype.cpp - do not timeout

Description Andreas Pakulat 2006-02-15 12:43:50 UTC
Version:            (using KDE KDE 3.5.1)
Installed from:    Debian testing/unstable Packages

Hi,

I have a problem here with dynamic pop-up menus in konqueror. I always get some one of 2 possible error messages when right-clicking on a website. The errors are from kdesvn-kio-plugins telling me either that the actions defined in the .desktop files are invalid or that there's a dcop communicating problem. It always works the 2nd time I right-click. This has already been filed as bugreport with Debian BTS, but as this seems to be an upstream issue I forward it here.

Also the maintainer of kdesvn cannot reproduce this on his machine, thus he thinks this might be a timout-issue. I'm using a Pentium-M Laptop with 1.4 GHz and 512 MB Ram here and I'm having this issue always when the context-menu should include the SVN-entries.

For more information about what we tried and what was the outcome, please see:

http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=348411

Andreas
Comment 1 Martin Steigerwald 2006-02-26 14:14:07 UTC
Hello,

I can reproduce this on my IBM ThinkPad T23 with Pentium 3 1.13GHz with Debian Etch/Sid using KDE 3.5.1 and 

root@deepdance:~ -> dpkg -l | grep kdesvn
ii  kdesvn                  0.7.3-1                 subversion client with tight KDE integration
ii  kdesvn-kio-plugins      0.7.3-1                 subversion I/O slaves for KDE

AFAIK it didn't happen with 0.7.2 or less or kdesvn.

I either get:

1)

4 or 5 error messages like this
"Die Desktop-Datei /usr/share/apps/konqueror/servicemenus/kdesvn_subversion_toplevel.desktop
hat einen ungültigen Menü-Eintrag
Log."

(in english: the desktop file blabla has a invalid menu entry)

Also with: Info. Blame and Rename.

2) 

"Won't receive DCOP signal" or something like this

I am trying the workaround to comment out "X-KDE-GetActionMenu="

/usr/share/apps/konqueror/servicemenus/kdesvn_subversion.desktop
/usr/share/apps/konqueror/servicemenus/kdesvn_subversion_toplevel.desktop

that is described in that debian bug (348411) as that bug is rather annoying over time. ;)

I also tried to reproduce this with a new user with new KDE configuration, but there this bug doesn't seem to happen. So maybe its some migration issue (old config or menu files laying around)? But I didn't find anything. Or maybe that new KDE user setup doesn't start as many services so it starts up faster?

Regards,
Martin
Comment 2 Martin Steigerwald 2006-03-28 01:29:25 UTC
Is there any analysis or even solution for this bug? The maintainer of the kdesvn Debian package Michael Biebl tried to solve the bug, but this doesn't work. It is still in kdesvn 0.8.0-1.

BTW the work-around does worked okay here.
Comment 3 Tro 2006-05-15 03:54:05 UTC
I can also reproduce this with KDE 3.5.2.

Right click in Konqueror uses I'd say about a gig+ of RAM to bring up the above-mentioned messages, which thrashes my system (only 512MB of RAM). But this doesn't happen all the time. Usually, I can observe this behaviour when my system has an above average load, which is consistent with the hypothesis that this bug is a timing issue. I wish I could provide more details, but I don't even know where to start. What's relevant?
Comment 4 Sune Vuorela 2006-06-04 18:56:14 UTC
*** This bug has been confirmed by popular vote. ***
Comment 5 David Faure 2006-08-15 21:59:10 UTC
Can you help me understand the bug by describing what's in those servicemenus and which dcop call is being made? I don't have kdesvn[d] installed.
Comment 6 Andreas Pakulat 2006-08-15 22:08:23 UTC
On 15.08.06 19:59:11, David Faure wrote:
> Can you help me understand the bug by describing what's in those servicemenus


Various subversion-related actions, for example Checkout, Blame log and
so on.

> and which dcop call is being made? I don't have kdesvn[d] installed.


The dcop call is: kded kdesvnd getActionMenu(KURL::List) and is given to
konqueror via:

X-KDE-GetActionMenu=kded kdesvnd getActionMenu(KURL::List)

Andreas
Comment 7 David Faure 2006-08-16 23:01:30 UTC
On Tuesday 15 August 2006 22:08, Andreas Pakulat wrote:
> The dcop call is: kded kdesvnd getActionMenu(KURL::List)

OK - and this loads kdesvnd on demand. But that's synchronous, I don't understand "not fast enough" in the descirption of this bug.

Is the problem reproduceable with the "dcop" command-line client?
I just found out that I have ksvnd (but not kdesvnd), and for that one it seems to work:

$ dcop kded kded unloadModule ksvnd
true
$ dcop kded ksvnd getActionMenu '' ''
Diff
Rename
[...]
Comment 8 Michael Biebl 2006-08-16 23:35:07 UTC
The problem is, if the system is under high load or if you have a slow machine, it seems that the dcop reply times out or better is too late.
If I start konqueror from the console I get 
>> Very strange! got a DCOPReply opcode, but we were not waiting for a reply!
and konqueror displays the error messages.
Comment 9 Modestas Vainius 2006-10-26 10:25:20 UTC
Below I attach bug122020.tar.gz, a small testcase, demonstrating this bug. When you run it, you will get something like this on the console.

DCOPsrv: Starting DCOPsrv, registering as dcopsrv1
DCOPsrv: Starting new DCOPsrv service...
DCOPsrv: Starting DCOPsrv, registering as dcopsrv2
DCOPsrv: Starting new DCOPsrv service...
Executing the 1st DCOP call: service dcopsrv1 function sleep(3000)
DCOPcl: 1st dcop call sleep(3000) with timeout failed. The first part of the bug
Executing the 2nd DCOP call: service dcopsrv2 function sleep(4000)
2nd dcop call sleep(4000) w/o timeout returned 3000
2nd dcop call returned the return value (3000) of the 1st DCOP call. Bug proven!
Very strange! got a DCOPReply opcode, but we were not waiting for a reply!

sleep(int msecs) is a small DCOP function, which causes the process (the DCOP service provider) to sleep for msecs miliseconds and returns msecs value to the caller. As you a return value of the 2nd DCOP call is wrong (should have been 4000) due to this bug. And "Very strange! got a DCOPReply opcode, but we were not waiting for a reply!" message is where the real reply of the 2nd call (return value 4000) gets discarded.

I also attach a patch to avoid using timeout in kdelibs/kio/kio/kmimetype.diff It's not truly a fix for the actual DCOP bug, but rather a workaround to avoid it. The other workaround could be to establish a new DCOP connection and call with timeout on it, but, in my opinion, abandoning timeout is a clean and logical solution.

A true solution would be to fix this race condition in DCOP code. That's where experienced DCOP dev should step in. Or at least please clearly DOCUMENT this bug in DCOPClient documentation (e.g. a big fat warning that it is not safe to do another dcop call on the same connection if a previous call might have timed out).
Comment 10 Modestas Vainius 2006-10-26 10:28:28 UTC
Created attachment 18270 [details]
A testcase demostrating this bug

$ tar xvzf bug122020.tar.gz
$ cd bug122020
<edit Makefile if necessary (it's OK for debian based systems)>
$ make && ./run.sh ; killall dcopsrv
Comment 11 Modestas Vainius 2006-10-26 10:31:44 UTC
Created attachment 18271 [details]
A path to kdelibs/kio/kio/kmimetype.cpp - do not timeout

Do not time out when waiting for the reply for the X-KDE-GetActionMenu DCOP
call. DCOP protocol is synchronouos by design and a reply for the timed out
call (i.e. garbage) still arrives and might be falsely returned as a valid
reply for the subsequent DCOP call on the same connection. The application
currently known to be badly hurt by this is kdesvn and its konqueror service
menu integration. Its X-KDE-GetActionMenu might take longer than current 100
milisecond timeout to complete. As a result then, a late reply will "pollute" a
main DCOP connection of the client application (e.g. konqueror) causing further
DCOP failures.
Comment 12 David Faure 2006-10-26 10:39:16 UTC
Thanks for the very good analysis of the problem.
I agree with your comments relative to the kmimetype.cpp patch, and I have applied it to branches/KDE/3.5/kdelibs/kio/kio/kmimetype.cpp in subversion.
At this point I seriously doubt that anyone is going to fix the DCOP problem; those developers who knew the DCOP internals have dropped out,
and the KDE4 development branch has already replaced DCOP with DBUS.
I also think that whoever added the timeout in the kmimetype call didn't really think of the implications :)