Bug 282793 - nepomukservicestub, i.e. libstreamanalyzer stub, crashes on scanning home directory
Summary: nepomukservicestub, i.e. libstreamanalyzer stub, crashes on scanning home dir...
Status: RESOLVED UPSTREAM
Alias: None
Product: nepomuk
Classification: Miscellaneous
Component: general (show other bugs)
Version: unspecified
Platform: Debian unstable Linux
: NOR crash
Target Milestone: ---
Assignee: Sebastian Trueg
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2011-09-26 10:57 UTC by Martin Steigerwald
Modified: 2012-01-04 10:42 UTC (History)
2 users (show)

See Also:
Latest Commit:
Version Fixed In:


Attachments
xsession-errors with some information regarding the crashes (446.06 KB, text/plain)
2011-09-26 11:10 UTC, Martin Steigerwald
Details
second xsession-errors log (21.85 KB, text/plain)
2011-09-26 18:39 UTC, Martin Steigerwald
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Martin Steigerwald 2011-09-26 10:57:46 UTC
Version:           unspecified (using KDE 4.6.5) 
OS:                Linux

Application: nepomukservicestub (0.2)
KDE Platform Version: 4.6.5 (4.6.5)
Qt Version: 4.7.3
Operating System: Linux 3.0.0-1-amd64 x86_64
Distribution: Debian GNU/Linux unstable (sid)

-- Information about the crash:
The crash can be reproduced every time. Just by letting Nepomuk index my home directory.

I am reporting this nevertheless as it again makes Nepomuk unusable on my machine. I tried Nepomuk from beginning from KDE 4.2 again and again and it always crashed at some point. I now tried again after upgrading to streamanalyzer 0.76, but still no luck.

Last thing it displays in systemsettings indexing status is:

martin@merkaba:~/Wissen/Wikipedia/Interview mit Jimbo Wales - Wikipedia> ls -l
insgesamt 67320
-rw-r--r-- 1 martin martin 18556378 Jun 24  2005 jimbo-audio.ogg
-rw-r--r-- 1 martin martin 50373074 Jun 24  2005 jimbo_small.ogg
martin@merkaba:~/Wissen/Wikipedia/Interview mit Jimbo Wales - Wikipedia>

This seems to be available so I am not sure whether its that file where it choke.

There are no KDE 4.7.x packages for Debian Sid/Experimental yet.

I have the following:

merkaba:~> apt-show-versions | egrep "(soprano|nepomuk|streamanaly)"        
libnepomuk4/sid uptodate 4:4.6.5-2
libnepomukquery4a/sid uptodate 4:4.6.5-2
libnepomukutils4/sid uptodate 4:4.6.5-2
libsoprano-dev/sid uptodate 2.6.0+dfsg.1-4
libsoprano4/sid uptodate 2.6.0+dfsg.1-4
libstreamanalyzer0/sid uptodate 0.7.6-2
soprano-daemon/sid uptodate 2.6.0+dfsg.1-4

+ Qt 4:4.7.3-8

Reproducible: Always

Steps to Reproduce:
Let Nepomuk try to index the file in my home directory. I.e.:

- log out
- rm -r /home/martin/.kde/share/apps/nepomuk
- rm /home/martin/.kde/share/config/*nepomuk*
- log into the KDE session again
- wait for the DrKonqi windows to appear

Actual Results:  

I got four DrKonqi windows with a useless backtrace. I do not see any debug packages that apply:

merkaba:~> apt-cache search nepomuk | grep dbg  
merkaba:~#1> apt-cache search streamana | grep dbg
merkaba:~#1> 


-- Backtrace:
A useful backtrace could not be generated

It only displays:

Application: Nepomuk-Dienst (nepomukservicestub), signal: Segmentation fault

Expected Results:  
Expected results:

Libstreamanalyzer indexers never ever crash on any file. If thats not possible although I think it should, it should at least report the exact file where it crashes, offer to create a useful bug report, cause I think crashing on file indexing is a bug, and not try to index the file again until further notice.


Are there any logfiles?

I found something in ~/.xsession-errors. I try to isolate the stuff and will attach it then.
Comment 1 Martin Steigerwald 2011-09-26 11:10:00 UTC
Created attachment 63981 [details]
xsession-errors with some information regarding the crashes

It seems to contain informations of what the libstreamanalyzer did before the crash.
Comment 2 Sebastian Trueg 2011-09-26 14:38:51 UTC
Could you please install debugging symbols for kdebase-runtime. Otherwise it is impossible to diagnose the crash.
Comment 3 Martin Steigerwald 2011-09-26 14:46:01 UTC
Wanted to do so a moment ago, but they are already installed:

martin@merkaba:~> apt-show-versions | grep kdebase-runtime    
kdebase-runtime/sid uptodate 4:4.6.5-1+b1
kdebase-runtime-data/sid uptodate 4:4.6.5-1
kdebase-runtime-dbg/sid uptodate 4:4.6.5-1+b1

Furthermore I have:

martin@merkaba:~> apt-show-versions | grep "kde.*dbg"     
kdebase-dbg/sid uptodate 4:4.6.5-1
kdebase-runtime-dbg/sid uptodate 4:4.6.5-1+b1
kdebase-workspace-dbg/sid uptodate 4:4.6.5-3
kdelibs5-dbg/sid uptodate 4:4.6.5-2
Comment 4 Sebastian Trueg 2011-09-26 14:57:58 UTC
It seems the problem is a big video file which the service chokes on each time. Any idea which one that could be?
Comment 5 Martin Steigerwald 2011-09-26 17:12:57 UTC
Well the last thing Systemsettings Nepomuk KCM displayed was that it was indexing

~/Wissen/Wikipedia/Interview mit Jimbo Wales - Wikipedia

and in there is indeed a media file:

martin@merkaba:~/Wissen/Wikipedia/Interview mit Jimbo Wales - Wikipedia> ls -lh 
insgesamt 66M
-rw-r--r-- 1 martin martin 18M Jun 24  2005 jimbo-audio.ogg
-rw-r--r-- 1 martin martin 49M Jun 24  2005 jimbo_small.ogg
martin@merkaba:~/Wissen/Wikipedia/Interview mit Jimbo Wales - Wikipedia> mediainfo *
General
ID                                       : 31415 (0x7AB7)
Complete name                            : jimbo-audio.ogg
Format                                   : OGG
File size                                : 17.7 MiB

General
Complete name                            : jimbo_small.ogg
Format                                   : AVI
Format/Info                              : Audio Video Interleave
File size                                : 48.0 MiB
Duration                                 : 25mn 29s
Overall bit rate                         : 264 Kbps
Writing application                      : MEncoder 1.0pre7-3.4.3
Writing library                          : MPlayer

Video
ID                                       : 0
Format                                   : MPEG-4 Visual
Format profile                           : Simple@L1
Format settings, BVOP                    : No
Format settings, QPel                    : No
Format settings, GMC                     : No warppoints
Format settings, Matrix                  : Default (H.263)
Codec ID                                 : divx
Codec ID/Info                            : Mainly used by Google
Codec ID/Hint                            : DivX
Duration                                 : 25mn 29s
Bit rate                                 : 187 Kbps
Width                                    : 320 pixels
Height                                   : 240 pixels
Display aspect ratio                     : 4:3
Frame rate                               : 25.000 fps
Color space                              : YUV
Chroma subsampling                       : 4:2:0
Bit depth                                : 8 bits
Scan type                                : Progressive
Compression mode                         : Lossy
Bits/(Pixel*Frame)                       : 0.097
Stream size                              : 34.0 MiB (71%)
Writing library                          : FFmpeg0.4.9-pre1b4752

Audio
ID                                       : 1
Format                                   : MPEG Audio
Format version                           : Version 1
Format profile                           : Layer 3
Mode                                     : Joint stereo
Mode extension                           : MS Stereo
Codec ID                                 : 55
Codec ID/Hint                            : MP3
Duration                                 : 25mn 29s
Bit rate mode                            : Constant
Bit rate                                 : 64.0 Kbps
Channel(s)                               : 2 channels
Sampling rate                            : 48.0 KHz
Compression mode                         : Lossy
Stream size                              : 11.7 MiB (24%)
Alignment                                : Aligned on interleaves
Interleave, duration                     : 24 ms (0.60 video frame)


martin@merkaba:~/Wissen/Wikipedia/Interview mit Jimbo Wales - Wikipedia>

Could that be the culprit? I downloaded this interview somewhere. But I did not note down where - there Nepomuk would have come handy. Lets do a little digging with Google.

Ah, I think I got it:

http://www.pro-linux.de/news/1/8300/linuxtag-interview-mit-und-keynote-von-jimbo-wales.html

http://media.linuxtag.org/jimbo_wales/jimbo_small.ogg

I think thats the closest bet I can give at the moment. Beside that I have lots of videos on the SSD made my various digicams (*.mov and *.avi) and some other videos. Most videos are on an external harddisk that was not connected tough. And indexing files on external harddisk is disabled anyway.

Do you have an easy way to test whether its above downloadable file?
Comment 6 Martin Steigerwald 2011-09-26 17:19:00 UTC
Hmmm, its not downloadable anymore. Okay, I started uploading it to my personal server as Creative Commons Commercial-Sharealike-Attribution-Licence seems to allow:

http://martin-steigerwald.de/zeit/kde-bug-282793/

Author of this video / interview is LinuxTag.

Please tell me when you got the video, then I free space for storing the video again.
Comment 7 Martin Steigerwald 2011-09-26 17:24:38 UTC
I do an own test: I exclude this directory from indexing. Maybe this works.
Comment 8 Martin Steigerwald 2011-09-26 17:40:26 UTC
Hmmm, it crashed again. So it might be a different file. Or it chokes on another file. This time I just have "Strigi indiziert gerade Dateien." in systemsettings, so no hint at all.

Ok, three more crashes. I will wait till it stops executing nepomukservicestub due to "too many crashes" or finishing moving through my home directory and excerpt xsession-errors then. It seems like different files, cause less "READ" lines in xsession-errors.
Comment 9 Martin Steigerwald 2011-09-26 18:39:36 UTC
Created attachment 63989 [details]
second xsession-errors log

Seems different file(s). Nepomuk Systemsettings kcm in the end displayed a directory it which might contain the broken file. There is a pdf and some doc files in there. Its kind of private thus I will not upload it here.

So one way would be to exclude this directory as well and then try again. But this might turn out tedious if there are lots of other files it chokes on. I have lots of different files with different file formats on the harddisk. But I still expect libstream* not to crash on them.
Comment 10 Sebastian Trueg 2011-09-28 18:39:26 UTC
(In reply to comment #6)
> Hmmm, its not downloadable anymore. Okay, I started uploading it to my personal
> server as Creative Commons Commercial-Sharealike-Attribution-Licence seems to
> allow:
> 
> http://martin-steigerwald.de/zeit/kde-bug-282793/
> 
> Author of this video / interview is LinuxTag.
> 
> Please tell me when you got the video, then I free space for storing the video
> again.

I successfully indexed both files without any crashes...
Comment 11 Sebastian Trueg 2011-10-26 11:04:51 UTC
Marking this as fixed upstream since the upcoming strigi 0.7.7 has some fixes which also include pdf files.
Comment 12 Martin Steigerwald 2011-10-26 12:16:14 UTC
Thanks, Sebastian. I thought about more specific tests like just letting it search the directory with the contents that possibly trigger the crash, but I have been rather busy. I will try to get a hold of the newer strigi version and try again. Hopefully by then the Debian Qt/KDE team managed to bring out KDE 4.7.2 or later packages...
Comment 13 Martin Steigerwald 2012-01-04 10:42:40 UTC
Sebastian, indeed this seems to be fixed with:

merkaba:~> apt-show-versions | egrep "(streamanalyzer|soprano|nepomuk|virtuoso)"
libnepomuk4/experimental uptodate 4:4.7.4-1
libnepomukquery4a/experimental uptodate 4:4.7.4-1
libnepomukutils4/experimental uptodate 4:4.7.4-1
libsoprano-dev/experimental uptodate 2.7.3+dfsg.1-1
libsoprano4/experimental uptodate 2.7.3+dfsg.1-1
libstreamanalyzer0/sid uptodate 0.7.7-1
soprano-daemon/experimental uptodate 2.7.3+dfsg.1-1
virtuoso-minimal/wheezy uptodate 6.1.3+dfsg1-2
virtuoso-opensource-6.1-bin/wheezy uptodate 6.1.3+dfsg1-2
virtuoso-opensource-6.1-common/wheezy uptodate 6.1.3+dfsg1-2

For the first time Nepomuk is working stable here!

Many thanks!