Bug 231989 - Nepomuk store size constantly increases without any limit
Summary: Nepomuk store size constantly increases without any limit
Status: RESOLVED UNMAINTAINED
Alias: None
Product: nepomuk
Classification: Miscellaneous
Component: general (show other bugs)
Version: unspecified
Platform: Gentoo Packages Linux
: NOR normal
Target Milestone: ---
Assignee: Sebastian Trueg
URL:
Keywords:
: 253115 263197 (view as bug list)
Depends on:
Blocks:
 
Reported: 2010-03-24 11:12 UTC by Tassilo Horn
Modified: 2015-01-23 16:22 UTC (History)
7 users (show)

See Also:
Latest Commit:
Version Fixed In:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Tassilo Horn 2010-03-24 11:12:04 UTC
Version:            (using KDE 4.4.1)
OS:                Linux
Installed from:    Gentoo Packages

The Nepomuk store size (right click on the Search
Service icon > Configure File Indexer) seems to increase without limits
and to me it looks like it's increasing its pace.

On my system, there are currently 29,187 files in the index, and this
number is pretty constant.  After I've started to use KDE (4.4) about
1.5 month ago and the initial file indexing was done, the store size was
about 0.6 GiB.  Till now, that has tripled to 1.8 GiB.

A week ago, it was about 1.4 GiB, so I somehow get the impression, that after updating to KDE 4.4.1 it increases faster...

Most of the files in the index are frequently changing source code files
in various repositories.  Maybe Nepomuk keeps a history for each file?
Well, for files in a VCS, that would by quite awkward...

I would like to debug that further.  Are there any tools I can use to check what informations Nepomuk collected for a given file?

Maybe this is related to Bug #208596.  There, Sebastian Trueg posted some sopranocmd, whithout explaining what it should say.  When I do

  $ sopranocmd --model main --dbus org.kde.NepomukStorage list \
    "<file:///home/horn/bin/Xml2tg>" 
  Total results: 0
  Execution time: 00:00:00.72

I can do that for any file in the indexed directory, and the result is always 0.  But /home/horn/bin/Xml2tg is indexed.  When I use the Search in dolphin for some string contained in that file, it is listed in the results...

I also tried re-indexing everything in order to delete informations of deleted files using

  qdbus org.kde.nepomuk.services.nepomukstrigiservice \
       /nepomukstrigiservice org.kde.nepomuk.Strigi.updateAllFolders true

but the Nepomuk store size is still 1.8 GiB.
Comment 1 Sebastian Trueg 2011-01-06 19:18:45 UTC
Nepomuk does not use the file URL as resource URI. Use the following command instead:

$ sopranocmd --model main --dbus org.kde.NepomukStorage query \
  "select ?p ?o where { ?r ?p ?o . ?r nie:url <file:/...> . }"

However, the point of this bug is valid: the size of the Virtuoso DB only grows but never goes down.
Comment 2 Sebastian Trueg 2011-01-06 19:21:34 UTC
*** Bug 253115 has been marked as a duplicate of this bug. ***
Comment 3 Sebastian Trueg 2011-01-15 12:16:55 UTC
*** Bug 263197 has been marked as a duplicate of this bug. ***
Comment 4 Alejandro Nova 2011-12-05 21:39:59 UTC
Please, retest with KDE 4.8 Beta 2.
Comment 5 Carlo Wood 2013-01-14 17:32:17 UTC
My home directory just ran full because of this.
-rw------- 1 carlo carlo 4282384384 Jan 14 18:25 .kde/share/apps/nepomuk/repository/main/data/virtuosobackend/soprano-virtuoso.db

4 GB...

I immediately tried to purge nepomuk from my system, but that causes all of KDE to be deleted.. So much for freedom :/.

KDE version of 'wheezy' (4.8.4).
Comment 6 Nikos Alexandris 2013-12-02 09:22:44 UTC
Hi,

I think this is still valid (in my system at least). I work with

Qt: 4.8.5, KDE Development Platform: 4.11.00, kde4-config: 1.0

and I ran twice "Nepomuk Cleaner: 4.11.00".  Didn't see any disk space usage reduction. Any hints? Getting a "/home" filled is problematic.
Comment 7 Nikos Alexandris 2013-12-02 09:55:05 UTC
Update: I wrongly reported this here. Actually, my problem is #264465. Apologies.
Comment 8 Nikos Alexandris 2013-12-02 10:05:52 UTC
Ok, back in this thread again!  It is actually the "soprano-virtuoso.db" file which is huge. Currently 87+GB.

What can I do to see if all this is really "metadata" and, if not, clean/reduce it?
Comment 9 Vishesh Handa 2015-01-23 16:22:39 UTC
Thank you for taking the time to file a bug report.

The Nepomuk project is no longer included in the KDE Software Compilation. With Plasma 5, we have replaced most of the underlying technology with Baloo and other components. Hopefully this will have addressed your concern.

We encourage you to try out Plasma 5 (+Baloo) and let us know if your problem persists.