Bug 226849

Summary: folders once indexed by strigi cannot be removed from the index
Product: [Unmaintained] nepomuk
Reporter: Benjamin Schmitz <vortex>
Component: general
Assignee: Sebastian Trueg <sebastian>
Status: RESOLVED FIXED    
Severity: normal
CC: trueg
Priority: NOR    
Version: unspecified   
Target Milestone: ---   
Platform: Arch Linux   
OS: Linux   

Description Benjamin Schmitz 2010-02-14 17:32:44 UTC
Version:            (using KDE 4.4.0)
OS:                Linux
Installed from:    Archlinux Packages

The contents of folders that were once indexed are not removed from the index when you uncheck the folder in the "strigi index folders" systemsettings module.

For example, I had strigi accidentally index a large folder hierarchy containing source code, which slows searching down and often causes a lot of unwanted results to be displayed.

Therefore I would like to remove the complete folder hierarchy from the search index. However, I haven't found a way to do so, except by removing the whole nepomuk storage and rebuilding the index.

Folders that are set not to be indexed should also be removed from the index if they were indexed previously.
Comment 1 Sebastian Trueg 2010-02-15 14:45:07 UTC
The process of removing the old data can take some time if it's a lot of files. Are you sure you gave the indexing service the time to do its work?
Comment 2 Benjamin Schmitz 2010-02-15 22:11:27 UTC
The status icon says the file indexer is "idle". I can also see no background processes running. The indexed files aren't removed after you re-login either.

It seems files once indexed just stay in the index. Changes to these files are probably not monitored anymore, but the once-indexed version stays.
Comment 3 Sebastian Trueg 2010-02-16 13:01:31 UTC
Are these files maybe from before KDE 4.3? Or did you start with a clean DB in KDE 4.4?
Comment 4 Benjamin Schmitz 2010-02-17 23:03:54 UTC
I started with a clean DB. The files still show up in search results after several days now.
Comment 5 Sebastian Trueg 2010-02-19 17:20:39 UTC
SVN commit 1092847 by trueg:

Determine all entries to remove in a single query using some fancy FILTERs.

BUG: 226849


 M  +64 -60    indexscheduler.cpp  


WebSVN link: http://websvn.kde.org/?view=rev&revision=1092847