I have several files of tens of Gb in size. These are technically text files, so the Nepomuk indexer would try to index them, causing issues with memory (and eventually getting killed by the OOM killer). However, adding an exclude pattern to them (they end in .fastq, so *.fastq was used) did not have the desired effect: the indexer kept on being run on them (despite logouts and logins and restarts of the service). The only way to prevent that was to remove the paths in the indexing KCM. Reproducible: Always Steps to Reproduce: 1. Add a custom pattern to ignore files 2. Activate the indexing service Actual Results: Despite the pattern match, files are indexed. Expected Results: Files matching should not be indexed. Git master from today (14th Dec 2012).
This seems to work with the current master. There were some recent changes that may be related to this problem. Can you please test it again?
Thank you for taking the time to file a bug report. The Nepomuk project is no longer included in the KDE Software Compilation. With Plasma 5, we have replaced most of the underlying technology with Baloo and other components. Hopefully this will have addressed your concern. We encourage you to try out Plasma 5 (+Baloo) and let us know if your problem persists.