Bug 388967

Summary: FR: Support for "robots.txt" like functionality
Product: [Frameworks and Libraries] frameworks-baloo Reporter: Richard Llom <richard.llom>
Component: generalAssignee: Pinak Ahuja <pinak.ahuja>
Status: CONFIRMED ---    
Severity: wishlist CC: nate, vini.ipsmaker
Priority: NOR    
Version First Reported In: unspecified   
Target Milestone: ---   
Platform: Other   
OS: Linux   
Latest Commit: Version Fixed/Implemented In:
Sentry Crash Report:

Description Richard Llom 2018-01-14 17:00:53 UTC
It would be nice to have baloo support a "robots.txt" like functionality. That is, by a file in a folder (e.g. mypath/robots.txt) one can specify the index behavior for that path. E.g:
- do not index at all
- do not index certain mimetype
- type of indexing (basic or full)
- etc...

I don't care for the exact file name. It probably shouldn't be robots.txt because that could conflict with actual web content.

Ideally something with a leading dot, so it is hidden by default ".indexrc", ".crawlerrc", etc...
Comment 1 vini.ipsmaker 2020-07-30 06:51:24 UTC
GNOME's tracker uses the names ".nomedia" (same as Android) and ".trackerignore" for this feature.

https://gnome.pages.gitlab.gnome.org/tracker/faq/#how-can-i-control-what-tracker-indexes