Bug 388967 - FR: Support for "robots.txt" like functionality
Summary: FR: Support for "robots.txt" like functionality
Status: CONFIRMED
Alias: None
Product: frameworks-baloo
Classification: Frameworks and Libraries
Component: general (show other bugs)
Version: unspecified
Platform: Other Linux
: NOR wishlist
Target Milestone: ---
Assignee: Pinak Ahuja
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-01-14 17:00 UTC by Richard Llom
Modified: 2024-07-09 19:13 UTC (History)
2 users (show)

See Also:
Latest Commit:
Version Fixed In:
Sentry Crash Report:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Richard Llom 2018-01-14 17:00:53 UTC
It would be nice to have baloo support a "robots.txt" like functionality. That is, by a file in a folder (e.g. mypath/robots.txt) one can specify the index behavior for that path. E.g:
- do not index at all
- do not index certain mimetype
- type of indexing (basic or full)
- etc...

I don't care for the exact file name. It probably shouldn't be robots.txt because that could conflict with actual web content.

Ideally something with a leading dot, so it is hidden by default ".indexrc", ".crawlerrc", etc...
Comment 1 vini.ipsmaker 2020-07-30 06:51:24 UTC
GNOME's tracker uses the names ".nomedia" (same as Android) and ".trackerignore" for this feature.

https://gnome.pages.gitlab.gnome.org/tracker/faq/#how-can-i-control-what-tracker-indexes