I have an idea that search dialogs such as krunner or any filter fields across kde would support enhanced diacritics matching. So that non-diacritic input would match all diacritic variants of text. Reproducible: Always Steps to Reproduce: 1.have KDE in languge whose alphabet contains diacritics (ENGLISH has not it! :|) like Slavic languages. open any text filter dialog like System settings OR krunner. 2. you are about to find "systém" OR "system" string. 3. you type "system" Actual Results: only "system" string matches Expected Results: both "system", "systém" and other variants (ěë) would match. This feature exists in Wind0ws 7.
This issue still exists and it is confusing because no other platform I've used differentiates between something like system and systém. Is there any reason why it shouldn't be as easy to solve as stripping the diacritics from words by converting it to ASCII?
Sorry for my previous comment, it sounded too confrontational.
Tested it just now and both "system" and "systém" work. But for example pisma (== "fonts" in Czech) doesn't work but search for "font" finds "Písma" so I'm thinking in some cases it is the case where diacritics are accounted for and in other cases it doesn't find keywords.
*** Bug 316077 has been marked as a duplicate of this bug. ***
*** Bug 414689 has been marked as a duplicate of this bug. ***
*** Bug 328763 has been marked as a duplicate of this bug. ***
A similar patch has been made to baloo quite some time ago: https://invent.kde.org/frameworks/baloo/commit/59318e9694c0847bcaa5e71a4fbadde877e7a33e
*** Bug 426017 has been marked as a duplicate of this bug. ***
I am not sure how this should be implemented, should the diacritics be removed like in the baloo patch or should we make sure that we check both the stripped and normal variant for matches? Maybe a user whose languages actually uses diacritics can comment :)
Hi! I think it is necessary to also search a non-stripped version. This is because the meaning can change leading to different results especially in file searches. As an example, italian "e" translates to "and", while "è" translates to "is".
I agree with the above comment. Both need to be searched.
I've just migrated a couple of systems from Windows 10 to Linux with KDE. My native language is Greek and the issue is the one the OP has described years ago. For years everything (Web searches, most searches in office apps, instant messager searches, android searches) works diacretic-less. In windows it wouldn't matter if Ι searced for ενέρ or ενερ to match "Παροχή ενέργειας". Switching to diacretics typing is always a fuss, and for that reason Windows has included an option to disable diacretics-significance in searching, regardless if that is made for file names or file content. (Note: the ONLY notable exception I can think of is the Viber app, whose developers insist on having diacretic-based searches). Please do consider including this option system-wide in KDE, which could be disabled by default (for compatibility purposes). I can think of two use cases to pursue this: 1) The rather significant number of users switching to windows to KDE-based distros (KDE keyboard shortcuts similarity with Windows was a major reason for me to switch) 2) A lot of lower-quality Greek text is not punctuated at all; diacretic-based searching would fail in this scenario.