Version: 0.8.2 (using 4.2.2 (KDE 4.2.2), Gentoo)
Compiler: x86_64-pc-linux-gnu-gcc
OS: Linux (x86_64) release 2.6.28-gentoo-r2

Steps to reproduce:
1) Create a PDF document via LaTeX with a long word at the end of a line, so that it gets split automatically (according to the hyphenation rules of the language)
2) Search for the split word in the PDF
3) See that the hyphenated occurrence is not found

I'll also attach a sample document. In this document, a search for "Gefahrenanalysematrix" returns only one hit. However, the word occurs twice in the document: once split across the first and second lines, and once unsplit in line three.

I use poppler-0.10.5
Created attachment 33035
pdf file to show bug 190433

Search for Gefahrenanalysematrix in this document and see only one hit due to hyphenation
Created attachment 33245
Test case for the bug.

To reproduce the bug, just try to find the word "SIMULAÇÃO". As can easily be seen, the word is present in the main title on the first page of the document.
If we really want to be technically correct, there are no "words" in a PDF document, just characters at some positions. This is the same issue as #161324 (which this depends on), i.e. doing actual text recognition.
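To illustrate the point about characters at positions, here is a minimal sketch of how a search could rebuild lines from positioned glyphs and then glue a line-final hyphen to the start of the next line before matching. This is not Okular's or poppler's actual code: the `Glyph` type and the functions `place_line`, `lines_from_glyphs`, `join_hyphenated` and `count_hits` are hypothetical names made up for this example. It also assumes that spaces are present as glyphs and that y grows upwards on the page.

```python
from dataclasses import dataclass


@dataclass
class Glyph:
    """One character with its position on the page (hypothetical type)."""
    char: str
    x: float
    y: float


def place_line(text, y, char_width=6.0):
    """Helper for the example: lay out a string as glyphs on one baseline."""
    return [Glyph(c, i * char_width, y) for i, c in enumerate(text)]


def lines_from_glyphs(glyphs, y_tolerance=2.0):
    """Group glyphs into lines by baseline, top to bottom, left to right."""
    rows = []
    for g in sorted(glyphs, key=lambda g: (-g.y, g.x)):
        if rows and abs(rows[-1][0].y - g.y) <= y_tolerance:
            rows[-1].append(g)
        else:
            rows.append([g])
    return ["".join(g.char for g in sorted(row, key=lambda g: g.x))
            for row in rows]


def join_hyphenated(lines):
    """Join lines into one string, removing a hyphen at the end of a line.

    Assumption: every line-final '-' is a soft hyphen. A real hyphen at a
    line end (e.g. "well-" / "known") would be removed as well.
    """
    text = ""
    for line in lines:
        line = line.strip()
        if text.endswith("-"):
            text = text[:-1] + line      # drop the hyphen, glue the halves
        elif text:
            text += " " + line
        else:
            text = line
    return text


def count_hits(glyphs, word):
    """Count occurrences of word after de-hyphenation."""
    return join_hyphenated(lines_from_glyphs(glyphs)).count(word)


# Same situation as in the attached sample: one split and one unsplit occurrence.
page = (place_line("Die Gefahren-", 700.0)
        + place_line("analysematrix ist", 688.0)
        + place_line("Gefahrenanalysematrix", 676.0))

naive_hits = "\n".join(lines_from_glyphs(page)).count("Gefahrenanalysematrix")
fixed_hits = count_hits(page, "Gefahrenanalysematrix")
```

With this layout the line-by-line search sees only the unsplit occurrence, while the de-hyphenated text contains both. Treating every line-final hyphen as a soft hyphen is the simplest choice; telling real hyphens apart would need a dictionary or language rules, which is part of why this is a text recognition problem.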
*** Bug 148458 has been marked as a duplicate of this bug. ***
*** Bug 228245 has been marked as a duplicate of this bug. ***
*** Bug 253371 has been marked as a duplicate of this bug. ***
Will be fixed in Okular from KDE 4.9.0 thanks to the work of Mahfuzur Rahman Mamun