if I look at the title of a pdf file into the information panel, this consists by the real title followed by a series of strange characters ( perhaps Chinese, Japanese, Korean or unknown)
Steps to Reproduce:
1. open or create a simple pdf file with Calligra Word or LibreOffice Write.
2. select the pdf file
3. look at the information panel
Thanks for the bug report. Please always include a screenshot when you see something strange in the application. I'm not quite sure if you refer to the preview image (in that case, it would be a problem with the thumbnailer) or to the title of the PDF file (in which case it might be a Nepomuk problem).
It would also be good if you could attach a problematic file, because I could not reproduce any problems with a few test files yet. Thanks for your help!
Screenshot is present in the URL section above. Nepomuk is disabled. I don't know how I could create a problematic file in this case.
Created attachment 82380 [details]
Thanks for the quick reply.
(In reply to comment #2)
> Screenshot is present in the URL section above.
Oops, sorry, I must have missed that! Sorry about that.
If I'm not mistaken, this information inside the Information Panel is provided by Nepomuk even if the indexer is disabled, so I'll reassign.
> I don't know how I could create a problematic file in this case.
Well, if "thisisatest.pdf" does not contain anything private, you could attach it here.
Created attachment 82381 [details]
Another test file can be fetched from http://www.mabb.de/files/content/document/Foerderung/mabb_Broschuere_OER_in_der_Praxis.pdf
It displays "Title: Offene " followed by many garbage characters (looks like binary), actual title should be "Offene Bildungsresourcen (OER) in der Praxis".
Interesting detail: If I hover over the PDF from comment #6 forth and back multiple times, the "Title: Offene" is constant, while the garbage that follows it changes randomly, so it looks like the parser references random pointers.
Git commit 4a719dc3a0a8ee8e896e56544c2dfa642fd0f037 by Christoph Feck.
Committed on 06/10/2013 at 23:39.
Pushed by cfeck into branch 'KDE/4.11'.
Fix trailing garbage in extracted PDF title
M +2 -3 services/fileindexer/indexer/popplerextractor.cpp
*** Bug 324706 has been marked as a duplicate of this bug. ***