if I look at the title of a pdf file into the information panel, this consists by the real title followed by a series of strange characters ( perhaps Chinese, Japanese, Korean or unknown) Reproducible: Sometimes Steps to Reproduce: 1. open or create a simple pdf file with Calligra Word or LibreOffice Write. 2. select the pdf file 3. look at the information panel
Thanks for the bug report. Please always include a screenshot when you see something strange in the application. I'm not quite sure if you refer to the preview image (in that case, it would be a problem with the thumbnailer) or to the title of the PDF file (in which case it might be a Nepomuk problem). It would also be good if you could attach a problematic file, because I could not reproduce any problems with a few test files yet. Thanks for your help!
Screenshot is present in the URL section above. Nepomuk is disabled. I don't know how I could create a problematic file in this case.
Created attachment 82380 [details] screenshoot screenshoot
Thanks for the quick reply. (In reply to comment #2) > Screenshot is present in the URL section above. Oops, sorry, I must have missed that! Sorry about that. If I'm not mistaken, this information inside the Information Panel is provided by Nepomuk even if the indexer is disabled, so I'll reassign. > I don't know how I could create a problematic file in this case. Well, if "thisisatest.pdf" does not contain anything private, you could attach it here.
Created attachment 82381 [details] problematic file
Another test file can be fetched from http://www.mabb.de/files/content/document/Foerderung/mabb_Broschuere_OER_in_der_Praxis.pdf It displays "Title: Offene " followed by many garbage characters (looks like binary), actual title should be "Offene Bildungsresourcen (OER) in der Praxis".
Interesting detail: If I hover over the PDF from comment #6 forth and back multiple times, the "Title: Offene" is constant, while the garbage that follows it changes randomly, so it looks like the parser references random pointers.
https://git.reviewboard.kde.org/r/113138/
Git commit 4a719dc3a0a8ee8e896e56544c2dfa642fd0f037 by Christoph Feck. Committed on 06/10/2013 at 23:39. Pushed by cfeck into branch 'KDE/4.11'. Fix trailing garbage in extracted PDF title FIXED-IN: 4.11.3 REVIEW: 113138 M +2 -3 services/fileindexer/indexer/popplerextractor.cpp http://commits.kde.org/nepomuk-core/4a719dc3a0a8ee8e896e56544c2dfa642fd0f037
*** Bug 324706 has been marked as a duplicate of this bug. ***