selecting text from pdf returns garbarge on sometime pdf files. Reproducible: Always Steps to Reproduce: 1. open http://frama-c.com/download/user-manual-Neon-20140301.pdf 2. got to page 3 3. select "Frama-C User Manual" Actual Results: Resulted text is not same as seen on okular, its garbarge. For example, text "Frama-C User Manual" from user-manual-Neon-20140301.pdf results in "❘❡❧❡❛s❡ ◆❡♦♥✲✷✵✶✹✵✸✵✶" Expected Results: Resulted text should same as seen on okular. For example, text "Frama-C User Manual" from user-manual-Neon-20140301.pdf should result in "Frama-C User Manual" - the affected file seem to be created using pdftex
Works fine here with poppler 0.26.3. Can you tell the version of the poppler library (pdftotext -v)? Can you test the newest version? Thanks in advance for your answers.
My version on Ubuntu 14.04 (Trusty) is: pdftotext -v pdftotext version 0.24.5 Copyright 2005-2013 The Poppler Developers - http://poppler.freedesktop.org Copyright 1996-2011 Glyph & Cog, LLC
I build package from Ubuntu utopic (http://packages.ubuntu.com/utopic/libpoppler46) on Trusty and got version 0.26.2 With this version of poppler, this bug went away. I highly recommend to increase the required poppler version for okular to at least 0.26.2 (or an version which has the fix in).
Well, it works and for some people compiling is a no go, you should always use the newest possible version, but we don't want to kick out of using Okular by having an unreasonable high