Bug 338147

Summary: selecting text from pdf returns garbarge
Product: [Applications] okular Reporter: Christoph Thielecke <crissi99>
Component: PDF backendAssignee: Okular developers <okular-devel>
Status: RESOLVED FIXED    
Severity: normal CC: aacid, yurchor
Priority: NOR    
Version: 0.19.3   
Target Milestone: ---   
Platform: Other   
OS: Linux   
Latest Commit: Version Fixed In:
Sentry Crash Report:

Description Christoph Thielecke 2014-08-09 12:26:45 UTC
selecting text from pdf returns garbarge on sometime pdf files.



Reproducible: Always

Steps to Reproduce:
1. open http://frama-c.com/download/user-manual-Neon-20140301.pdf
2. got to page 3
3. select "Frama-C User Manual"
Actual Results:  
Resulted text is not same as seen on okular, its garbarge. For example, 
text  "Frama-C User Manual" from user-manual-Neon-20140301.pdf results in "❘❡❧❡❛s❡ ◆❡♦♥✲✷✵✶✹✵✸✵✶"


Expected Results:  
Resulted text should same as seen on okular. For example, text  "Frama-C User Manual" from user-manual-Neon-20140301.pdf should result in "Frama-C User Manual"

- the affected file seem to be created using pdftex
Comment 1 Yuri Chornoivan 2014-08-09 12:33:31 UTC
Works fine here with poppler 0.26.3.

Can you tell the version of the poppler library (pdftotext -v)? Can you test the newest version?

Thanks in advance for your answers.
Comment 2 Christoph Thielecke 2014-08-09 12:41:19 UTC
My version on Ubuntu 14.04 (Trusty) is:

pdftotext -v
pdftotext version 0.24.5
Copyright 2005-2013 The Poppler Developers - http://poppler.freedesktop.org
Copyright 1996-2011 Glyph & Cog, LLC
Comment 3 Christoph Thielecke 2014-08-09 13:39:58 UTC
I build package from Ubuntu utopic (http://packages.ubuntu.com/utopic/libpoppler46)
on Trusty and got version 0.26.2

With this version of poppler, this bug went away.

I highly recommend to increase the required poppler version for okular to at least 0.26.2 (or an version which has the fix in).
Comment 4 Albert Astals Cid 2014-08-09 14:46:58 UTC
Well, it works and for some people compiling is a no go, you should always use the newest possible version, but we don't want to kick out of using Okular by having an unreasonable high