Bug 200298

Summary: pdf - copy to clipp board - strange last character(s)
Product: [Applications] okular Reporter: Ferdinand Gassauer <gassauer>
Component: PDF backendAssignee: Okular developers <okular-devel>
Status: RESOLVED NOT A BUG    
Severity: normal    
Priority: NOR    
Version: unspecified   
Target Milestone: ---   
Platform: unspecified   
OS: Linux   
Latest Commit: Version Fixed In:
Attachments: shows strange (unusable) chars
test pdf

Description Ferdinand Gassauer 2009-07-15 14:02:07 UTC
Version:           0.8.90 (using 4.2.96 (KDE 4.2.96 (KDE 4.3 RC2)) "release 142", KDE:KDE4:Factory:Desktop / openSUSE_11.0)
Compiler:          gcc
OS:                Linux (x86_64) release 2.6.25.20-0.4-default

the copied string ends very often with 0A



some programs do not like handle this correctly
Comment 1 Pino Toscano 2009-07-15 14:17:45 UTC
Please
a) attach a sample document showing the issue
b) precise which is the version of the poppler-qt4 library installed on your system
Comment 2 Ferdinand Gassauer 2009-07-15 14:51:39 UTC
libpoppler2-0.6.4-11.1
libpoppler3-0.8.7-5.1
libpoppler4-0.10.6-6.7
libpoppler-devel-0.10.6-6.7
libpoppler-glib2-0.6.4-11.1
libpoppler-glib3-0.8.7-5.1
libpoppler-qt2-0.10.6-6.7
libpoppler-qt4-2-0.6.4-11.1
libpoppler-qt4-3-0.10.6-6.7
poppler-data-0.2.0-12.1
poppler-tools-0.10.6-6.7

see attachment
the line above is copied
the result shows this
similar happens if I past to a file opened in vi - save it and open in okteta
Comment 3 Ferdinand Gassauer 2009-07-15 14:52:53 UTC
Created attachment 35352 [details]
shows strange (unusable) chars
Comment 4 Pino Toscano 2009-07-15 14:58:58 UTC
What's that? I need a _PDF_ document where you copy text from.
Comment 5 Ferdinand Gassauer 2009-07-16 07:09:49 UTC
Created attachment 35375 [details]
test pdf

created with OO 3.1.1
but the problem happens with all pdf's I have tried
Comment 6 Pino Toscano 2009-07-30 01:39:33 UTC
In your document, I can see a trailing 0A triplet when copying text from:
- Acrobat Reader 9.1.2
- Okular 0.8.4 + Poppler 0.10.6
- Evince 2.26.1 + Poppler 0.10.6

Thus, I guess the PDF producer adds those.
Comment 7 Ferdinand Gassauer 2009-07-30 09:45:52 UTC
sorry to reopen
can we assume that adobe produces "good" pdf's?

http://www.adobe.com/devnet/acrobat/pdfs/plugin_apps_developer_guide.pdf
got to page 218
copy the complete "topic "line or below
it says 22 letters copied
and I get 3 times this  triplet 

Topic<here>Description<here>See<here>
Comment 8 Pino Toscano 2009-07-30 09:56:42 UTC
> can we assume that adobe produces "good" pdf's?

never

> http://www.adobe.com/devnet/acrobat/pdfs/plugin_apps_developer_guide.pdf

same result, trailing A0 triplets for all the browsers in comment #6