Bug 287781 - Text selection/copying is broken in the presence of drop-caps
Summary: Text selection/copying is broken in the presence of drop-caps
Status: CONFIRMED
Alias: None
Product: okular
Classification: Applications
Component: general (show other bugs)
Version: 0.19.60
Platform: Compiled Sources Linux
: NOR normal
Target Milestone: ---
Assignee: Okular developers
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2011-11-28 19:00 UTC by Marijn Kruisselbrink
Modified: 2021-03-09 23:59 UTC (History)
1 user (show)

See Also:
Latest Commit:
Version Fixed In:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Marijn Kruisselbrink 2011-11-28 19:00:11 UTC
Version:           unspecified (using KDE 4.6.5) 
OS:                Linux

When trying to select a paragraph that starts with a drop-cap, the order of words inside that paragraph is completely messed up in current okular master. For example try selecting/copying the first paragraph on page 7 of http://www.nwcg.gov/pms/pubs/442/archive/pms442.pdf and notice how the copied text has words in the wrong order. I've also tested it in Okular 0.12.5 and there it is actually slightly better, the text is copied much more in the correct order there. In that version it then of course completely fails with the two-column layout of the document, so it seems likely the support for that caused a regression in handling drop-caps.

Reproducible: Always

Steps to Reproduce:
Open http://www.nwcg.gov/pms/pubs/442/archive/pms442.pdf, go to page 7, using the Text selection tool select the first paragraph of text ("Land management [..] training drivers."), copy and paste in some other application.

Actual Results:  
The pasted text has the order of words completely messed up.

Expected Results:  
The words are copied in the order they appear in the document.
Comment 1 Albert Astals Cid 2011-11-28 21:42:42 UTC
Mamun, can you have a look at this?
Comment 2 Mohammad Mahfuzur Rahman Mamun 2011-11-29 04:40:44 UTC
The problem is due to drop-caps and more due to the bounding box for the characters of drop-caps. If we just select the L(drop-cap), it can be seen that, its bounding rectangle is overlapping with some other characters bounding rectangle. Actually, this is the main reason of the problem. Our focus mainly was to make multi-column selection in journals/papers where this type of situation usually do not occur. There are actually more problems for magazines or newspapers which I would like to handle later.
Comment 3 Justin Zobel 2021-03-09 23:59:32 UTC
Thank you for the bug report.

As this report hasn't seen any changes in 5 years or more, we ask if you can please confirm that the issue still persists.

If this bug is no longer persisting or relevant please change the status to resolved.