Summary: | wrong orientation of the extracted text | ||
---|---|---|---|
Product: | [Applications] okular | Reporter: | Salvo "LtWorf" Tomaselli <tiposchi> |
Component: | PDF backend | Assignee: | Okular developers <okular-devel> |
Status: | RESOLVED FIXED | ||
Severity: | normal | CC: | etienne.rebetez, thp |
Priority: | NOR | ||
Version: | 0.6 | ||
Target Milestone: | --- | ||
Platform: | unspecified | ||
OS: | Linux | ||
Latest Commit: | Version Fixed In: |
Description
Salvo "LtWorf" Tomaselli
2008-02-27 19:57:02 UTC
On Wed, Feb 27, 2008 at 06:57:03PM -0000, Salvo Tomaselli wrote:
Hej,
> When I paste the text, the resulting text contains some words from the selected
> area (but not the whole selected text), and also words from unselected areas.
@Pino or @Albert: I can reproduce it here. Somehow the rotation
calculation is messed up. If you rotate the document ccw and select the
bottom area of the page, you'll get the headline of the text... really
strange.
Is it a poppler bug?
Ciao,
Tobias
Not sure, but while the pages have a 270 degree rotation, somehow the text does not. I have a local fix that makes the orientation testcase (from poppler) work fine, but with this document the result gets even worse than it is now...

Here are another two documents which show this problem:
http://www.ifi.uzh.ch/dbtg/uploads/media/01-Einfuehrung_03.pdf
http://www.ifi.uzh.ch/dbtg/uploads/media/02-XML_03.pdf
Additionally, annotation tools which rely on text selection do not work correctly. Evince 2.21.91 is able to select text correctly in these documents, so poppler is perhaps not the root of the problem.

SVN commit 784735 by pino:
Apply Albert's patch to use the new functions in Poppler-Qt4 for getting the bounding box of the characters in the correct way (almost).
(Unfortunately, this requires the master version from Poppler's GIT repository, to be released hopefully today as 0.8RC1.)
BUG: 158517
M +51 -1 generator_pdf.cpp
WebSVN link: http://websvn.kde.org/?view=rev&revision=784735

SVN commit 784736 by pino:
Backport: apply Albert's patch to use the new functions in Poppler-Qt4 for getting the bounding box of the characters in the correct way (almost).
(Unfortunately, this requires the master version from Poppler's GIT repository, to be released hopefully today as 0.8RC1.)
CCBUG: 158517
M +51 -1 generator_pdf.cpp
WebSVN link: http://websvn.kde.org/?view=rev&revision=784736

Giving a better title, so it can be found easily.

*** Bug 161016 has been marked as a duplicate of this bug. ***
*** Bug 160274 has been marked as a duplicate of this bug. ***
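
As a rough illustration of the mismatch discussed in this report (page rotated by 270 degrees, text boxes left unrotated), the sketch below maps a normalized bounding box into the rotated page and compares hit tests against both versions. The `Point` and `Rect` types and the functions `rotatePoint`, `rotateRect` and `contains` are hypothetical names made up for this example; this is not the code from generator_pdf.cpp or the Poppler-Qt4 API, and it assumes normalized coordinates with the origin at the top-left and clockwise rotation.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical types for illustration only; not Okular's or Poppler's classes.
struct Point { double x, y; };
struct Rect { double left, top, right, bottom; };

// Map a point from unrotated page space to the displayed page, rotated
// clockwise by `degrees` (a multiple of 90). Coordinates are normalized
// to [0, 1], origin at the top-left, y pointing down (an assumption).
Point rotatePoint(Point p, int degrees) {
    assert(degrees % 90 == 0);
    switch (((degrees % 360) + 360) % 360) {
    case 90:  return {1.0 - p.y, p.x};
    case 180: return {1.0 - p.x, 1.0 - p.y};
    case 270: return {p.y, 1.0 - p.x};
    default:  return p;
    }
}

// Rotate two opposite corners, then re-sort them so that
// left <= right and top <= bottom still hold.
Rect rotateRect(Rect r, int degrees) {
    Point a = rotatePoint({r.left, r.top}, degrees);
    Point b = rotatePoint({r.right, r.bottom}, degrees);
    return {std::min(a.x, b.x), std::min(a.y, b.y),
            std::max(a.x, b.x), std::max(a.y, b.y)};
}

// Simple hit test used to decide whether a selection point touches a box.
bool contains(const Rect &r, Point p) {
    return p.x >= r.left && p.x <= r.right && p.y >= r.top && p.y <= r.bottom;
}
```

With a headline box of `{0.25, 0.0, 0.75, 0.125}` on the unrotated page, `rotateRect(box, 270)` gives a narrow vertical strip along the left edge. A selection point near the top centre of the displayed page would hit the unrotated box but not the rotated one, which is the kind of wrong-text selection described here when the boxes are not rotated along with the page.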