Bug 328029 - Have highlighted text show in xml file in docdata directory
Summary: Have highlighted text show in xml file in docdata directory
Status: RESOLVED INTENTIONAL
Alias: None
Product: okular
Classification: Applications
Component: general (show other bugs)
Version: unspecified
Platform: unspecified Linux
: NOR wishlist
Target Milestone: ---
Assignee: Okular developers
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2013-11-24 20:05 UTC by Chris George
Modified: 2013-11-27 20:02 UTC (History)
2 users (show)

See Also:
Latest Commit:
Version Fixed In:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Chris George 2013-11-24 20:05:27 UTC
Instead of or in addition to the current positional information, have the text that was highlighted as an annotation appear in the xml file of annotations in the docdata directory.

Reproducible: Always
Comment 1 Albert Astals Cid 2013-11-25 20:55:25 UTC
Sorry but no, the .xml file is there not for information exchange thus adding the text makes no sense in that regard.
Comment 2 Chris George 2013-11-26 00:25:11 UTC
Currently the xml file is the only way I have of accessing the note annotations I am making. 

Does this mean that this functionality is not going to be supported in the future?
Comment 3 Albert Astals Cid 2013-11-27 00:24:47 UTC
No, the xml file is not going away, we're just not adding unneeded data to it as the one you are requesting.
Comment 4 Chris George 2013-11-27 00:28:40 UTC
Unneeded data? I need it. How do I get it out of the pdf?
Comment 5 Fabio D'Urso 2013-11-27 20:02:54 UTC
You can read the coordinates of the highlighted rectangles from the xml file, and use them to extract the corresponding text from the PDF file. I think you can use okular's own API to do that, have a look at Page::text(RegularAreaRect *) (http://api.kde.org/4.x-api/kdegraphics-apidocs/okular/html/classOkular_1_1Page.html#a11ab0f2abe5c1e760c046a33fd5393f3).
You can also directly obtain the list of the annotations using okular's API instead of parsing the xml data.