328029 – Have highlighted text show in xml file in docdata directory

Summary: Have highlighted text show in xml file in docdata directory

Status:	RESOLVED INTENTIONAL

Alias:	None

Product:	okular
Classification:	Applications
Component:	general
Version:	unspecified
Platform:	unspecified Linux

Importance:	NOR wishlist
Target Milestone:	---
Assignee:	Okular developers

URL:
Keywords:

Depends on:
Blocks:

Reported:	2013-11-24 20:05 UTC by Chris George
Modified:	2013-11-27 20:02 UTC
CC List:	2 users

See Also:
Latest Commit:
Version Fixed In:
Sentry Crash Report:

Description Chris George 2013-11-24 20:05:27 UTC

Instead of or in addition to the current positional information, have the text that was highlighted as an annotation appear in the xml file of annotations in the docdata directory.

Reproducible: Always

Comment 1 Albert Astals Cid 2013-11-25 20:55:25 UTC

Sorry, but no. The .xml file is not there for information exchange, so adding the text makes no sense in that regard.

Comment 2 Chris George 2013-11-26 00:25:11 UTC

Currently the xml file is the only way I have of accessing the note annotations I am making. 

Does this mean that this functionality is not going to be supported in the future?

Comment 3 Albert Astals Cid 2013-11-27 00:24:47 UTC

No, the xml file is not going away; we're just not adding unneeded data to it, such as the text you are requesting.

Comment 4 Chris George 2013-11-27 00:28:40 UTC

Unneeded data? I need it. How do I get it out of the pdf?

Comment 5 Fabio D'Urso 2013-11-27 20:02:54 UTC

You can read the coordinates of the highlighted rectangles from the xml file, and use them to extract the corresponding text from the PDF file. I think you can use okular's own API to do that, have a look at Page::text(RegularAreaRect *) (http://api.kde.org/4.x-api/kdegraphics-apidocs/okular/html/classOkular_1_1Page.html#a11ab0f2abe5c1e760c046a33fd5393f3).
You can also directly obtain the list of the annotations using okular's API instead of parsing the xml data.
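
As a rough illustration of the first step suggested above (reading the highlight coordinates from the xml file), here is a minimal Python sketch. It rests on assumptions that should be checked against a real file from the docdata directory: that each page is a `<page number="...">` element, that each highlight is an `<annotation>` containing `<quad>` elements with `ax`/`ay` ... `dx`/`dy` attributes, and that the coordinates are normalized to the page (0 to 1). The `SAMPLE` document and the `highlight_rects` helper are made up for the example and are not part of okular.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample mimicking the assumed layout of an okular docdata file.
SAMPLE = """<!DOCTYPE documentInfo>
<documentInfo>
 <pageList>
  <page number="0">
   <annotationList>
    <annotation type="4">
     <hl>
      <quad ax="0.25" ay="0.5" bx="0.75" by="0.5"
            cx="0.75" cy="0.625" dx="0.25" dy="0.625"/>
     </hl>
    </annotation>
   </annotationList>
  </page>
 </pageList>
</documentInfo>"""


def highlight_rects(xml_text):
    """Return [(page_number, [(left, top, right, bottom), ...]), ...].

    Each quad is reduced to its axis-aligned bounding rectangle, in the
    same (assumed normalized) coordinates as stored in the file.
    """
    root = ET.fromstring(xml_text)
    results = []
    for page in root.iter("page"):
        page_no = int(page.get("number", "-1"))
        for annotation in page.iter("annotation"):
            rects = []
            for quad in annotation.iter("quad"):
                xs = [float(quad.get(k)) for k in ("ax", "bx", "cx", "dx")]
                ys = [float(quad.get(k)) for k in ("ay", "by", "cy", "dy")]
                rects.append((min(xs), min(ys), max(xs), max(ys)))
            if rects:  # skip annotations that carry no quads (e.g. notes)
                results.append((page_no, rects))
    return results


rects = highlight_rects(SAMPLE)
print(rects)
```

The rectangles would then have to be scaled by the page size and passed to whatever tool extracts the text from the PDF, for example the Page::text(RegularAreaRect *) call mentioned above. That second step is not shown here.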