Bug 491593 - Feature: Tool to Extract References, similar to Annotations/Bookmarks panel
Status: REPORTED
Alias: None
Product: okular
Classification: Applications
Component: general
Version: 23.08.1
Platform: Other Microsoft Windows
Importance: NOR wishlist
Target Milestone: ---
Assignee: Okular developers
 
Reported: 2024-08-11 20:07 UTC by Leonard Mada
Modified: 2024-08-12 21:01 UTC


Attachments
PDF document with References without links (1.57 MB, application/pdf)
2024-08-11 20:07 UTC, Leonard Mada

Description Leonard Mada 2024-08-11 20:07:34 UTC
Created attachment 172521
PDF document with References without links

Tool to Extract References: similar to Annotations/Bookmarks panel

SUMMARY
Some journals do not provide links for the references (even as of 2015). Navigating manually to the Bibliography section can be quite annoying (see the attached PDF document).

A tool to automatically extract the references would be very useful:
- extract the references on each page, e.g. using a pseudo-regex such as
<superscript>number(-number)?(,number(-number)?)*</superscript>;
- display them in a side panel (similar to the Annotations panel);
- let the user specify which pages contain the Bibliography;
- link the extracted references to the Bibliography.
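
To illustrate the extraction step, here is a minimal Python sketch of the pseudo-regex above. It assumes the superscript text has already been pulled out of the page as a plain string, and detecting superscripts is not covered. The names CITATION_RE and expand_citation are hypothetical, and accepting an en dash and optional spaces next to the plain hyphen is an assumption about how journals typeset ranges. This is not Okular code.

```python
import re

# Hypothetical pattern mirroring number(-number)?(,number(-number)?)*
# Assumption: ranges may use "-" or an en dash, with optional spaces.
CITATION_RE = re.compile(
    r"\d+(?:\s*[-\u2013]\s*\d+)?(?:\s*,\s*\d+(?:\s*[-\u2013]\s*\d+)?)*"
)

def expand_citation(text):
    """Expand a citation string such as '108-110,112' into a list of numbers."""
    text = text.strip()
    if not CITATION_RE.fullmatch(text):
        raise ValueError(f"not a citation list: {text!r}")
    numbers = []
    for part in text.split(","):
        # A part is either a single number or a 'start-end' range.
        bounds = re.split(r"[-\u2013]", part)
        start, end = int(bounds[0]), int(bounds[-1])
        if end < start:
            raise ValueError(f"descending range: {part.strip()!r}")
        numbers.extend(range(start, end + 1))
    return numbers

print(expand_citation("108-110,112"))
```

A panel could store the expanded numbers for each page and show them as individual entries, so that a range such as 108-110 yields one entry per reference.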


STEPS TO REPRODUCE
1. Open the attached PDF document
2. Go to pages 2-3 (or 3-4), section "Literature Update"
3. Note that the text contains many references

OBSERVED RESULT
There is NO usable link for any of the references.
Note: the paragraph appears to contain a link, but this external link is of no use here: the user still has to browse to the Bibliography pages to look up the details of the references. The link also shows up in MS Edge, so it presumably comes from the PDF document itself.

EXPECTED RESULT
Extract the references (superscript text) and display them in a specialized tab.
Link these references to the Bibliography pages:
- the user could specify the page on which each set of references starts, e.g.:
1 => page 6; 23 => page 7; 70 => page 8; 108 => page 9; ...
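
The lookup implied by such a mapping could be sketched as follows, in Python. The table repeats only the four entries given above; the rest of the mapping is elided in this report, so in this sketch every reference from 108 onward resolves to page 9. The names PAGE_STARTS and page_for_reference are hypothetical.

```python
from bisect import bisect_right

# First reference number on each Bibliography page, as given in the example.
# The remaining entries are not listed in the report and are left out here.
PAGE_STARTS = {1: 6, 23: 7, 70: 8, 108: 9}

def page_for_reference(ref, page_starts):
    """Return the Bibliography page whose starting reference is the largest one <= ref."""
    starts = sorted(page_starts)
    idx = bisect_right(starts, ref) - 1
    if idx < 0:
        raise ValueError(f"reference {ref} precedes the first Bibliography entry")
    return page_starts[starts[idx]]

print(page_for_reference(90, PAGE_STARTS))
```

With this approach the user only has to enter one number per Bibliography page, and each extracted reference can then be linked to its page.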

I have extracted the references manually for the example section:
- the text below was processed manually and is included for easier understanding;
- the "References" panel would display only the numbers and link them to the Bibliography pages;

Vaccine type:
one or more short TAA-derived peptides: 90-142, 143-145
full-length TAAs: 146
“synthetic long peptides” (SLPs): 147-150, 152
“carbohydrate-mimetic peptides” (CMPs): 151, 153-155

Malignancies:
various hematological malignancies: 90,111,122,126
melanoma: 99,100,113,141
breast: 94,96,134,138,151
head and neck: 102,103
gastroesophageal: 108,121,130
pulmonary: 95,105,114,142
pancreatic: 109,115,117,124,125,132,135,137,148
prostate: 97,98,104,107,112,119,139
ovarian: 116,131
colorectal: 91,120,129,136,140

Standalone adjuvanted interventions injected intradermally or intratumorally: 94,102,104,105,108-110,112,115,116,119,121,123,124,128,129,131,138,140-142,146,147,149,156
[...]


SOFTWARE/OS VERSIONS
Windows: 10

ADDITIONAL INFORMATION