Bug 161213 - Extreme memory usage when searching for text in large PDF
Summary: Extreme memory usage when searching for text in large PDF
Alias: None
Product: okular
Classification: Applications
Component: general
Version: 0.6.3
Platform: Ubuntu Linux
Importance: NOR normal
Target Milestone: ---
Assignee: Okular developers
Depends on:
Reported: 2008-04-24 00:15 UTC by Dustin Vaselaar
Modified: 2008-05-04 17:11 UTC
CC List: 0 users

See Also:
Latest Commit:
Version Fixed In:


Description Dustin Vaselaar 2008-04-24 00:15:31 UTC
Version:           0.6.3 (using KDE 4.0.3)
Installed from:    Ubuntu Packages
OS:                Linux

When searching for uncommon text using the "Find" function in large PDF files such as:
I experience extreme memory usage.

For example, when searching for the word "abracadabra", the virtual and resident memory increase from approximately 100 MB and 32 MB, respectively, to more than 550 MB and 450 MB. (I stopped the test at that point; otherwise my computer would have become unresponsive.)
Comment 1 Pino Toscano 2008-05-01 21:03:33 UTC
SVN commit 803048 by pino:

Internally replace a TextEntity with a "lighter version" that stores the raw UTF-16 data of the text.
This way, we can save about 4 ints for each text entity; this is not much for small documents,
but with big documents with lots of text (e.g., the PDF specs) we can save a lot (more than 50MB!).

CCBUG: 161213

 M  +84 -29    textpage.cpp  
 M  +8 -8      textpage_p.h  

WebSVN link: http://websvn.kde.org/?view=rev&revision=803048
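
The idea in the commit message above can be illustrated with a minimal sketch. This is not the code from textpage.cpp: the names Rect and LightEntity are hypothetical, and standard C++ types stand in for the Qt types Okular uses. It only shows the general technique of keeping the raw UTF-16 code units and a length in place of a full string object per entity, and rebuilding a string on demand.

```cpp
#include <cstddef>
#include <cstring>
#include <memory>
#include <string>

// Hypothetical rectangle describing where the text sits on the page.
struct Rect {
    double left, top, right, bottom;
};

// Hypothetical "lighter" text entity: one heap block of raw UTF-16
// code units plus a length, instead of a full string object.
class LightEntity {
public:
    LightEntity(const std::u16string &text, const Rect &area)
        : m_data(new char16_t[text.size()]),
          m_length(text.size()),
          m_area(area)
    {
        // Copy only the code units; no terminator or capacity is stored.
        std::memcpy(m_data.get(), text.data(), text.size() * sizeof(char16_t));
    }

    // Rebuild a string object only when a caller actually needs one.
    std::u16string text() const { return std::u16string(m_data.get(), m_length); }

    std::size_t length() const { return m_length; }
    const Rect &area() const { return m_area; }

private:
    std::unique_ptr<char16_t[]> m_data;  // raw UTF-16 data
    std::size_t m_length;                // number of code units
    Rect m_area;
};
```

A caller would construct LightEntity(u"abracadabra", rect) once per word or character and call text() only while matching. The actual per-entity saving depends on the string implementation and platform, so no figure is claimed here beyond what the commit message states.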
Comment 2 Albert Astals Cid 2008-05-04 17:11:03 UTC
SVN commit 803949 by aacid:

limit the number of text pages we keep in memory so that searching does not bring your system to its knees

BUG: 161213

 M  +46 -0     core/document.cpp  
 M  +5 -0      core/document_p.h  
 M  +18 -2     core/generator.cpp  
 M  +5 -0      core/generator.h  
 M  +13 -3     generators/poppler/generator_pdf.cpp  

WebSVN link: http://websvn.kde.org/?view=rev&revision=803949
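
One way to limit the number of text pages kept in memory, as the commit message above describes, is a bounded cache that drops the oldest page when a new one is stored. The sketch below is an assumption about the general approach, not the code from core/document.cpp: the class name TextPageCache, the first-in-first-out eviction order, and the limit passed to the constructor are all hypothetical.

```cpp
#include <cstddef>
#include <deque>
#include <string>
#include <unordered_map>
#include <utility>

// Hypothetical bounded cache of extracted page text.
class TextPageCache {
public:
    explicit TextPageCache(std::size_t maxPages) : m_maxPages(maxPages) {}

    // Store the text of a page, evicting the oldest pages if the limit is hit.
    void store(int pageNumber, std::u16string text)
    {
        auto it = m_pages.find(pageNumber);
        if (it != m_pages.end()) {
            it->second = std::move(text);  // refresh an already cached page
            return;
        }
        if (m_maxPages == 0)
            return;  // caching disabled
        while (m_pages.size() >= m_maxPages) {
            // Assumed policy: drop the page that was stored first.
            m_pages.erase(m_fifo.front());
            m_fifo.pop_front();
        }
        m_pages.emplace(pageNumber, std::move(text));
        m_fifo.push_back(pageNumber);
    }

    bool contains(int pageNumber) const { return m_pages.count(pageNumber) != 0; }
    std::size_t size() const { return m_pages.size(); }

private:
    std::size_t m_maxPages;
    std::deque<int> m_fifo;                           // insertion order
    std::unordered_map<int, std::u16string> m_pages;  // page number -> text
};
```

With a cache like this, a search that walks a large document holds at most maxPages pages of text at any time, so memory stays bounded; the cost is that the text of an evicted page has to be extracted again if it is revisited.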