Bug 473065 - Baloo File Extractor crashes everytime on boot in KFileMetaData::PopplerExtractor::extract
Summary: Baloo File Extractor crashes everytime on boot in KFileMetaData::PopplerExtra...
Status: RESOLVED DOWNSTREAM
Alias: None
Product: frameworks-kfilemetadata
Classification: Frameworks and Libraries
Component: general (other bugs)
Version First Reported In: 5.105.0
Platform: Fedora RPMs Linux
: NOR crash
Target Milestone: ---
Assignee: Pinak Ahuja
URL:
Keywords: drkonqi
Depends on:
Blocks:
 
Reported: 2023-08-06 07:21 UTC by aawasthi
Modified: 2023-11-10 01:11 UTC (History)
2 users (show)

See Also:
Latest Commit:
Version Fixed In:
Sentry Crash Report:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description aawasthi 2023-08-06 07:21:20 UTC
Application: baloo_file_extractor (5.105.0)

Qt Version: 5.15.9
Frameworks Version: 5.105.0
Operating System: Linux 6.4.7-100.fc37.x86_64 x86_64
Windowing System: Wayland
Distribution: Fedora Linux 37 (Workstation Edition)
DrKonqi: 5.27.4 [KCrashBackend]

-- Information about the crash:
Using default Gnome as boot; however, KDE is also installed.

The crash can be reproduced every time.

-- Backtrace:
Application: Baloo File Extractor (baloo_file_extractor), signal: Bus error

[KCrash Handler]
#4  0x00007eddfc360d10 in FileStream::getPos() () from /lib64/libpoppler.so.123
#5  0x00007eddfc3d935d in PDFDoc::setup(std::optional<GooString> const&, std::optional<GooString> const&, std::function<void ()> const&) () from /lib64/libpoppler.so.123
#6  0x00007eddfc3d97d9 in PDFDoc::PDFDoc(std::unique_ptr<GooString, std::default_delete<GooString> >&&, std::optional<GooString> const&, std::optional<GooString> const&, void*, std::function<void ()> const&) () from /lib64/libpoppler.so.123
#7  0x00007eddfc71d610 in Poppler::DocumentData::DocumentData(QString const&, std::optional<GooString> const&, std::optional<GooString> const&) [clone .constprop.0] () from /lib64/libpoppler-qt5.so.1
#8  0x00007eddfc6f8929 in Poppler::Document::load(QString const&, QByteArray const&, QByteArray const&) () from /lib64/libpoppler-qt5.so.1
#9  0x00007eddfd115b90 in KFileMetaData::PopplerExtractor::extract(KFileMetaData::ExtractionResult*) () from /usr/lib64/qt5/plugins/kf5/kfilemetadata/kfilemetadata_popplerextractor.so
#10 0x000055cd42d02428 in Baloo::App::index(Baloo::Transaction*, QString const&, unsigned long long) ()
#11 0x000055cd42d0410c in Baloo::App::processNextFile() ()
#12 0x00007f1e1f0d43e6 in QSingleShotTimer::timerEvent(QTimerEvent*) () from /lib64/libQt5Core.so.5
#13 0x00007f1e1f0c80f5 in QObject::event(QEvent*) () from /lib64/libQt5Core.so.5
#14 0x00007f1e1f09d618 in QCoreApplication::notifyInternal2(QObject*, QEvent*) () from /lib64/libQt5Core.so.5
#15 0x00007f1e1f0edaa1 in QTimerInfoList::activateTimers() () from /lib64/libQt5Core.so.5
#16 0x00007f1e1f0ee3c4 in idleTimerSourceDispatch(_GSource*, int (*)(void*), void*) () from /lib64/libQt5Core.so.5
#17 0x00007f1e1ddd1c7f in g_main_context_dispatch () from /lib64/libglib-2.0.so.0
#18 0x00007f1e1de28178 in g_main_context_iterate.constprop () from /lib64/libglib-2.0.so.0
#19 0x00007f1e1ddcef00 in g_main_context_iteration () from /lib64/libglib-2.0.so.0
#20 0x00007f1e1f0ee71a in QEventDispatcherGlib::processEvents(QFlags<QEventLoop::ProcessEventsFlag>) () from /lib64/libQt5Core.so.5
#21 0x00007f1e1f09c06a in QEventLoop::exec(QFlags<QEventLoop::ProcessEventsFlag>) () from /lib64/libQt5Core.so.5
#22 0x00007f1e1f0a4132 in QCoreApplication::exec() () from /lib64/libQt5Core.so.5
#23 0x000055cd42cfa079 in main ()
[Inferior 1 (process 2536) detached]

Reported using DrKonqi
Comment 1 Nate Graham 2023-08-07 22:01:25 UTC
Looks like the issue is in KFileMetaData or poppler, which is crashing while trying to parse a document for content indexing. Any chance you can attach the PDF file causing this crash? You can find it by running `balooctl failed` in a terminal window?

Obviously if the document contains sensitive atomic secrets, don't attach it. :)  But then it will probably be difficult or impossible to fix the issue, unfortunately.
Comment 2 aawasthi 2023-08-08 17:18:22 UTC
(In reply to Nate Graham from comment #1)
> Looks like the issue is in KFileMetaData or poppler, which is crashing while
> trying to parse a document for content indexing. Any chance you can attach
> the PDF file causing this crash? You can find it by running `balooctl
> failed` in a terminal window?
> 
> Obviously if the document contains sensitive atomic secrets, don't attach
> it. :)  But then it will probably be difficult or impossible to fix the
> issue, unfortunately.

Thank you for your efforts. However, "balooctl failed" returns "All Files were indexed successfully".
Comment 3 Nate Graham 2023-08-08 19:27:18 UTC
Hmm, thanks anyway.
Comment 4 aawasthi 2023-08-23 05:23:02 UTC
The bug has disappeared on reinstalling "Poppler" package, which provides "libpoppler.so.123".
Comment 5 Nate Graham 2023-08-23 15:47:55 UTC
Cool, I guess it was a local installation or configuration issue.
Comment 6 aawasthi 2023-08-24 10:26:30 UTC
(In reply to Nate Graham from comment #5)
> Cool, I guess it was a local installation or configuration issue.

Looks like. Thank you and team for all your efforts.