Bug 422008 - Baloo repeatedly fails to index certain files (fails in PostingDB::put MDB_BAD_TXN: Transaction must abort, has a child, or is invalid) without crashing
Summary: Baloo repeatedly fails to index certain files (fails in PostingDB::put MDB_BA...
Status: RESOLVED UPSTREAM
Alias: None
Product: frameworks-baloo
Classification: Frameworks and Libraries
Component: Baloo File Daemon (show other bugs)
Version: 5.70.0
Platform: Arch Linux Linux
: NOR major
Target Milestone: ---
Assignee: Stefan Brüns
URL: https://bugs.openldap.org/show_bug.cg...
Keywords:
Depends on:
Blocks:
 
Reported: 2020-05-24 12:38 UTC by Leon De Andrade
Modified: 2022-06-09 16:01 UTC (History)
2 users (show)

See Also:
Latest Commit:
Version Fixed In:


Attachments
baloo stuck (239.35 KB, image/png)
2020-05-24 12:38 UTC, Leon De Andrade
Details
crash, maybe relevant (3.80 KB, text/plain)
2020-05-25 09:42 UTC, Leon De Andrade
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Leon De Andrade 2020-05-24 12:38:28 UTC
Created attachment 128743 [details]
baloo stuck

SUMMARY

Baloo tries to index the same files (sucessfully) over and over again.
See the screenshots, about 20 files are indexed forever. 
The files seem to be my most recent ones.

STEPS TO REPRODUCE
I'm not sure, I have problems with baloo since ever on different machines. It's the first time I actually checked what the problem is.

Please tell me if I can provide more debug information.
Comment 1 Stefan Brüns 2020-05-24 15:47:51 UTC
For one of the affected files, after each index run, provide the output of:

$> date && stat <filename) && balooshow <filename>

$> df -T /home/
Comment 2 Stefan Brüns 2020-05-24 15:49:01 UTC
also:
$> df -T /home/leon/Downloads/
Comment 3 Leon De Andrade 2020-05-24 16:00:31 UTC
Okay I will, once I notice it again. 

Which reminds me that I probably should say that if I suspend baloo and try to continue it later (not sure how much later) it won't.

If I just download something new, it will index normally and go back to Idle (Powersave).
Comment 4 Leon De Andrade 2020-05-25 07:08:37 UTC
So today it started again, after a restart. Maybe there is connection to restarting my system, will investigate that further.

As yesterday it lists the most recent files 

/home/leon/untitled/mainwindow.ui:

date && stat /home/leon/untitled/mainwindow.ui && balooshow /home/leon/untitled/mainwindow.ui

Mo 25. Mai 08:59:52 CEST 2020
  File: /home/leon/untitled/mainwindow.ui
  Size: 536             Blocks: 8          IO Block: 4096   regular file
Device: 806h/2054d      Inode: 8522113     Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1000/    leon)   Gid: (  985/   users)
Access: 2020-05-24 21:09:23.932741559 +0200
Modify: 2020-05-24 21:09:22.442695158 +0200
Change: 2020-05-24 21:09:22.452695469 +0200
 Birth: 2020-05-24 21:09:22.442695158 +0200
36602196627818502 2054 8522113 /home/leon/untitled/mainwindow.ui
        Mtime: 1590347362 2020-05-24T21:09:22
        Ctime: 1590347362 2020-05-24T21:09:22

df -T /home/

Filesystem     Type 1K-blocks      Used Available Use% Mounted on
/dev/sda6      ext4 296389012 120827480 160436092  43% /home

df -T /home/leon/Downloads/

Filesystem     Type 1K-blocks      Used Available Use% Mounted on
/dev/sda6      ext4 296389012 120827540 160436032  43% /home




date && stat /home/leon/untitled/mainwindow.ui && balooshow /home/leon/untitled/mainwindow.ui

Mo 25. Mai 09:01:25 CEST 2020
  File: /home/leon/untitled/mainwindow.ui
  Size: 536             Blocks: 8          IO Block: 4096   regular file
Device: 806h/2054d      Inode: 8522113     Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1000/    leon)   Gid: (  985/   users)
Access: 2020-05-24 21:09:23.932741559 +0200
Modify: 2020-05-24 21:09:22.442695158 +0200
Change: 2020-05-24 21:09:22.452695469 +0200
 Birth: 2020-05-24 21:09:22.442695158 +0200
36602196627818502 2054 8522113 /home/leon/untitled/mainwindow.ui
        Mtime: 1590347362 2020-05-24T21:09:22
        Ctime: 1590347362 2020-05-24T21:09:22

df -T /home/
Filesystem     Type 1K-blocks      Used Available Use% Mounted on
/dev/sda6      ext4 296389012 120827548 160436024  43% /home

df -T /home/leon/Downloads/

Filesystem     Type 1K-blocks      Used Available Use% Mounted on
/dev/sda6      ext4 296389012 120827040 160436532  43% /home


date && stat /home/leon/untitled/mainwindow.ui && balooshow /home/leon/untitled/mainwindow.ui

Mo 25. Mai 09:07:19 CEST 2020
  File: /home/leon/untitled/mainwindow.ui
  Size: 536             Blocks: 8          IO Block: 4096   regular file
Device: 806h/2054d      Inode: 8522113     Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1000/    leon)   Gid: (  985/   users)
Access: 2020-05-24 21:09:23.932741559 +0200
Modify: 2020-05-24 21:09:22.442695158 +0200
Change: 2020-05-24 21:09:22.452695469 +0200
 Birth: 2020-05-24 21:09:22.442695158 +0200
36602196627818502 2054 8522113 /home/leon/untitled/mainwindow.ui
        Mtime: 1590347362 2020-05-24T21:09:22
        Ctime: 1590347362 2020-05-24T21:09:22

df -T /home/
Filesystem     Type 1K-blocks      Used Available Use% Mounted on
/dev/sda6      ext4 296389012 120827828 160435744  43% /home

df -T /home/leon/Downloads/

Filesystem     Type 1K-blocks      Used Available Use% Mounted on
/dev/sda6      ext4 296389012 120827832 160435740  43% /home
Comment 5 Leon De Andrade 2020-05-25 07:25:03 UTC
15 mins later:

date && stat /home/leon/untitled/mainwindow.ui && balooshow /home/leon/untitled/mainwindow.ui
Mo 25. Mai 09:23:37 CEST 2020
  File: /home/leon/untitled/mainwindow.ui
  Size: 536             Blocks: 8          IO Block: 4096   regular file
Device: 806h/2054d      Inode: 8522113     Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1000/    leon)   Gid: (  985/   users)
Access: 2020-05-24 21:09:23.932741559 +0200
Modify: 2020-05-24 21:09:22.442695158 +0200
Change: 2020-05-24 21:09:22.452695469 +0200
 Birth: 2020-05-24 21:09:22.442695158 +0200
36602196627818502 2054 8522113 /home/leon/untitled/mainwindow.ui
        Mtime: 1590347362 2020-05-24T21:09:22
        Ctime: 1590347362 2020-05-24T21:09:22


df -T /home/
Filesystem     Type 1K-blocks      Used Available Use% Mounted on
/dev/sda6      ext4 296389012 120828708 160434864  43% /home

df -T /home/leon/Downloads/
Filesystem     Type 1K-blocks      Used Available Use% Mounted on
/dev/sda6      ext4 296389012 120828712 160434860  43% /home
Comment 6 Leon De Andrade 2020-05-25 09:42:07 UTC
Created attachment 128762 [details]
crash, maybe relevant

Just now baloo crashed, I'm not sure though if it's related or any helpful.
Comment 7 Bug Janitor Service 2020-06-09 04:33:08 UTC
Dear Bug Submitter,

This bug has been in NEEDSINFO status with no change for at least
15 days. Please provide the requested information as soon as
possible and set the bug status as REPORTED. Due to regular bug
tracker maintenance, if the bug is still in NEEDSINFO status with
no change in 30 days the bug will be closed as RESOLVED > WORKSFORME
due to lack of needed information.

For more information about our bug triaging procedures please read the
wiki located here:
https://community.kde.org/Guidelines_and_HOWTOs/Bug_triaging

If you have already provided the requested information, please
mark the bug as REPORTED so that the KDE team knows that the bug is
ready to be confirmed.

Thank you for helping us make KDE software even better for everyone!
Comment 8 Leon De Andrade 2020-06-10 08:54:50 UTC
Additional infos provided
Comment 9 Stefan Brüns 2020-06-10 22:04:28 UTC
There is nothing standing out in the provided infos to explain the behavior.

Is "Appimage" really the correct platform?
Comment 10 Leon De Andrade 2020-06-18 07:41:12 UTC
No, it's not sorry. I must have clicked the wrong one, I'm on arch.
Comment 11 Leon De Andrade 2020-07-26 11:40:48 UTC
Still there! Everytime I reboot my pc baloo starts and won't stop ... is there anything I can more do to help fixing this bug?
Comment 12 Leon De Andrade 2020-08-18 13:52:47 UTC
I looked into journalctl and there were a couple of informations repeating every few seconds.

Invalid encoding. Ignoring "/home/leon/Downloads/Telegram Desktop/BitVis_0.0.8/zebra.bin":

After deleting this file, baloo is still stuck, but the cpu usage had dropped massively! So it seems like the "Ignoring" doesn't work so well.

baloo_file_extractor[3542]: org.kde.baloo.engine: PostingDB::put MDB_BAD_TXN: Transaction must abort, has a child, or is invalid
baloo_file_extractor[3542]: org.kde.baloo.engine: PositionDB::put MDB_BAD_TXN: Transaction must abort, has a child, or is invalid

Then these two messages appear like 20 times and repeat every few seconds. I guess these are the culprit for the loop. Does this help you?
Comment 13 Nate Graham 2020-10-26 16:57:44 UTC
I can't reproduce the looping anymore, as Baloo not notices repeated failures and stops after a bit. However I can reproduce the failure to index with this error message for a few of my text files.
Comment 14 Nate Graham 2020-10-27 12:27:41 UTC
This appears to be an upstream issue in lmdb itself: https://bugs.openldap.org/show_bug.cgi?id=9360