| Summary: | Duplicate search restriction is sometimes ignored | ||
|---|---|---|---|
| Product: | [Applications] digikam | Reporter: | Michael <mas_masterandservant> |
| Component: | Searches-Similarity | Assignee: | Digikam Developers <digikam-bugs-null> |
| Status: | REPORTED --- | ||
| Severity: | normal | CC: | caulier.gilles, metzpinguin |
| Priority: | NOR | ||
| Version First Reported In: | 8.1.0 | ||
| Target Milestone: | --- | ||
| Platform: | Microsoft Windows | ||
| OS: | Microsoft Windows | ||
| Latest Commit: | Version Fixed/Implemented In: | ||
| Sentry Crash Report: | |||
| Attachments: |
Faulty result
Config for finding duplicates |
||
|
Description
Michael
2023-08-16 12:49:04 UTC
I think you misunderstand the album restriction. I also had to look a little at the source code. It more or less does not refer to the reference albums chosen above. The Album Restriction applies to the album of the found reference image. We can argue about whether all these functions make sense. The selection of the reference album or the determination of the reference image is new and good. I find the restriction of the album selector unnecessary (Only Tags, current Tab...). It just caused currently problems for a user on the mailing list. Maik Then I guess I'm the next one to have problems with this. I usually move large amounts of images. For a gut feeling, I have attached my database statistics below. There are already duplicates in the database that I have to keep with different metadata. On the other hand, I already have a few thousand images to import again and I have to find the images in those imports that are already in the database. So my requirement is to find all the images in the import that are already in the database. But I only want to delete duplicates from the import. So practically the whole existing database is the reference. Sometimes I also get a request to preserve a duplicate from the import and add it to the current database with new metadata. So it would help me if I can define what the albums of the reference images are and which albums are the images to be searched. Whether an album is then just a reference, a search or both is then completely up to me. The rest regarding quality and information content of an image, I could cover so far with the advanced search. So in principle, I don't need this function to automatically select the reference, because I usually do that in advance via other functions and filter out the required qualities. So now my question regarding the duplicate search, do you have any advice for me on how to achieve my goal if this is not a bug? How shall we proceed with this ticket? Thanks and many greetings Michael digikam version 8.1.0 Images: AVIF: 1 BMP: 6695 EPS: 6 GIF: 23135 ICNS: 6 ICO: 94 JP2: 2 JPEG: 141 JPG: 1339786 KRA: 2 PCX: 956 PNG: 34809 PPM: 26 PSD: 17 RAW-ARW: 8 RAW-CR2: 30737 RAW-CRW: 903 RAW-DNG: 34 RAW-HDR: 1 RAW-RAW: 1 TGA: 157 TIFF: 340 WEBP: 2817 WMF: 6 XCF: 97 XPM: 10 total: 1440787 : Videos: 3GP: 223 AVI: 1591 MOV: 2395 MP4: 5994 MPEG: 55 VOB: 47 WMV: 240 total: 10545 : Audio: AAC: 85 M4A: 150 MP3: 17302 MP4: 3 MPC: 7 OGG: 340 WAV: 1313 WMA: 6 total: 19206 : Total Items: 1470538 Albums: 86485 Tags: 2618 : Database backend: QSQLITE Database Path: F:/digikam_db/ Database locale: UTF-8 @Michael, This problem still reproducible with the new digiKam 8.2.0 pre-release Windows installer available at usual place: https://files.kde.org/digikam/ This new bundle is based on last Qt framework 5.15.11 and KDE framework 5.110. Thanks in advance Gilles Caulier I have tested the current dev version as you suggest. I created two albums, album 1 and 2, and used three images, A, B and C. The albums contained the following constellations: 1: A B B C 2: A B C C The configuration of the search as shown in the screenshot. The result was 3 sets of duplicates. Set 1: Album 1: A (Reference) Album 2: A Set 2: Album 1: B (Reference) B <- I don't want this Album 2: B Set 3: Album 1: C (Reference) Album 2: C C Clicking on "Remove duplicates" will also remove images from album 1. However, my aim is that only images from album 2 are removed. The expected result should be the following 3 sets of duplicates. Set 1: Album 1: A (Reference) Album 2: A Set 2: Album 1: B (Reference) Album 2: B Set 3: Album 1: C (Reference) Album 2: C C Created attachment 163248 [details]
Config for finding duplicates
Hi, The 8.7.0 pre-release Windows installer from today have been rebuilt from scratch with Qt 6.8.3, KDE 6.12, OpenCV 4.11 + CUDA support, Exiv2 0.28.5, ExifTool 13.27, ffmpeg 7, all image codecs updated to last version (jxl, avif, heif, aom, etc.). Please try with this version to see if your problem still reproducible... https://files.kde.org/digikam/ Thanks in advance Best regards Gilles Caulier |