Summary: | Objects / Forms / Monuments / Context detection and recognition using Deep Learning | ||
---|---|---|---|
Product: | [Applications] digikam | Reporter: | Daniel <daniel-other+kdebug> |
Component: | Tags-AutoAssignement | Assignee: | Digikam Developers <digikam-bugs-null> |
Status: | RESOLVED FIXED | ||
Severity: | wishlist | CC: | caulier.gilles, citbparpmakajjecpg, fellowsgarden+kde, iwannaberich, minhnghiaduong997, quochungtran1999, tahariel |
Priority: | NOR | ||
Version: | 7.0.0 | ||
Target Milestone: | --- | ||
Platform: | unspecified | ||
OS: | All | ||
Latest Commit: | Version Fixed In: | 8.3.0 |
Description
Daniel
2020-01-31 15:44:07 UTC
Hi, Seriously, you don't found the "People" tab from right sidebar ??? Look also this blog post for release 7.0.0-beta1... https://www.digikam.org/news/2019-12-22-7.0.0-beta1_release_announcement/ In other words, it's already implemented and ready to test : https://files.kde.org/digikam/ Best Gilles Caulier Ok, the file title is not enough explicit... It's about objects and forms detection recognition, not only faces... Sorry for the noise Gilles Caulier (In reply to caulier.gilles from comment #1) > Hi, > > Seriously, you don't found the "People" tab from right sidebar ??? > > Look also this blog post for release 7.0.0-beta1... > > https://www.digikam.org/news/2019-12-22-7.0.0-beta1_release_announcement/ > > In other words, it's already implemented and ready to test : > > https://files.kde.org/digikam/ > > Best > > Gilles Caulier Hey Gilles, Yes, yes, yes I already know that (I reported bug #415782), but this is not what I meant here: What I meant is the following: Assigning tags/description/metadata based on what is recognized in the image: a car/a tree/a table/a kite/whatever without having it trained on your own images. -- Daniel Hi all, I found this project which have been already ported as a Darktable plugin : https://github.com/scheckmedia/photils-dt Photils tool analysis your image in local with a Neural Network and generate a data vector send to a remote web service. You image is not send on the web. The web service return a list of tags as strings which can be used to populate the database... I can create a Photils plugin version for digiKam as a 3rdparty tool. I'm in contact with the Darktable plugin author. Any comments are welcome. Gilles Caulier Nghia, Just look my previous comment #4... Gilles (In reply to caulier.gilles from comment #5) > Nghia, > > Just look my previous comment #4... > > Gilles Hello Gilles, It's a very interesting feature. What is the name of the model used in the data vector generator? The context extraction from photo might be feasible, but at first, we need to scale the implementation of YOLO detection to accelerate the speed of processing or change to another version of SSD-MobileNet. Because the current version of SSD-Mobile used in digikam is only for face detection and it doesn't work really well. Furthermore, context extraction might envoke a Recurrent neural network, if such a pre-trained model exists for downloading, we can do it. Nghia Nghia, I don't know yet which model is used. You can ask directly to the developer on Linkedin: https://www.linkedin.com/in/tobiasscheck/ I'm talking with it by this way... Gilles *** Bug 426003 has been marked as a duplicate of this bug. *** Another source code written in Python to parse collection for object detection and tags image in database accordingly. https://github.com/oliveox/digikam-object-detection-plugin/tree/master/src It's based on Yolo2 model. Gilles Caulier See the advancement of the student project about AI based auto-tags (mostly completed) : https://community.kde.org/GSoc/2023/StatusReports/QuocHungTran# Gilles Caulier Hi, With next digiKam 8.3.0 release, the auto-tags assignment feature have been implemented without using a cloud service. The processing is done in core application with delegate neural network models stored in computer. For more details about auto-tags assignment feature, look on student work report : https://community.kde.org/GSoc/2023/StatusReports/QuocHungTran#Add_Automatic_Tags_Assignment_Tools_and_Improve_Face_Recognition_Engine_for_digiKam Best regards Gilles Caulier |