460320 2022-10-12 16:47:00 +0000 Add OCR feature 2025-11-15 14:53:30 +0000 1 1 2 Applications Spectacle General 20.12.3 Debian stable Linux RESOLVED FIXED https://bugs.kde.org/show_bug.cgi?id=488582 NOR wishlist --- 1 kochnorman dev 4wy78uwh akid.anis799 andrea.ippo christian.tallner dev fanzhuyifan geqch0akc kde madness742 me nate ostap.tymchenko https://invent.kde.org/plasma/spectacle/-/merge_requests/462 25.12.0 0 oldest_to_newest 2160448 0 kochnorman 2022-10-12 16:47:00 +0000 SUMMARY *** It would be great to be able to OCR screenshots directly, so you can copy text from them or make them searchable via pdfgrep. This could be done by Tesseract, enabled only if installed, and would kind of easy to add, but very useful. *** 2193452 1 nate 2023-01-05 18:47:03 +0000 *** Bug 463177 has been marked as a duplicate of this bug. *** 2220141 2 nate 2023-04-03 23:20:24 +0000 *** Bug 467942 has been marked as a duplicate of this bug. *** 2223608 3 nate 2023-04-18 19:14:54 +0000 Skanpage is already using Tesseract for OCR, so that could be a place to look for inspiration. 2225280 4 akid.anis799 2023-04-25 04:17:29 +0000 as a workaround one could try to use this command for OCR with spectacle: For X11: `spectacle --nonotify --region --background -o /proc/selt/fd/1 | tesseract stdin stdout | xclip -in -selection clipboard` For Wayland: `spectacle --nonotify --region --background -o /proc/selt/fd/1 | tesseract stdin stdout | wl-copy` 2244917 5 andrea.ippo 2023-08-07 12:18:29 +0000 Normcap is doing this, although it's a standalone app: https://github.com/dynobo/normcap/ Maybe worth looking into/getting in touch with the DEV. May I add, it would be cool if OCR capabilities weren't limited to Spectacle, but somehow baked-in in some frameworks part, and then be usable by any KDE app that can display images by pressing a button and having the detected text appear as overlay (e.g. gwenview showing a photo of a receipt, okular showing a page that was scanned without OCR, etc). Sounds complex and impacting quite some apps, but would be a wonderful productivity addition (if OCR accuracy is spot-on) 2244995 6 ostap.tymchenko 2023-08-07 18:29:35 +0000 (In reply to andrea.ippo from comment #5) > Normcap is doing this, although it's a standalone app: > https://github.com/dynobo/normcap/ > > Maybe worth looking into/getting in touch with the DEV. > > May I add, it would be cool if OCR capabilities weren't limited to > Spectacle, but somehow baked-in in some frameworks part, and then be usable > by any KDE app that can display images by pressing a button and having the > detected text appear as overlay (e.g. gwenview showing a photo of a receipt, > okular showing a page that was scanned without OCR, etc). > > Sounds complex and impacting quite some apps, but would be a wonderful > productivity addition (if OCR accuracy is spot-on) I dont think it would actually be so hard. Tesseract OCR is both very advanced and open source. All KDE would have to do is to have it preinstalled, and then implement it into the apps. obviously implementing it would be a lot of work but having it be in KDE isnt hard. 2266999 7 akid.anis799 2023-11-19 09:53:19 +0000 New workaround for using ocr with spectacle For X11: spectacle --nonotify --region --background -o /tmp/screenshot.png && tesseract /tmp/screenshot.png stdout | xclip -in -selection clipboard For Wayland: spectacle --nonotify --region --background -o /tmp/screenshot.png && tesseract /tmp/screenshot.png stdout | wl-copy 2312404 8 noahadvs 2024-04-18 01:04:46 +0000 *** Bug 479412 has been marked as a duplicate of this bug. *** 2348739 9 me 2024-08-26 07:49:14 +0000 This may well be out of scope, but i'd also like the OCR result to be saved.. somewhere, preferably in the image, so that i may search for text content of screenshots through dolphin 2448117 10 cherkaba 2025-08-17 15:22:45 +0000 +1 for ocr implementation 2466670 11 dev 2025-10-30 17:03:43 +0000 Hi, take a look at this MR https://invent.kde.org/plasma/spectacle/-/merge_requests/462