Bug 511838 - Show OCR output as Kdialog box or native window
Summary: Show OCR output as Kdialog box or native window
Status: RESOLVED INTENTIONAL
Alias: None
Product: Spectacle
Classification: Applications
Component: General (other bugs)
Version First Reported In: git-master
Platform: Arch Linux Linux
: NOR minor
Target Milestone: ---
Assignee: Noah Davis
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2025-11-08 18:04 UTC by Sauf
Modified: 2025-11-08 20:23 UTC (History)
2 users (show)

See Also:
Latest Commit:
Version Fixed/Implemented In:
Sentry Crash Report:


Attachments
OCR UX Showcase (I piped the OCR result to KDialog using a bash script) (1.23 MB, video/mp4)
2025-11-08 18:04 UTC, Sauf
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Sauf 2025-11-08 18:04:35 UTC
Created attachment 186623 [details]
OCR UX Showcase (I piped the OCR result to KDialog using a bash script)

SUMMARY
Currently the image OCR text result copies to clipboard, and most of the times requires to open a text editor afterwards. It's extra step. I think showing OCR output as Kdialog box has better UX. refer to video attachment.

Original PR: https://invent.kde.org/plasma/spectacle/-/merge_requests/462
Comment 1 Jhair Paris 2025-11-08 19:00:13 UTC
(In reply to Sauf from comment #0)
> Created attachment 186623 [details]
> OCR UX Showcase (I piped the OCR result to KDialog using a bash script)
> 
> SUMMARY
> Currently the image OCR text result copies to clipboard, and most of the
> times requires to open a text editor afterwards. It's extra step. I think
> showing OCR output as Kdialog box has better UX. refer to video attachment.
> 
> Original PR: https://invent.kde.org/plasma/spectacle/-/merge_requests/462

I'm not so sure it's a better UX.

When you take the screenshot, a system notification appears allowing you to open a .txt file with all the text that was extracted.
Comment 2 Noah Davis 2025-11-08 20:23:43 UTC
While the user experience could be better, I don't think this would make the UX substantially better enough in a particular way to justify the slight inconvenience of having to click more. The linked MR already has a discussion about an alternative UX that I think would genuinely be better, but requires significantly more work to create.