Bug 142625 - Support 'astral' Unicode planes (those beyond codepoint FFFF)
Summary: Support 'astral' Unicode planes (those beyond codepoint FFFF)
Status: CONFIRMED
Alias: None
Product: kcharselect
Classification: Applications
Component: general (show other bugs)
Version: unspecified
Platform: Debian stable Linux
: VHI wishlist
Target Milestone: ---
Assignee: Christoph Feck
URL:
Keywords:
: 207175 308046 327093 327518 332351 364488 381369 425179 426522 428523 445398 472376 (view as bug list)
Depends on:
Blocks:
 
Reported: 2007-03-06 22:55 UTC by John Karp
Modified: 2023-07-19 03:33 UTC (History)
16 users (show)

See Also:
Latest Commit:
Version Fixed In:
Sentry Crash Report:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description John Karp 2007-03-06 22:55:42 UTC
Version:            (using KDE KDE 3.5.5)
Installed from:    Debian stable Packages

KCharSelect only supports 16-bit Unicode codepoints 0000-FFFF, e.g. the basic multilingual plane (BMP); the 'Table' field only accepts numbers 0-255.

But there are thousands of additional codepoints in the 'astral planes', which start at 10000. KCharSelect should allow their selection.
Comment 1 John Karp 2008-08-18 19:37:24 UTC
*** This bug has been confirmed by popular vote. ***
Comment 2 Constantin Berzan 2009-03-02 20:57:01 UTC
QChar treats all characters as 16-bit numbers, making it impossible to use higher planes in today's Qt. I have filled a bug report/suggestion to Qt for this, and will update this bug report as I find out more.
Comment 3 Constantin Berzan 2009-03-04 14:31:22 UTC
Qt Software have opened a task for this suggestion. You can track it at http://www.qtsoftware.com/developer/task-tracker/index_html?id=246840&method=entry
Comment 4 Daniel Laidig 2009-10-27 18:55:30 UTC
*** Bug 207175 has been marked as a duplicate of this bug. ***
Comment 5 Shriramana Sharma 2011-07-13 16:17:20 UTC
Hello -- I believe there is no active bug on Qt corresponding to this bug. 

When I ask the new Qt bugtracker to find old "task tracker bug 246840" (http://bugreports.qt.nokia.com/secure/IssueNavigator.jspa?reset=&customfield_10141=246840) it turns up http://bugreports.qt.nokia.com/browse/QTBUG-3690 which I *don't* think is exactly the cause of this bug.
Comment 6 Bryce 2011-07-13 18:48:13 UTC
This ticket needs to be reassigned: I no longer maintain this module.
Comment 7 Christoph Feck 2012-10-08 08:29:25 UTC
*** Bug 308046 has been marked as a duplicate of this bug. ***
Comment 8 aditsu 2013-09-14 05:08:07 UTC
Yep, the unicode range in kcharselect seems to be limited to 65535 (code point / decimal entity), i.e. 0xffff. If I search for 65536, it gives me the 0x6553 character.
Comment 9 Christoph Feck 2013-11-03 23:45:09 UTC
*** Bug 327093 has been marked as a duplicate of this bug. ***
Comment 10 Christoph Feck 2013-11-12 23:23:10 UTC
*** Bug 327518 has been marked as a duplicate of this bug. ***
Comment 11 Christoph Feck 2014-03-20 22:00:32 UTC
*** Bug 332351 has been marked as a duplicate of this bug. ***
Comment 12 The Lightning Stalker 2015-01-09 20:37:38 UTC
KCharSelect has such great search features, but is so horribly out of date.  Onioncode 8 is going to come out soon with more new 3 and 4-byte characters.  Are you guys just lazy or what?
Comment 13 Salvo "LtWorf" Tomaselli 2015-01-10 14:10:58 UTC
Have you tried submitting a patch?
Comment 14 The Lightning Stalker 2015-01-11 23:01:54 UTC
(In reply to Salvo "LtWorf" Tomaselli from comment #13)
> Have you tried submitting a patch?

Where am I going to get the time for that?
I am a busy man.
Comment 15 Christoph Feck 2015-01-14 23:55:25 UTC
Comment #2 still applies.
Comment 16 Christoph Feck 2016-06-20 03:15:47 UTC
*** Bug 364488 has been marked as a duplicate of this bug. ***
Comment 17 Christoph Feck 2016-07-23 01:53:34 UTC
Git commit 7baa78ca83ab3c47d2f03259959ec25e59b2f711 by Christoph Feck.
Committed on 23/07/2016 at 01:42.
Pushed by cfeck into branch 'master'.

[KCharSelect] Handle characters outside BMP

This is the initial commit to support Unicode code points
outside of QChar range U+0000..U+FFFF. To fix this, multiple
steps are needed:

- a new API, which is not based on QChar, but uses uint
  datatype for code points (provided by this commit),

- a new database, which has information about symbols
  outside the BMP range. This commit uses a temporary
  remapping trick to inject SMP symbols (U+1Fxxx) into
  the Private Use Area until we have a new data base
  format,

- applications that use the new uint based API (not
  provided by this commit). See KF5 Porting Notes for
  porting information.

REVIEW: 120904

M  +-    --    src/kcharselect-data
M  +133  -56   src/kcharselect.cpp
M  +91   -4    src/kcharselect.h
M  +11   -11   src/kcharselect_p.h
M  +128  -72   src/kcharselectdata.cpp
M  +26   -22   src/kcharselectdata_p.h

http://commits.kde.org/kwidgetsaddons/7baa78ca83ab3c47d2f03259959ec25e59b2f711
Comment 18 Christoph Feck 2016-08-18 12:51:40 UTC
Git commit b4b8b5a383768fc8dea8914a98ab2b5a1b66e7bc by Christoph Feck.
Committed on 18/08/2016 at 12:48.
Pushed by cfeck into branch 'master'.

Use new API to allow selecting characters from all planes

REVIEW: 128453

M  +14   -13   kcharselectdia.cc
M  +3    -3    kcharselectdia.h

http://commits.kde.org/kcharselect/b4b8b5a383768fc8dea8914a98ab2b5a1b66e7bc
Comment 19 Christoph Feck 2016-08-18 13:32:22 UTC
For the KDE Applications 16.12 release in December it will be possible to select characters from the following additional blocks with the kcharselect application. To see them, modern fonts need to be installed.

Other applications using the KCharSelect widget (e.g. Calligra, LabPlot, Kexi, Konversation, Krita, KXStitch) need to be updated separately.

U+1F000..U+1F0FF  Mahjong Tiles, Domino Tiles, Playing Cards
U+1F100..U+1F1FF  Enclosed Alphanumeric Supplement
U+1F200..U+1F2FF  Enclosed Ideographic Supplement
U+1F300..U+1F5FF  Miscellaneous Symbols and Pictographs
U+1F600..U+1F64F  Emoticons
U+1F650..U+1F67F  Ornamental Dingbats
U+1F680..U+1F6FF  Transport and Map Symbols
U+1F700..U+1F77F  Alchemical Symbols
U+1F780..U+1F7FF  Geometric Shapes Extended
U+1F800..U+1F8FF  Supplemental Arrows-C
U+1F900..U+1F9FF  Supplemental Symbols and Pictographs

It is possible to use other characters, e.g. U+1D000, by searching for the code point.

Keeping this open until we have a new database format which allows to browse remaining blocks (historical scripts, Han ideograph extensions, etc).

If you have suggestions how to improve browsing (especially Han ideographs, which need to be extended to over 80,000), please write to the kde-utils-devel list, or add a ticket here.
Comment 20 Christoph Feck 2017-06-18 23:58:52 UTC
*** Bug 381369 has been marked as a duplicate of this bug. ***
Comment 21 judy wirkkala 2018-10-02 18:49:28 UTC
No clue what I'm doing ....but I get this error message when trying to upload a jpeg to KXstich program:::

Could not start process Cannot talk to klauncher: The name org.kde.klauncher5 was not provided by any .service files.
Comment 22 Christoph Feck 2020-08-10 08:54:33 UTC
*** Bug 425179 has been marked as a duplicate of this bug. ***
Comment 23 Christoph Feck 2020-09-14 17:56:22 UTC
*** Bug 426522 has been marked as a duplicate of this bug. ***
Comment 24 Christoph Feck 2020-11-01 03:00:18 UTC
*** Bug 428523 has been marked as a duplicate of this bug. ***
Comment 25 Christoph Feck 2021-11-13 03:05:26 UTC
*** Bug 445398 has been marked as a duplicate of this bug. ***
Comment 26 Christoph Feck 2023-07-19 03:20:06 UTC
*** Bug 472376 has been marked as a duplicate of this bug. ***