You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When showing the details of a unicode character it would be useful to show the character classification and official Unicode character name.
Here's an example for a few characters at U+2325
$ utf8ls.pl U+0073 U+2325
s U+73 [LowercaseLetter] LATIN SMALL LETTER S
⌥ U+2325 [OtherSymbol] OPTION KEY
⌦ U+2326 [OtherSymbol] ERASE TO THE RIGHT
⌧ U+2327 [OtherSymbol] X IN A RECTANGLE BOX
⌨ U+2328 [OtherSymbol] KEYBOARD
〈 U+2329 [OpenPunctuation] LEFT-POINTING ANGLE BRACKET
〉 U+232A [ClosePunctuation] RIGHT-POINTING ANGLE BRACKET
You could convert this output into a JSON lookup for each unicode code point to display along with the character.
Yes please, it took me a while to understand what I was seeing with multibyte UTF-8 characters representing a non breakable space, which of course shows up blank currently. It would become evident if it showed instead something like "UTF-8 ' ' : No-Break Space (NBSP)"
When showing the details of a unicode character it would be useful to show the character classification and official Unicode character name.
Here's an example for a few characters at U+2325
You could convert this output into a JSON lookup for each unicode code point to display along with the character.
You can generate a full table in json format with my perl script here:
https://github.com/bcowgill/bsac-linux-cfg/blob/master/bin/utf8ls.pl
or I can provide it if you cannot run perl
The text was updated successfully, but these errors were encountered: