Talk:Hiragana (Unicode block)

From Wikipedia, the free encyclopedia

I think there’s more to the Unicode

Looking at the Scripts.txt file for Unicode 15.1 it contains the following section:

# ================================================

3041..3096  ; Hiragana # Lo [86] HIRAGANA LETTER SMALL A..HIRAGANA LETTER SMALL KE

309D..309E  ; Hiragana # Lm [2] HIRAGANA ITERATION MARK..HIRAGANA VOICED ITERATION MARK

309F  ; Hiragana # Lo HIRAGANA DIGRAPH YORI

1B001..1B11F  ; Hiragana # Lo [287] HIRAGANA LETTER ARCHAIC YE..HIRAGANA LETTER ARCHAIC WU

1B132  ; Hiragana # Lo HIRAGANA LETTER SMALL KO

1B150..1B152  ; Hiragana # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO

1F200  ; Hiragana # So SQUARE HIRAGANA HOKA

# Total code points: 381

Shouldn’t the code points from 1B001 on be accounted for in the article as well? Jens.troeger (talk) 07:36, 22 January 2024 (UTC)

No. Those code points are in different blocks and this article only covers the Hiragana Unicode block (U+3040..U+309F). The "See also" section of this article points the reader to the other blocks containing the Hiragana characters you mentioned. DRMcCreedy (talk) 17:16, 22 January 2024 (UTC)
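
To make the script-versus-block distinction concrete, here is a minimal sketch in Python. The ranges are copied from the Scripts.txt excerpt quoted above and the block boundaries from the reply; the helper names (`count_code_points`, `split_by_block`) are made up for illustration and do not come from any Unicode tool.

```python
# Ranges copied from the Scripts.txt excerpt quoted above (Unicode 15.1).
HIRAGANA_SCRIPT_RANGES = [
    (0x3041, 0x3096),
    (0x309D, 0x309E),
    (0x309F, 0x309F),
    (0x1B001, 0x1B11F),
    (0x1B132, 0x1B132),
    (0x1B150, 0x1B152),
    (0x1F200, 0x1F200),
]

# The Hiragana block as given in the reply: U+3040..U+309F.
BLOCK_START, BLOCK_END = 0x3040, 0x309F


def count_code_points(ranges):
    """Count the code points covered by a list of inclusive (start, end) ranges."""
    return sum(end - start + 1 for start, end in ranges)


def split_by_block(ranges):
    """Split ranges into those fully inside the Hiragana block and the rest."""
    inside, outside = [], []
    for start, end in ranges:
        if BLOCK_START <= start and end <= BLOCK_END:
            inside.append((start, end))
        else:
            outside.append((start, end))
    return inside, outside


inside, outside = split_by_block(HIRAGANA_SCRIPT_RANGES)
total = count_code_points(HIRAGANA_SCRIPT_RANGES)
in_block = count_code_points(inside)
out_block = count_code_points(outside)

print(f"script total: {total}, in Hiragana block: {in_block}, elsewhere: {out_block}")
```

Under these assumptions, 89 of the 381 Hiragana-script code points sit in the Hiragana block, and the remaining 292 belong to other blocks.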

QUESTION:

Is the Hiragana alphabet in Unicode alphabetical? In other words, if I write a sort routine based on this ordering, will the sort be correct? Sean.Walton

No, I don't think that will yield correct results. For example, U+304C doesn't seem to go between U+304B and U+304D. See http://www.unicode.org/reports/tr10/ for details. DRMcCreedy (talk) 15:57, 18 May 2024 (UTC)
Very much not. First, there is a historic alphabetization that is not reflected in the code chart order at all. Second, sokuon can be treated in different ways, depending on the alphabetization the end user expects. Third, for compatibility with predecessor standards, dakuten and handakuten forms can be represented in two different ways in Unicode, but they need to be treated as equivalent in an alphabetization scheme. That said, a naïve sort by code point would produce a collation largely in line with a knowledgeable end user's expectation for a reasonable alphabetization. VanIsaac, GHTV contWpWS 16:22, 18 May 2024 (UTC)
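
The difference between code point order and a dakuten-aware order can be illustrated with a rough Python sketch using only the standard `unicodedata` module. The function `collation_key` is a hypothetical helper written for this illustration, not an implementation of the Unicode Collation Algorithm from the report linked above: it only normalizes the two dakuten representations and compares base letters before voicing marks. It deliberately ignores sokuon, small kana, iteration marks, and the historic ordering.

```python
import unicodedata


def collation_key(text):
    """Illustrative sort key: base kana first, voicing marks as a tiebreaker.

    This is an assumption-laden approximation, not the Unicode Collation
    Algorithm.
    """
    # Unify precomposed and combining-mark spellings (e.g. U+304C vs U+304B U+3099).
    composed = unicodedata.normalize("NFC", text)
    # Decompose, then drop combining marks to obtain the unvoiced base letters.
    decomposed = unicodedata.normalize("NFD", composed)
    base = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return (base, composed)


words = ["かん", "がく", "かく"]  # kan, gaku, kaku

by_code_point = sorted(words)
by_base_letter = sorted(words, key=collation_key)

print("code point order: ", by_code_point)
print("base-letter order:", by_base_letter)

# The same word spelled with a combining dakuten gets the same key.
precomposed = "がく"
with_combining_mark = "か\u3099く"
print(collation_key(precomposed) == collation_key(with_combining_mark))
```

In this small sample the code point sort places がく after かん, while the base-letter key places it directly after かく. For production use, a library that implements the Unicode Collation Algorithm with Japanese tailoring is likely the safer choice.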