Talk:Hiragana (Unicode block)
This article is rated List-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
I think there’s more to the Unicode
[edit]Looking at the Scripts.txt file for Unicode 15.1 it contains the following section:
# ================================================
3041..3096 ; Hiragana # Lo [86] HIRAGANA LETTER SMALL A..HIRAGANA LETTER SMALL KE
309D..309E ; Hiragana # Lm [2] HIRAGANA ITERATION MARK..HIRAGANA VOICED ITERATION MARK
309F ; Hiragana # Lo HIRAGANA DIGRAPH YORI
1B001..1B11F ; Hiragana # Lo [287] HIRAGANA LETTER ARCHAIC YE..HIRAGANA LETTER ARCHAIC WU
1B132 ; Hiragana # Lo HIRAGANA LETTER SMALL KO
1B150..1B152 ; Hiragana # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
1F200 ; Hiragana # So SQUARE HIRAGANA HOKA
# Total code points: 381
Shouldn’t the code points from 1B001 on be accounted for in the article as well? Jens.troeger (talk) 07:36, 22 January 2024 (UTC)
- No. Those code points are in different blocks and this article only covers the Hiragana Unicode block (U+3040..U+309F). The "See also" section of this article points the reader to the other blocks containing the Hiragana characters you mentioned. DRMcCreedy (talk) 17:16, 22 January 2024 (UTC)
QUESTION:
[edit]Is the Hiragana alphabet in UNICODE alphabetical? In other words, if I write a sort routine based on this ordering, will the sort be correct? Sean.Walton
- No. I don't think that will yield the correct results. For example U+304C doesn't seem to go between U+304B and U+304D. See http://www.unicode.org/reports/tr10/ for example. DRMcCreedy (talk) 15:57, 18 May 2024 (UTC)
- Very much not. First, there is an historic alphabetization that is not reflected in the code chart order at all. Second, sokuon can be treated in different ways, depending on the alphabetization expectation of the end user. Third, for compatibility with predecessor standards, dakuten and handakuten forms are representable in two different ways in Unicode, but need to be treated as equivalent in an alphabetization scheme. That having been said, a naïve sort by code point would produce a collation largely in line with a knowledgeable end user's expectation for a reasonable alphabetization. VanIsaac, GHTV contWpWS 16:22, 18 May 2024 (UTC)
- List-Class Computing articles
- Low-importance Computing articles
- All Computing articles
- List-Class Typography articles
- Low-importance Typography articles
- List-Class Writing system articles
- Low-importance Writing system articles
- List-Class Japan-related articles
- Low-importance Japan-related articles
- WikiProject Japan articles