Talk:Hiragana (Unicode block)

Computing Low‑importance

	This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.ComputingWikipedia:WikiProject ComputingTemplate:WikiProject ComputingComputing articles
Low	This article has been rated as Low-importance on the project's importance scale.

Typography Low‑importance

	This article is within the scope of WikiProject Typography, a collaborative effort to improve the coverage of articles related to Typography on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.TypographyWikipedia:WikiProject TypographyTemplate:WikiProject TypographyTypography articles
Low	This article has been rated as Low-importance on the importance scale.

Writing systems Low‑importance

	Writing portal This article falls within the scope of WikiProject Writing systems, a WikiProject interested in improving the encyclopaedic coverage and content of articles relating to writing systems on Wikipedia. If you would like to help out, you are welcome to drop by the project page and/or leave a query at the project’s talk page.Writing systemsWikipedia:WikiProject Writing systemsTemplate:WikiProject Writing systemsWriting system articles
Low	This article has been rated as Low-importance on the project's importance scale.

Japan: CJKV Low‑importance

This article is within the scope of WikiProject Japan, a collaborative effort to improve the coverage of Japan-related articles on Wikipedia. If you would like to participate, please visit the project page, where you can join the project, participate in relevant discussions, and see lists of open tasks. Current time in Japan: 07:49, November 23, 2024 (JST, Reiwa 6) (Refresh)JapanWikipedia:WikiProject JapanTemplate:WikiProject JapanJapan-related articles

Low

This article has been rated as Low-importance on the project's importance scale.

This article is supported by the joint CJKV task force.

WikiProject Japan to do list:

Peer review: None
A-class review: None

Featured content candidates –

Articles: None
Pictures: None
Lists: None

Good article nominations: Vinland Saga (TV series), Godzilla Minus One
Add requested images to articles that need them.
Pages for Deletion: Participate in Japan-related deletion discussions.
Improve and expand Japan-related stubs.
Create some requested articles.
Help translate an article from the Japanese Wikipedia into English.
Assess unassessed articles

I think there’s more to the Unicode

Looking at the Scripts.txt file for Unicode 15.1 it contains the following section:

# ================================================

3041..3096 ; Hiragana # Lo [86] HIRAGANA LETTER SMALL A..HIRAGANA LETTER SMALL KE

309D..309E ; Hiragana # Lm [2] HIRAGANA ITERATION MARK..HIRAGANA VOICED ITERATION MARK

309F ; Hiragana # Lo HIRAGANA DIGRAPH YORI

1B001..1B11F ; Hiragana # Lo [287] HIRAGANA LETTER ARCHAIC YE..HIRAGANA LETTER ARCHAIC WU

1B132 ; Hiragana # Lo HIRAGANA LETTER SMALL KO

1B150..1B152 ; Hiragana # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO

1F200 ; Hiragana # So SQUARE HIRAGANA HOKA

# Total code points: 381

Shouldn’t the code points from 1B001 on be accounted for in the article as well? Jens.troeger (talk) 07:36, 22 January 2024 (UTC)[reply]

No. Those code points are in different blocks and this article only covers the Hiragana Unicode block (U+3040..U+309F). The "See also" section of this article points the reader to the other blocks containing the Hiragana characters you mentioned. DRMcCreedy (talk) 17:16, 22 January 2024 (UTC)[reply]

QUESTION:

Is the Hiragana alphabet in UNICODE alphabetical? In other words, if I write a sort routine based on this ordering, will the sort be correct? Sean.Walton

No. I don't think that will yield the correct results. For example U+304C doesn't seem to go between U+304B and U+304D. See http://www.unicode.org/reports/tr10/ for example. DRMcCreedy (talk) 15:57, 18 May 2024 (UTC)[reply]

Very much not. First, there is an historic alphabetization that is not reflected in the code chart order at all. Second, sokuon can be treated in different ways, depending on the alphabetization expectation of the end user. Third, for compatibility with predecessor standards, dakuten and handakuten forms are representable in two different ways in Unicode, but need to be treated as equivalent in an alphabetization scheme. That having been said, a naïve sort by code point would produce a collation largely in line with a knowledgeable end user's expectation for a reasonable alphabetization. Van Isaac, GHTV^cont_WpWS 16:22, 18 May 2024 (UTC)[reply]