諺文音節是一個位於基本多文種平面的Unicode區塊,包含了現代標準語中使用的預組諺文音節。這些音節可以通過算法直接映射到諺文字母區塊中的兩個或三個字符的序列:

  • 位於U+1100..U+1112的字符:19個現代諺文的初聲輔音字母;
  • 位於U+1161..U+1175的字符:21個現代諺文的元音字母;
  • 無,或位於U+11A8..U+11C2的字符:27個現代諺文的終聲輔音字母。
諺文音節
Hangul Syllables
範圍U+AC00..U+D7AF
(11,184個碼位)
平面基本多文種平面BMP
文字諺文
應用朝鮮語
已分配11,172個碼位
未分配12個保留碼位
來源標準KS X 1001
統一碼版本歷史
2.011,172 (+11,172)
碼表
點擊此處
註釋[1][2]
在1.1版本中,6656個字符編碼於U+3400U+4DFF。但他們在2.0版本時被移至現在的碼位,並額外增加了4516個字符。

本區塊是根據2個或3個諺文字母(上述每個諺文字母子範圍中的一個)组成的每一個音节的規範順序編碼的。

區塊 编辑

諺文音節
Hangul Syllables[1][2]
Unicode Consortium 官方碼表 (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+AC0x
U+AC1x
U+AC2x
U+AC3x 갿
U+AC4x
U+AC5x
U+AC6x
U+AC7x 걿
U+AC8x
U+AC9x
U+ACAx
U+ACBx 겿
U+ACCx
U+ACDx
U+ACEx
U+ACFx 곿
U+AD0x
U+AD1x
U+AD2x
U+AD3x 괿
U+AD4x
U+AD5x
U+AD6x
U+AD7x 굿
U+AD8x
U+AD9x
U+ADAx
U+ADBx 궿
U+ADCx
U+ADDx
U+ADEx
U+ADFx 귿
U+AE0x
U+AE1x
U+AE2x
U+AE3x 긿
U+AE4x
U+AE5x
U+AE6x
U+AE7x 깿
U+AE8x
U+AE9x
U+AEAx
U+AEBx 꺿
U+AECx
U+AEDx
U+AEEx
U+AEFx 껿
U+AF0x
U+AF1x
U+AF2x
U+AF3x 꼿
U+AF4x
U+AF5x
U+AF6x
U+AF7x 꽿
U+AF8x
U+AF9x
U+AFAx
U+AFBx 꾿
U+AFCx
U+AFDx
U+AFEx
U+AFFx 꿿
U+B00x 뀀
U+B01x
U+B02x
U+B03x
U+B04x
U+B05x
U+B06x
U+B07x
U+B08x
U+B09x
U+B0Ax
U+B0Bx
U+B0Cx
U+B0Dx
U+B0Ex
U+B0Fx
U+B10x
U+B11x
U+B12x
U+B13x
U+B14x
U+B15x
U+B16x
U+B17x
U+B18x
U+B19x
U+B1Ax
U+B1Bx
U+B1Cx
U+B1Dx
U+B1Ex
U+B1Fx
U+B20x
U+B21x
U+B22x
U+B23x
U+B24x
U+B25x
U+B26x
U+B27x
U+B28x
U+B29x
U+B2Ax
U+B2Bx
U+B2Cx
U+B2Dx
U+B2Ex
U+B2Fx
U+B30x
U+B31x
U+B32x
U+B33x
U+B34x
U+B35x
U+B36x
U+B37x
U+B38x
U+B39x
U+B3Ax
U+B3Bx
U+B3Cx
U+B3Dx
U+B3Ex
U+B3Fx
U+B40x
U+B41x
U+B42x
U+B43x
U+B44x
U+B45x
U+B46x
U+B47x
U+B48x
U+B49x
U+B4Ax
U+B4Bx
U+B4Cx
U+B4Dx
U+B4Ex
U+B4Fx
U+B50x
U+B51x
U+B52x
U+B53x
U+B54x
U+B55x
U+B56x
U+B57x
U+B58x
U+B59x
U+B5Ax
U+B5Bx
U+B5Cx
U+B5Dx
U+B5Ex
U+B5Fx
U+B60x
U+B61x
U+B62x
U+B63x
U+B64x
U+B65x
U+B66x
U+B67x
U+B68x
U+B69x
U+B6Ax
U+B6Bx
U+B6Cx
U+B6Dx
U+B6Ex
U+B6Fx
U+B70x
U+B71x
U+B72x
U+B73x
U+B74x
U+B75x
U+B76x
U+B77x
U+B78x
U+B79x
U+B7Ax
U+B7Bx
U+B7Cx
U+B7Dx
U+B7Ex
U+B7Fx
U+B80x
U+B81x
U+B82x
U+B83x
U+B84x
U+B85x
U+B86x
U+B87x
U+B88x
U+B89x
U+B8Ax
U+B8Bx
U+B8Cx
U+B8Dx
U+B8Ex
U+B8Fx
U+B90x
U+B91x
U+B92x
U+B93x 뤿
U+B94x
U+B95x
U+B96x
U+B97x 륿
U+B98x
U+B99x
U+B9Ax
U+B9Bx 릿
U+B9Cx
U+B9Dx
U+B9Ex
U+B9Fx 맿
U+BA0x
U+BA1x
U+BA2x
U+BA3x 먿
U+BA4x
U+BA5x
U+BA6x
U+BA7x 멿
U+BA8x
U+BA9x
U+BAAx
U+BABx 몿
U+BACx
U+BADx
U+BAEx
U+BAFx 뫿
U+BB0x
U+BB1x
U+BB2x
U+BB3x 묿
U+BB4x
U+BB5x
U+BB6x
U+BB7x 뭿
U+BB8x
U+BB9x
U+BBAx
U+BBBx 뮿
U+BBCx
U+BBDx
U+BBEx
U+BBFx 믿
U+BC0x
U+BC1x
U+BC2x
U+BC3x 밿
U+BC4x
U+BC5x
U+BC6x
U+BC7x 뱿
U+BC8x
U+BC9x
U+BCAx
U+BCBx 벿
U+BCCx
U+BCDx
U+BCEx
U+BCFx 볿
U+BD0x
U+BD1x
U+BD2x
U+BD3x 봿
U+BD4x
U+BD5x
U+BD6x
U+BD7x 뵿
U+BD8x
U+BD9x
U+BDAx
U+BDBx 붿
U+BDCx
U+BDDx
U+BDEx
U+BDFx 뷿
U+BE0x
U+BE1x
U+BE2x
U+BE3x 븿
U+BE4x
U+BE5x
U+BE6x
U+BE7x 빿
U+BE8x
U+BE9x
U+BEAx
U+BEBx 뺿
U+BECx
U+BEDx
U+BEEx
U+BEFx 뻿
U+BF0x
U+BF1x
U+BF2x
U+BF3x 뼿
U+BF4x
U+BF5x
U+BF6x
U+BF7x 뽿
U+BF8x
U+BF9x
U+BFAx
U+BFBx 뾿
U+BFCx
U+BFDx
U+BFEx
U+BFFx 뿿
U+C00x 쀀
U+C01x
U+C02x
U+C03x
U+C04x
U+C05x
U+C06x
U+C07x
U+C08x
U+C09x
U+C0Ax
U+C0Bx
U+C0Cx
U+C0Dx
U+C0Ex
U+C0Fx
U+C10x
U+C11x
U+C12x
U+C13x
U+C14x
U+C15x
U+C16x
U+C17x
U+C18x
U+C19x
U+C1Ax
U+C1Bx
U+C1Cx
U+C1Dx
U+C1Ex
U+C1Fx
U+C20x
U+C21x
U+C22x
U+C23x
U+C24x
U+C25x
U+C26x
U+C27x
U+C28x
U+C29x
U+C2Ax
U+C2Bx
U+C2Cx
U+C2Dx
U+C2Ex
U+C2Fx
U+C30x
U+C31x
U+C32x
U+C33x
U+C34x
U+C35x
U+C36x
U+C37x
U+C38x
U+C39x
U+C3Ax
U+C3Bx
U+C3Cx
U+C3Dx
U+C3Ex
U+C3Fx
U+C40x
U+C41x
U+C42x
U+C43x
U+C44x
U+C45x
U+C46x
U+C47x
U+C48x
U+C49x
U+C4Ax
U+C4Bx
U+C4Cx
U+C4Dx
U+C4Ex
U+C4Fx
U+C50x
U+C51x
U+C52x
U+C53x
U+C54x
U+C55x
U+C56x
U+C57x
U+C58x
U+C59x
U+C5Ax
U+C5Bx
U+C5Cx
U+C5Dx
U+C5Ex
U+C5Fx
U+C60x
U+C61x
U+C62x
U+C63x
U+C64x
U+C65x
U+C66x
U+C67x
U+C68x
U+C69x
U+C6Ax
U+C6Bx
U+C6Cx
U+C6Dx
U+C6Ex
U+C6Fx
U+C70x
U+C71x
U+C72x
U+C73x
U+C74x
U+C75x
U+C76x
U+C77x
U+C78x
U+C79x
U+C7Ax
U+C7Bx
U+C7Cx
U+C7Dx
U+C7Ex
U+C7Fx
U+C80x
U+C81x
U+C82x
U+C83x
U+C84x
U+C85x
U+C86x
U+C87x
U+C88x
U+C89x
U+C8Ax
U+C8Bx
U+C8Cx
U+C8Dx
U+C8Ex
U+C8Fx
U+C90x
U+C91x
U+C92x
U+C93x 줿
U+C94x
U+C95x
U+C96x
U+C97x 쥿
U+C98x
U+C99x
U+C9Ax
U+C9Bx 즿
U+C9Cx
U+C9Dx
U+C9Ex
U+C9Fx 짿
U+CA0x
U+CA1x
U+CA2x
U+CA3x 쨿
U+CA4x
U+CA5x
U+CA6x
U+CA7x 쩿
U+CA8x
U+CA9x
U+CAAx
U+CABx 쪿
U+CACx
U+CADx
U+CAEx
U+CAFx 쫿
U+CB0x
U+CB1x
U+CB2x
U+CB3x 쬿
U+CB4x
U+CB5x
U+CB6x
U+CB7x 쭿
U+CB8x
U+CB9x
U+CBAx
U+CBBx 쮿
U+CBCx
U+CBDx
U+CBEx
U+CBFx 쯿
U+CC0x
U+CC1x
U+CC2x
U+CC3x 찿
U+CC4x
U+CC5x
U+CC6x
U+CC7x 챿
U+CC8x
U+CC9x
U+CCAx
U+CCBx 첿
U+CCCx
U+CCDx
U+CCEx
U+CCFx 쳿
U+CD0x
U+CD1x
U+CD2x
U+CD3x 촿
U+CD4x
U+CD5x
U+CD6x
U+CD7x 쵿
U+CD8x
U+CD9x
U+CDAx
U+CDBx 춿
U+CDCx
U+CDDx
U+CDEx
U+CDFx 췿
U+CE0x
U+CE1x
U+CE2x
U+CE3x 츿
U+CE4x
U+CE5x
U+CE6x
U+CE7x 칿
U+CE8x
U+CE9x
U+CEAx
U+CEBx 캿
U+CECx
U+CEDx
U+CEEx
U+CEFx 컿
U+CF0x
U+CF1x
U+CF2x
U+CF3x 켿
U+CF4x
U+CF5x
U+CF6x
U+CF7x 콿
U+CF8x
U+CF9x
U+CFAx
U+CFBx 쾿
U+CFCx
U+CFDx
U+CFEx
U+CFFx 쿿
U+D00x 퀀
U+D01x
U+D02x
U+D03x
U+D04x
U+D05x
U+D06x
U+D07x
U+D08x
U+D09x
U+D0Ax
U+D0Bx
U+D0Cx
U+D0Dx
U+D0Ex
U+D0Fx
U+D10x
U+D11x
U+D12x
U+D13x
U+D14x
U+D15x
U+D16x
U+D17x
U+D18x
U+D19x
U+D1Ax
U+D1Bx
U+D1Cx
U+D1Dx
U+D1Ex
U+D1Fx
U+D20x
U+D21x
U+D22x
U+D23x
U+D24x
U+D25x
U+D26x
U+D27x
U+D28x
U+D29x
U+D2Ax
U+D2Bx
U+D2Cx
U+D2Dx
U+D2Ex
U+D2Fx
U+D30x
U+D31x
U+D32x
U+D33x
U+D34x
U+D35x
U+D36x
U+D37x
U+D38x
U+D39x
U+D3Ax
U+D3Bx
U+D3Cx
U+D3Dx
U+D3Ex
U+D3Fx
U+D40x
U+D41x
U+D42x
U+D43x
U+D44x
U+D45x
U+D46x
U+D47x
U+D48x
U+D49x
U+D4Ax
U+D4Bx
U+D4Cx
U+D4Dx
U+D4Ex
U+D4Fx
U+D50x
U+D51x
U+D52x
U+D53x
U+D54x
U+D55x
U+D56x
U+D57x
U+D58x
U+D59x
U+D5Ax
U+D5Bx
U+D5Cx
U+D5Dx
U+D5Ex
U+D5Fx
U+D60x
U+D61x
U+D62x
U+D63x
U+D64x
U+D65x
U+D66x
U+D67x
U+D68x
U+D69x
U+D6Ax
U+D6Bx
U+D6Cx
U+D6Dx
U+D6Ex
U+D6Fx
U+D70x
U+D71x
U+D72x
U+D73x
U+D74x
U+D75x
U+D76x
U+D77x
U+D78x
U+D79x
U+D7Ax
註釋
1.^ 依據 Unicode 14.0
2.^ 灰色區域表示未分配的碼位

歷史 编辑

碼位的移動使在Unicode中編碼諺文音節一事變得複雜了起來:

  • 版本1.0.0時,在U+3400..U+3D2D編碼了2350個來自KS C 5601-1987的現代諺文音節。而碼位U+3400..U+3D2D現在是中日韓統一表意文字擴展區A的一部分。
  • 版本1.1時,在U+3D2E..U+44B7編碼了額外1930個來自KS C 5657-1991的現代諺文音節,在U+44B8..U+44BD編碼了6個來自GB 12052-89的現代諺文音節,在U+44BE..U+4DFF編碼了2370個不在以上標準的的現代諺文音節。這些字符的碼位範圍涵蓋了中日韓統一表意文字擴展區A的剩餘部分及易經六爻符號的全部。
    • 另外,版本1.1尚有3處錯誤:[3]
      • U+384E:삤在Unicode字符數據庫中,但삣在ISO/IEC 10646-1:1993碼表和源標準的映射中。
      • U+40BC:삣在Unicode字符數據庫中,但삤在ISO/IEC 10646-1:1993碼表和源標準的映射中。
      • U+436C:콫在Unicode字符數據庫中,但콪在ISO/IEC 10646-1:1993碼表和源標準的映射中。
  • 版本2.0增加了KS C 5601-1992中剩餘的4516個可能的音節,並將所有已編碼的音節重新排列[4][5]到目前的碼位範圍U+AC00..U+D7AF內,這樣就可以通過算法將諺文音節分解為單個的諺文字母。

RFC 2279解釋說,這一重大的不兼容變化是在假設不存在使用Unicode的諺文數據或軟件的情況下做出的:

“允許這種不兼容變化的官方理由是,不存在包含諺文的應用和數據,這種說法很可能是真的,但仍然無法證實。 這一事件被稱爲‘Korean mess’,相關委員會已承諾永遠不會再做出這種不兼容的變化。” — RFC 2279
"The official justification for allowing such an incompatible change was that no implementations and no data containing Hangul existed, a statement that is likely to be true but remains unprovable. The incident has been dubbed the 'Korean mess', and the relevant committees have pledged to never, ever again make such an incompatible change." — RFC 2279

隨後,Unicode採用了一項編碼穩定性政策,即“一旦一个字符被編碼,它將不會被移動或刪除”。[6]

以下Unicode文檔記錄了定義本區塊中特定字符的目的與過程:

版本 最終碼位[a] 碼位数 UTC ID L2英语International Committee for Information Technology Standards ID WG2英语ISO/IEC JTC 1/SC 2 ID 文檔
2.0 U+AC00..D7A3 11,172 N767页面存档备份,存于互联网档案馆 Ksar, Mike, Unconfirmed minutes WG2-Paris meeting of October 1991, 1991-11-25 
X3L2/93-078 N848 Modified Korean Position, 1992-07-02 
UTC/1994-xxx Discussion on Korean Hangul Proposal, Unicode Technical Committee Meeting #62: Discussion of Korean Hangul Proposal, 1994-09-30 
X3L2/95-031 N1158 Korean National Position for adding Hangul characters, 1995-03-08 
N1170 Canadian Position on Korean Proposal in N 1158 for adding Hangul characters, 1995-03-10 
UTC/1995-021B Aliprand, Joan, Closed Caucus Minutes, UTC #64, 1995-03-10 
N1198 Working Draft for a pDAM to 10646 on Korean Hangul, 1995-04-05 
X3L2/95-053.1 N1199 Background on Korean Coding, 1995-04-30 
N1203页面存档备份,存于互联网档案馆 Umamaheswaran, V. S.; Ksar, Mike, 6.4.1, Unconfirmed minutes of SC2/WG2 Meeting 27, Geneva, 1995-05-03 
X3L2/95-053 N1209 PDAM no. 5 to ISO/IEC 10646-1: Hangul Character Collections, 1995-05-09 
UTC/1995-xxx Discussion on Korean, Unicode Technical Committee Meeting #65, Minutes, 1995-06-02 
X3L2/95-090 N1253 (doc, txt页面存档备份,存于互联网档案馆)) Umamaheswaran, V. S.; Ksar, Mike, 6.3, Unconfirmed Minutes of WG 2 Meeting # 28 in Helsinki, Finland; 1995-06-26--27, 1995-09-09 
N1285 Hangul Syllable Character Name Generation Algorithm, 1995-11-08 
N1303 (html, doc页面存档备份,存于互联网档案馆)) Umamaheswaran, V. S.; Ksar, Mike, 7. Disposition of comments on pDAM-5 - Korean Hangul, Minutes of Meeting 29, Tokyo, 1996-01-26 
N1331 Paterson, Bruce, DAM 5 (Korean Hangul) Submittal to JTC1, 1996-03-14 
N1332 Paterson, Bruce, BMP Revised Layout (DAM 5 diagram attachment), 1996-03-14 
N1391 Paterson, Bruce, Hangul syllable name algorithm, simplified, 1996-05-18 
N1353页面存档备份,存于互联网档案馆 Umamaheswaran, V. S.; Ksar, Mike, 6.1, Draft minutes of WG2 Copenhagen Meeting # 30, 1996-06-25 
N1537 Table of Replies and Feedback on Amendment 5 – Hangul, 1997-01-29 
L2/97-125 N1561 Paterson, Bruce, Draft Report on JTC1 letter ballot on DAM No. 5 to ISO/IEC 10646-1 (Hangul), 1997-05-27 
N1570 Paterson, Bruce, Almost Final Text (pages 2-5 and 182 only) of DAM 5 – Hangul, 1997-06-23 
L2/97-288 N1603页面存档备份,存于互联网档案馆 Umamaheswaran, V. S., 5.3.1, Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June – 4 July 1997, 1997-10-24 
N1806 (pdf, doc页面存档备份,存于互联网档案馆)) Kim, Kyongsok; Paterson, Bruce, Defect Report on AMD 5 - Hangul Syllables with Editor's response, 1998-07-08 
L2/99-022 N1942页面存档备份,存于互联网档案馆 Paterson, Bruce, Hangul syllable name rules, proposed for ISO/IEC 10646 2nd Edition, 1998-12-08 
L2/99-010 N1903 (pdf, html页面存档备份,存于互联网档案馆), doc页面存档备份,存于互联网档案馆)) Umamaheswaran, V. S., 11.1, Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25, 1998-12-30 
L2/99-114 N2018页面存档备份,存于互联网档案馆 Paterson, Bruce, Draft Technical Technical Corrigendum No. 3 to ISO/IEC 10646-1: 1993, 1999-03-31 
L2/99-232 N2003页面存档备份,存于互联网档案馆 Umamaheswaran, V. S., 7.1.2 Hangul syllable name rules, Minutes of WG 2 meeting 36, Fukuoka, Japan, 1999-03-09--15, 1999-08-03 
L2/99-297 N2119页面存档备份,存于互联网档案馆 Disposition of Comments Report on SC 2 N 3306, Draft Technical Corrigendum No. 3 to ISO/IEC 10646-1: 1993, 1999-09-20 
L2/99-298 N2120页面存档备份,存于互联网档案馆 Paterson, Bruce, Final Text for Technical Corrigendum No. 3 to ISO/IEC 10646-1: 1993, 1999-09-21 
L2/03-100 Edberg, Peter, Hangul Mapping Errors, 2002-11-05 
L2/02-463 N2564页面存档备份,存于互联网档案馆 Kim, Kyongsok, 3-way cross-reference tables - KS X 1001, KPS 9566, and UCS, 2002-11-30 
L2/04-392 West, Andrew, Unicode 1 Hangul Mapping Errors, 2004-10-21 
L2/04-361 Moore, Lisa, Properties - Unicode 1 Hangul Mapping Errors, UTC #101 Minutes, 2004-11-23 
L2/17-080 Chung, Jaemin, Informative document about three pre-Unicode-2.0 modern hangul syllables, 2017-03-29 
  1. ^ 建議的碼位和字符名稱可能與最終的結果不同。

另見 编辑

參考資料 编辑

  1. ^ Unicode character database. The Unicode Standard. [2016-07-09]. (原始内容存档于2017-09-25). 
  2. ^ Enumerated Versions of The Unicode Standard. The Unicode Standard. [2016-07-09]. (原始内容存档于2016-06-29). 
  3. ^ Chung, Jaemin. Informative document about three pre-Unicode-2.0 modern hangul syllables (PDF). 2017-03-29 [2022-05-31]. (原始内容 (PDF)存档于2022-04-20). 
  4. ^ Chang, K. D.; Choi, In Sook; Kim, Jung Ho. Korean Hangul Encoding Conversion Table. 1995-10-04 [2022-05-31]. (原始内容存档于2022-04-20). 
  5. ^ Notes and corrections for HANGUL.TXT. 2005-10-13 [2022-05-31]. (原始内容存档于2021-10-06). 
  6. ^ Unicode Character Encoding Stability Policies. Unicode Consortium. 2016-11-14 [2022-05-31]. (原始内容存档于2022-07-14).