Template:ISO 15924 script codes and related Unicode data
ISO 15924 | Script in Unicode[e] | ||||||
---|---|---|---|---|---|---|---|
Code | ISO formal name | Directionality | Unicode Alias[f] | Version | Characters | Notes | Description |
| |||||||
Adlm | Adlam | Adlam | 9.0 | 88 | Ch 19.9 | ||
Afak | Afaka | varies | [i] | — Not in Unicode, proposal is explored||||
Aghb | Caucasian Albanian | Caucasian Albanian | 7.0 | 53 | Ancient/historic | Ch 8.11 | |
Ahom | Ahom, Tai Ahom | Ahom | 8.0 | 65 | Ancient/historic | Ch 15.15 | |
Arab | Arabic | Arabic | 1.0 | 1,365 | Ch 9.2 | ||
Aran | Arabic (Nastaliq variant) | mixed | § Arab) | — Typographic variant of Arabic (||||
Armi | Imperial Aramaic | Imperial Aramaic | 5.2 | 31 | Ancient/historic | Ch 10.4 | |
Armn | Armenian | Armenian | 1.0 | 96 | Ch 7.6 | ||
Avst | Avestan | Avestan | 5.2 | 61 | Ancient/historic | Ch 10.7 | |
Bali | Balinese | Balinese | 5.0 | 124 | Ch 17.3 | ||
Bamu | Bamum | Bamum | 5.2 | 657 | Ch 19.6 | ||
Bass | Bassa Vah | Bassa Vah | 7.0 | 36 | Ancient/historic | Ch 19.7 | |
Batk | Batak | Batak | 6.0 | 56 | Ch 17.6 | ||
Beng | Bengali (Bangla) | Bengali | 1.0 | 96 | Ch 12.2 | ||
Bhks | Bhaiksuki | Bhaiksuki | 9.0 | 97 | Ancient/historic | Ch 14.3 | |
Blis | Blissymbols | varies | [i] | — Not in Unicode, proposal is explored||||
Bopo | Bopomofo | Bopomofo | 1.0 | 77 | Ch 18.3 | ||
Brah | Brahmi | Brahmi | 6.0 | 115 | Ancient/historic | Ch 14.1 | |
Brai | Braille | Braille | 3.0 | 256 | Ch 21.1 | ||
Bugi | Buginese | Buginese | 4.1 | 30 | Ch 17.2 | ||
Buhd | Buhid | Buhid | 3.2 | 20 | Ch 17.1 | ||
Cakm | Chakma | Chakma | 6.1 | 71 | Ch 13.11 | ||
Cans | Unified Canadian Aboriginal Syllabics | Canadian Aboriginal | 3.0 | 726 | Ch 20.2 | ||
Cari | Carian | Carian | 5.1 | 49 | Ancient/historic | Ch 8.5 | |
Cham | Cham | Cham | 5.1 | 83 | Ch 16.10 | ||
Cher | Cherokee | Cherokee | 3.0 | 172 | Ch 20.1 | ||
Chrs | Chorasmian | Chorasmian | 13.0 | 28 | Ancient/historic | Ch 10.8 | |
Cirt | Cirth | varies | — Not in Unicode | ||||
Copt | Coptic | Coptic | 1.0 | 137 | Ancient/historic, Disunified from Greek in 4.1 | Ch 7.3 | |
Cpmn | Cypro-Minoan | left-to-right | Cypro Minoan | 14.0 | 99 | Ancient/historic | Ch 8.4 |
Cprt | Cypriot syllabary | Cypriot | 4.0 | 55 | Ancient/historic | Ch 8.3 | |
Cyrl | Cyrillic | Cyrillic | 1.0 | 443 | Includes typographic variant Old Church Slavonic (§ Cyrs) | Ch 7.4 | |
Cyrs | Cyrillic (Old Church Slavonic variant) | varies | § Cyrl) | — Typographic variant of Cyrillic (Ancient/historic | |||
Deva | Devanagari (Nagari) | Devanagari | 1.0 | 154 | Ch 12.1 | ||
Diak | Dives Akuru | Dives Akuru | 13.0 | 72 | Ancient/historic | Ch 15.14 | |
Dogr | Dogra | Dogra | 11.0 | 60 | Ancient/historic | Ch 15.17 | |
Dsrt | Deseret (Mormon) | Deseret | 3.1 | 80 | Ch 20.4 | ||
Dupl | Duployan shorthand, Duployan stenography | Duployan | 7.0 | 143 | Ch 21.6 | ||
Egyd | Egyptian demotic | mixed | — Not in Unicode | ||||
Egyh | Egyptian hieratic | mixed | — Not in Unicode | ||||
Egyp | Egyptian hieroglyphs | Egyptian Hieroglyphs | 5.2 | 1,080 | Ancient/historic | Ch 11.4 | |
Elba | Elbasan | Elbasan | 7.0 | 40 | Ancient/historic | Ch 8.10 | |
Elym | Elymaic | Elymaic | 12.0 | 23 | Ancient/historic | Ch 10.9 | |
Ethi | Ethiopic (Geʻez) | Ethiopic | 3.0 | 523 | Ch 19.1 | ||
Geok | Khutsuri (Asomtavruli and Nuskhuri) | Georgian | Unicode groups "Khutsori", "Asomtavruli" and "Nuskhuri" into 'Georgian' (§ Geok). Also "Mkhedruli" and "Mtavruli" are 'Georgian' (§ Geor) | Ch 7.7 | |||
Geor | Georgian (Mkhedruli and Mtavruli) | Georgian | 1.0 | 173 | In Unicode, also includes Geok (Nuskhuri) | Ch 7.7 | |
Glag | Glagolitic | Glagolitic | 4.1 | 134 | Ancient/historic | Ch 7.5 | |
Gong | Gunjala Gondi | Gunjala Gondi | 11.0 | 63 | Ch 13.15 | ||
Gonm | Masaram Gondi | Masaram Gondi | 10.0 | 75 | Ch 13.14 | ||
Goth | Gothic | Gothic | 3.1 | 27 | Ancient/historic | Ch 8.9 | |
Gran | Grantha | Grantha | 7.0 | 85 | Ancient/historic | Ch 15.13 | |
Grek | Greek | Greek | 1.0 | 518 | Directionality sometimes as boustrophedon | Ch 7.2 | |
Gujr | Gujarati | Gujarati | 1.0 | 91 | Ch 12.4 | ||
Guru | Gurmukhi | Gurmukhi | 1.0 | 80 | Ch 12.3 | ||
Hanb | Han with Bopomofo (alias for Han + Bopomofo) | mixed | § Hani, § Bopo | — See||||
Hang | Hangul (Hangŭl, Hangeul) | Hangul | 1.0 | 11,739 | Hangul syllables relocated in 2.0 | Ch 18.6 | |
Hani | Han (Hanzi, Kanji, Hanja) | top-to-bottom, columns right-to-left (historically) | Han | 1.0 | 94,215 | Ch 18.1 | |
Hano | Hanunoo (Hanunóo) | Hanunoo | 3.2 | 21 | Ch 17.1 | ||
Hans | Han (Simplified variant) | varies | § Hani) | — Subset of Han (Hanzi, Kanji, Hanja) (||||
Hant | Han (Traditional variant) | varies | § Hani | — Subset of||||
Hatr | Hatran | Hatran | 8.0 | 26 | Ancient/historic | Ch 10.12 | |
Hebr | Hebrew | Hebrew | 1.0 | 134 | Ch 9.1 | ||
Hira | Hiragana | Hiragana | 1.0 | 380 | Ch 18.4 | ||
Hluw | Anatolian Hieroglyphs (Luwian Hieroglyphs, Hittite Hieroglyphs) | Anatolian Hieroglyphs | 8.0 | 583 | Ancient/historic | Ch 11.6 | |
Hmng | Pahawh Hmong | Pahawh Hmong | 7.0 | 127 | Ch 16.11 | ||
Hmnp | Nyiakeng Puachue Hmong | Nyiakeng Puachue Hmong | 12.0 | 71 | Ch 16.12 | ||
Hrkt | Japanese syllabaries (alias for Hiragana + Katakana) | Katakana or Hiragana | See § Hira, § Kana | Ch 18.4 | |||
Hung | Old Hungarian (Hungarian Runic) | Old Hungarian | 8.0 | 108 | Ancient/historic | Ch 8.8 | |
Inds | Indus (Harappan) | mixed | [i] | — Not in Unicode, proposal is explored||||
Ital | Old Italic (Etruscan, Oscan, etc.) | Old Italic | 3.1 | 39 | Ancient/historic | Ch 8.6 | |
Jamo | Jamo (alias for Jamo subset of Hangul) | varies | § Hang | — Subset of||||
Java | Javanese | Javanese | 5.2 | 90 | Ch 17.4 | ||
Jpan | Japanese (alias for Han + Hiragana + Katakana) | varies | § Hani, § Hira and § Kana | — See||||
Jurc | Jurchen | left-to-right | — Not in Unicode | ||||
Kali | Kayah Li | Kayah Li | 5.1 | 47 | Ch 16.9 | ||
Kana | Katakana | Katakana | 1.0 | 320 | Ch 18.4 | ||
Kawi | Kawi | left-to-right | — Not in Unicode | ||||
Khar | Kharoshthi | Kharoshthi | 4.1 | 68 | Ancient/historic | Ch 14.2 | |
Khmr | Khmer | Khmer | 3.0 | 146 | Ch 16.4 | ||
Khoj | Khojki | Khojki | 7.0 | 62 | Ancient/historic | Ch 15.7 | |
Kitl | Khitan large script | left-to-right | — Not in Unicode | ||||
Kits | Khitan small script | Khitan Small Script | 13.0 | 471 | Ancient/historic | Ch 18.12 | |
Knda | Kannada | Kannada | 1.0 | 90 | Ch 12.8 | ||
Kore | Korean (alias for Hangul + Han) | left-to-right | § Hani, § Hang | — See||||
Kpel | Kpelle | left-to-right | [i] | — Not in Unicode, proposal is explored||||
Kthi | Kaithi | Kaithi | 5.2 | 68 | Ancient/historic | Ch 15.2 | |
Lana | Tai Tham (Lanna) | Tai Tham | 5.2 | 127 | Ch 16.7 | ||
Laoo | Lao | Lao | 1.0 | 82 | Ch 16.2 | ||
Latf | Latin (Fraktur variant) | varies | § Latn) | — Typographic variant of Latin (||||
Latg | Latin (Gaelic variant) | left-to-right | § Latn) | — Typographic variant of Latin (||||
Latn | Latin | Latin | 1.0 | 1,475 | See also: Latin script in Unicode | Ch 7.1 | |
Leke | Leke | left-to-right | — Not in Unicode | ||||
Lepc | Lepcha (Róng) | Lepcha | 5.1 | 74 | Ch 13.12 | ||
Limb | Limbu | Limbu | 4.0 | 68 | Ch 13.6 | ||
Lina | Linear A | Linear A | 7.0 | 341 | Ancient/historic | Ch 8.1 | |
Linb | Linear B | Linear B | 4.0 | 211 | Ancient/historic | Ch 8.2 | |
Lisu | Lisu (Fraser) | Lisu | 5.2 | 49 | Ch 18.9 | ||
Loma | Loma | left-to-right | [i] | — Not in Unicode, proposal is explored||||
Lyci | Lycian | Lycian | 5.1 | 29 | Ancient/historic | Ch 8.5 | |
Lydi | Lydian | Lydian | 5.1 | 27 | Ancient/historic | Ch 8.5 | |
Mahj | Mahajani | Mahajani | 7.0 | 39 | Ancient/historic | Ch 15.6 | |
Maka | Makasar | Makasar | 11.0 | 25 | Ancient/historic | Ch 17.8 | |
Mand | Mandaic, Mandaean | Mandaic | 6.0 | 29 | Ch 9.5 | ||
Mani | Manichaean | Manichaean | 7.0 | 51 | Ancient/historic | Ch 10.5 | |
Marc | Marchen | Marchen | 9.0 | 68 | Ancient/historic | Ch 14.5 | |
Maya | Mayan hieroglyphs | mixed | — Not in Unicode | ||||
Medf | Medefaidrin (Oberi Okaime, Oberi Ɔkaimɛ) | Medefaidrin | 11.0 | 91 | Ch 19.10 | ||
Mend | Mende Kikakui | Mende Kikakui | 7.0 | 213 | Ch 19.8 | ||
Merc | Meroitic Cursive | Meroitic Cursive | 6.1 | 90 | Ancient/historic | Ch 11.5 | |
Mero | Meroitic Hieroglyphs | Meroitic Hieroglyphs | 6.1 | 32 | Ancient/historic | Ch 11.5 | |
Mlym | Malayalam | Malayalam | 1.0 | 118 | Ch 12.9 | ||
Modi | Modi, Moḍī | Modi | 7.0 | 79 | Ancient/historic | Ch 15.11 | |
Mong | Mongolian | Mongolian | 3.0 | 168 | Mong includes Clear and Manchu scripts | Ch 13.5 | |
Moon | Moon (Moon code, Moon script, Moon type) | mixed | [i] | — Not in Unicode, proposal is explored||||
Mroo | Mro, Mru | Mro | 7.0 | 43 | Ch 13.8 | ||
Mtei | Meitei Mayek (Meithei, Meetei) | Meetei Mayek | 5.2 | 79 | Ch 13.7 | ||
Mult | Multani | Multani | 8.0 | 38 | Ancient/historic | Ch 15.9 | |
Mymr | Myanmar (Burmese) | Myanmar | 3.0 | 223 | Ch 16.3 | ||
Nagm | Nag Mundari | left-to-right | — Not in Unicode | ||||
Nand | Nandinagari | Nandinagari | 12.0 | 65 | Ancient/historic | Ch 15.12 | |
Narb | Old North Arabian (Ancient North Arabian) | Old North Arabian | 7.0 | 32 | Ancient/historic | Ch 10.1 | |
Nbat | Nabataean | Nabataean | 7.0 | 40 | Ancient/historic | Ch 10.10 | |
Newa | Newa, Newar, Newari, Nepāla lipi | Newa | 9.0 | 97 | Ch 13.3 | ||
Nkdb | Naxi Dongba (na²¹ɕi³³ to³³ba²¹, Nakhi Tomba) | left-to-right | — Not in Unicode | ||||
Nkgb | Nakhi Geba (na²¹ɕi³³ gʌ²¹ba²¹, 'Na-'Khi ²Ggŏ-¹baw, Nakhi Geba) | left-to-right | [i] | — Not in Unicode, proposal is explored||||
Nkoo | N’Ko | NKo | 5.0 | 62 | Ch 19.4 | ||
Nshu | Nüshu | Nushu | 10.0 | 397 | Ch 18.8 | ||
Ogam | Ogham | Ogham | 3.0 | 29 | Ancient/historic | Ch 8.14 | |
Olck | Ol Chiki (Ol Cemet’, Ol, Santali) | Ol Chiki | 5.1 | 48 | Ch 13.10 | ||
Orkh | Old Turkic, Orkhon Runic | Old Turkic | 5.2 | 73 | Ancient/historic | Ch 14.8 | |
Orya | Oriya (Odia) | Oriya | 1.0 | 91 | Ch 12.5 | ||
Osge | Osage | Osage | 9.0 | 72 | Ch 20.3 | ||
Osma | Osmanya | Osmanya | 4.0 | 40 | Ch 19.2 | ||
Ougr | Old Uyghur | mixed | Old Uyghur | 14.0 | 26 | Ancient/historic | Ch 14.11 |
Palm | Palmyrene | Palmyrene | 7.0 | 32 | Ancient/historic | Ch 10.11 | |
Pauc | Pau Cin Hau | Pau Cin Hau | 7.0 | 57 | Ch 16.13 | ||
Pcun | Proto-Cuneiform | left-to-right | — Not in Unicode | ||||
Pelm | Proto-Elamite | left-to-right | — Not in Unicode | ||||
Perm | Old Permic | Old Permic | 7.0 | 43 | Ancient/historic | Ch 8.13 | |
Phag | Phags-pa | Phags-pa | 5.0 | 56 | Ancient/historic | Ch 14.4 | |
Phli | Inscriptional Pahlavi | Inscriptional Pahlavi | 5.2 | 27 | Ancient/historic | Ch 10.6 | |
Phlp | Psalter Pahlavi | Psalter Pahlavi | 7.0 | 29 | Ancient/historic | Ch 10.6 | |
Phlv | Book Pahlavi | mixed | — Not in Unicode | ||||
Phnx | Phoenician | Phoenician | 5.0 | 29 | Ancient/historic[g] | Ch 10.3 | |
Piqd | Klingon (KLI pIqaD) | [ii][iii] | — Rejected for inclusion in Unicode|||||
Plrd | Miao (Pollard) | Miao | 6.1 | 149 | Ch 18.10 | ||
Prti | Inscriptional Parthian | Inscriptional Parthian | 5.2 | 30 | Ancient/historic | Ch 10.6 | |
Psin | Proto-Sinaitic | mixed | — Not in Unicode | ||||
Qaaa-Qabx | Reserved for private use (range) | — Not in Unicode | |||||
Ranj | Ranjana | left-to-right | — Not in Unicode | ||||
Rjng | Rejang (Redjang, Kaganga) | Rejang | 5.1 | 37 | Ch 17.5 | ||
Rohg | Hanifi Rohingya | Hanifi Rohingya | 11.0 | 50 | Ch 16.14 | ||
Roro | Rongorongo | mixed | [i] | — Not in Unicode, proposal is explored||||
Runr | Runic | Runic | 3.0 | 86 | Ancient/historic | Ch 8.7 | |
Samr | Samaritan | Samaritan | 5.2 | 61 | Ch 9.4 | ||
Sara | Sarati | mixed | — Not in Unicode | ||||
Sarb | Old South Arabian | Old South Arabian | 5.2 | 32 | Ancient/historic | Ch 10.2 | |
Saur | Saurashtra | Saurashtra | 5.1 | 82 | Ch 13.13 | ||
Sgnw | SignWriting | SignWriting | 8.0 | 672 | Ch 21.7 | ||
Shaw | Shavian (Shaw) | Shavian | 4.0 | 48 | Ch 8.15 | ||
Shrd | Sharada, Śāradā | Sharada | 6.1 | 96 | Ch 15.3 | ||
Shui | Shuishu | left-to-right | — Not in Unicode | ||||
Sidd | Siddham, Siddhaṃ, Siddhamātṛkā | Siddham | 7.0 | 92 | Ancient/historic | Ch 15.5 | |
Sind | Khudawadi, Sindhi | Khudawadi | 7.0 | 69 | Ch 15.8 | ||
Sinh | Sinhala | Sinhala | 3.0 | 111 | Ch 13.2 | ||
Sogd | Sogdian | Sogdian | 11.0 | 42 | Ancient/historic | Ch 14.10 | |
Sogo | Old Sogdian | Old Sogdian | 11.0 | 40 | Ancient/historic | Ch 14.9 | |
Sora | Sora Sompeng | Sora Sompeng | 6.1 | 35 | Ch 15.16 | ||
Soyo | Soyombo | Soyombo | 10.0 | 83 | Ancient/historic | Ch 14.7 | |
Sund | Sundanese | Sundanese | 5.1 | 72 | Ch 17.7 | ||
Sunu | Sunuwar | left-to-right | — Not in Unicode | ||||
Sylo | Syloti Nagri | Syloti Nagri | 4.1 | 45 | Ancient/historic | Ch 15.1 | |
Syrc | Syriac | Syriac | 3.0 | 88 | Includes typographic variants Estrangelo (§ Syre), Western (§ Syrj), and Eastern (§ Syrn) | Ch 9.3 | |
Syre | Syriac (Estrangelo variant) | mixed | § Syrc) | — Typographic variant of Syriac (||||
Syrj | Syriac (Western variant) | mixed | § Syrc) | — Typographic variant of Syriac (||||
Syrn | Syriac (Eastern variant) | mixed | § Syrc) | — Typographic variant of Syriac (||||
Tagb | Tagbanwa | Tagbanwa | 3.2 | 18 | Ch 17.1 | ||
Takr | Takri, Ṭākrī, Ṭāṅkrī | Takri | 6.1 | 68 | Ch 15.4 | ||
Tale | Tai Le | Tai Le | 4.0 | 35 | Ch 16.5 | ||
Talu | New Tai Lue | New Tai Lue | 4.1 | 83 | Ch 16.6 | ||
Taml | Tamil | Tamil | 1.0 | 123 | Ch 12.6 | ||
Tang | Tangut | Tangut | 9.0 | 6,914 | Ancient/historic | Ch 18.11 | |
Tavt | Tai Viet | Tai Viet | 5.2 | 72 | Ch 16.8 | ||
Telu | Telugu | Telugu | 1.0 | 100 | Ch 12.7 | ||
Teng | Tengwar | left-to-right | — Not in Unicode | ||||
Tfng | Tifinagh (Berber) | Tifinagh | 4.1 | 59 | Ch 19.3 | ||
Tglg | Tagalog (Baybayin, Alibata) | Tagalog | 3.2 | 23 | Ch 17.1 | ||
Thaa | Thaana | Thaana | 3.0 | 50 | Ch 13.1 | ||
Thai | Thai | Thai | 1.0 | 86 | Ch 16.1 | ||
Tibt | Tibetan | Tibetan | 2.0 | 207 | Added in 1.0, removed in 1.1 and reintroduced in 2.0 | Ch 13.4 | |
Tirh | Tirhuta | Tirhuta | 7.0 | 82 | Ch 15.10 | ||
Tnsa | Tangsa | left-to-right | Tangsa | 14.0 | 89 | Ch 13.18 | |
Toto | Toto | left-to-right | Toto | 14.0 | 31 | Ch 13.17 | |
Ugar | Ugaritic | Ugaritic | 4.0 | 31 | Ancient/historic | Ch 11.2 | |
Vaii | Vai | Vai | 5.1 | 300 | Ch 19.5 | ||
Visp | Visible Speech | left-to-right | — Not in Unicode | ||||
Vith | Vithkuqi | left-to-right | Vithkuqi | 14.0 | 70 | Ancient/historic | Ch 8.12 |
Wara | Warang Citi (Varang Kshiti) | Warang Citi | 7.0 | 84 | Ch 13.9 | ||
Wcho | Wancho | Wancho | 12.0 | 59 | Ch 13.16 | ||
Wole | Woleai | mixed | [i] | — Not in Unicode, proposal is explored||||
Xpeo | Old Persian | Old Persian | 4.1 | 50 | Ancient/historic | Ch 11.3 | |
Xsux | Cuneiform, Sumero-Akkadian | Cuneiform | 5.0 | 1,234 | Ancient/historic | Ch 11.1 | |
Yezi | Yezidi | Yezidi | 13.0 | 47 | Ancient/historic | Ch 9.6 | |
Yiii | Yi | Yi | 3.0 | 1,220 | Ch 18.7 | ||
Zanb | Zanabazar Square (Zanabazarin Dörböljin Useg, Xewtee Dörböljin Bicig, Horizontal Square Script) | Zanabazar Square | 10.0 | 72 | Ancient/historic | Ch 14.6 | |
Zinh | Code for inherited script | Inherited | 657 | ||||
Zmth | Mathematical notation | — Not a 'script' in Unicode | |||||
Zsym | Symbols | — Not a 'script' in Unicode | |||||
Zsye | Symbols (emoji variant) | — Not a 'script' in Unicode | |||||
Zxxx | Code for unwritten documents | — Not a 'script' in Unicode | |||||
Zyyy | Code for undetermined script | Common | 8,252 | ||||
Zzzz | Code for uncoded script | Unknown | 969,350 | In Unicode: All other code points | |||
Notes
| |||||||
References
|
This documentation is shared between templates {{Unicode blocks}} and {{ISO 15924 script codes and related Unicode data}}.
Usage
The template can be used as usual. It is not a navigation box, so it can be everywhere in an article. The notes are contained within the template, and will not appear in the main References part.
- Note: when resolving red links or wrong links, edit
{{ISO 15924/wp-article}}
. That is where the connection between ISO code and a Wikipedia article is made.
ISO 15924 templates
Item | In template | /subs | Content | Example | Usage in page | Publisher | Note | ||||
---|---|---|---|---|---|---|---|---|---|---|---|
Code (ISO) | {{ISO 15924 code}} | /subpages | ID | Arab | Everywhere | ISO 15924 | enwiki central ISO script id list | ||||
Alias (Unicode) | {{ISO 15924 alias}} | /subpages | ID | Arabic | Unicode | ||||||
Article (enwiki) | {{ISO 15924/wp-article}} | /subpages | ID | [[Arabic script]] | enwiki | ||||||
QID (wikidata) | {{ISO 15924/qid}} | /subpages | ID | Q790681 | Wikidata | ||||||
Number | {{ISO 15924 number}} | /subpages | (ID) | 234 | ISO 15924 | ISO number not used internally (between templates), see Code | |||||
Name | {{ISO 15924 name}} | /subpages | data | Deseret (Mormon) | ISO 15924 | ||||||
Unicode chapter | {{ISO 15924/unicode-chapter}} | /subpages | data | Ch 18.1 | Unicode | Operational | |||||
Script example
character |
{{ISO 15924/script-example-character}} | /subpages | data | ع | User boxes | enwiki | |||||
In Mainspace | |||||||||||
Overview
|
{{ISO 15924 script codes and related Unicode data}} | /subpages | list | ISO 15924 | enwiki | Mainspace: ISO 15924, Script (Unicode), Unicode character property | |||||
Blocks ⇄ Scripts | {{Unicode blocks}} | /subpages | list | some script articles | enwiki | Mainspace, related | |||||
graphs | {{ISO 15924/unicode-script-illustration}} | /subpages | fonts&files | Mainspace, Scripts in Unicode | |||||||
Overviews | |||||||||||
4 ID's:
|
{{ISO 15924/overview-4id}} | /subpages | 4 ID's, related | check consistency | Wikidata | ||||||
Wikidata props | {{ISO 15924/overview-4id/properties}} | /subpages | data | dir | doc, more WD props | Wikidata | |||||
description | {{ISO 15924/overview-4id-relations}} | /subpages | overview | enwiki | 4 ID's: ISO-U+-WD-enwiki | ||||||
Overview: templates | {{ISO 15924/overview-templates}} | /subpages | list | Wikipedia | |||||||
WP-category | {{ISO 15924/wp-category}} | /subpages | data | Category:Arabic script | Not checked for mainspace | enwiki | watered down concept for minor scripts | ||||
Also (doc, userbox, technical, ...) | |||||||||||
Documentation | {{ISO 15924 templates/doc}} | /subpages | prime documentation | Latin script in Unicode (~) | Reused in multiple templates | ||||||
Unicode Latn links | {{ISO 15924/overview-Latn in Unicode}} | /subpages | Unicode | Full overview (aim) | |||||||
Redirect | {{R from ISO 15924 code}} | /subpages | template | Redirects | enwiki | ||||||
userbox | {{User iso15924}} | /subpages | Userboxes | ||||||||
Wikidata properties | |||||||||||
Directionality | (P1406) | P1406 | {{Infobox}}, ... | ||||||||
Unicode ranges | (P5949) | P5949 | {{Infobox}}, ... | ||||||||
ISO English name | (P2561) | P2561 | Crosscheck only | ||||||||
|
Background: How is this table composed
Note that a script is not a language. A single script, like the Latin alphabet, is used in many languages. Unicode is only about scripts, not about languages that use that script. Still there may be nuances, like the English versus Polish language in using accents on letters.
Step 1: ISO defines a script
ISO defines and publishes a script in the ISO 15924 list. It defines the Alpha-4 code (Aaaa-Zzzz), the Numeric code (000-999), and the formal Name for each accepted script. Currently there are some 160 scripts defined in this list. Included are scripts like "Mathematical notation (Zmth)" and "Code for undetermined script (a.k.a. Common, Zyyy)". The list is formally maintained and published by ISO, and practically by the Unicode Consortium office. It is published on the Unicode website. Technically, the list is file iso15924.txt
.
Step 2: Unicode attaches an Alias name
Then, Unicode (not ISO) maintains a list of Alias script names right next to the ISO-defined scripts, for each script Unicode has encoded. The Alias name is an English name for that script.
So the ISO alpha-4 code gets a unique Alias name by Unicode:
Mymr:ISO Name=Myanmar (Burmese), Alias=Myanmar
.
These Alias names are also present in the definition file iso15924.txt
.[1]
Step 3: Usage by Unicode
From that list, Unicode can translate any alpha4-code into the Alias name of the script, and reverse. Unicode does not use the formal ISO name.
A script name is used in the Unicode Name of a character: "U+05BF ֿ HEBREW POINT RAFE".
Per character
In the Unicode database, Unicode adds one single appropriate alpha-4 code to every individual script character. So every letter, punctuation, number and so of a script get that code. Characters used by multiple scripts, such as the period (.), have script code "Zyyy" (Common). The "script" codes for Mathematical and Symbol are not used by Unicode; symbols and mathematical characters have the property script="Unknown".
Then, in the file Scripts.txt
, Unicode publishes the Alias script name per character (possibly by a range of characters). A part of that file looks like:
... 0591..05BD ; Hebrew # Mn [45] HEBREW ACCENT ETNAHTA..HEBREW POINT METEG 05BE ; Hebrew # Pd HEBREW PUNCTUATION MAQAF 05BF ; Hebrew # Mn HEBREW POINT RAFE 05C0 ; Hebrew # Po HEBREW PUNCTUATION PASEQ 05C1..05C2 ; Hebrew # Mn [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT 05C3 ; Hebrew # Po HEBREW PUNCTUATION SOF PASUQ ...
This datafile defines which scripts are present in Unicode, and what script is at a certain code point.
In a block
Given a block range of code points, then which scripts are present in that block? See {{Unicode blocks}}: this table is constructed by signaling every script that is present as a block (once).
There is no secure relation between a script name and a block name. Some scripts are in a single block, but other scripts are spread amongst several blocks.
See also
- {{ISO 15924 script codes and related Unicode data/header}} (technical subtemplate)
- {{ISO 15924 script codes and related Unicode data/row}} (technical subtemplate)