Template:ISO 15924 script codes and related Unicode data/header

v t e Scripts in ISO 15924^[a]^[b] and in Unicode^[c]^[d]
ISO 15924				Script in Unicode^[e]
Code	ISO number	ISO formal name	Directionality	Unicode Alias^[f]	Version	Characters	Notes	Description

Template documentation[view] [edit] [history] [purge]

This documentation is shared between templates {{Unicode blocks}} and {{ISO 15924 script codes and related Unicode data}}.

Usage

The template can be used as usual. It is not a navigation box, so it can be everywhere in an article. The notes are contained within the template, and will not appear in the main References part.

Note: when resolving red links or wrong links, edit {{ISO 15924/wp-article}}. That is where the connection between ISO code and a Wikipedia article is made.

ISO 15924 templates

Item

In template

/subs

Content

Example

Publisher

Usage

TPU

Note

Code (ISO)

/subp

ID

Arab

ISO 15924

Everywhere

[TPU]

Alpha-4, enwiki central ISO script id list

Alias (Unicode)

/subp

ID

Arabic

Unicode

[TPU]

Article (enwiki)

/subp

ID

[[Arabic script]]

enwiki

[TPU]

QID (wikidata)

/subp

ID

Q790681

Wikidata

[TPU]

Number; range 000–999

/subp

ID

234

ISO 15924

rarely

[TPU]

ISO number not used as ID in enwiki; see Code

Scripts (sub)merged into main scripts

/subp

Merged scripts

Latf → Latn

Unicode

Script descriptions, re U+

[TPU]

In mainspace: 10× hardcoded (e.g.); 2× Qxxx depr

Name

/subp

data

Deseret (Mormon)

ISO 15924

[TPU]

Unicode chapter

/subp

data

Ch 18.1

Unicode

[TPU]

pdf does not open at .n subchapter

Script example
character

/subp

data

ع‎

enwiki

User boxes

[TPU]

In Mainspace

Overview

ISO
U
enwiki

/subp

list

enwiki

ISO 15924

[TPU]

Mainspace: ISO 15924, Script (Unicode), Unicode character property

Blocks ⇄ Scripts

/subp

list

enwiki

some script articles

[TPU]

Mainspace; related

graphs

/subp

fonts&files

[TPU]

Mainspace, Scripts in Unicode

Overviews

4 IDs:

ISO
U
WD
enwiki

/subp

4 ID's, related

Wikidata

check consistency

[TPU]

Overview: templates

/subp

list

Wikipedia

[TPU]

WP-category

/subp

data

Category:Arabic script

enwiki

Not checked for mainspace

[TPU]

watered down concept for minor scripts

Also (doc, userbox, technical, ...)

Documentation

/subp

prime documentation

Latin script in Unicode (~)

[TPU]

Reused in multiple templates

Redirect

/subp

template

enwiki

Redirects

[TPU]

userbox

/subp

Userboxes

[TPU]

Related Changes

/subp

pages Unicode, ISO 15924 WP:RELC

Related Changes

enwiki

WikiProject

[TPU]

900+700 P x T

Unicode versions

/subp

Version number

as of Unicode version 13.0

enwiki

[TPU]

(new Sep2022)

Wikidata properties

Directionality

script directionality (P1406)

P1406

{{Infobox}}, ...

Unicode ranges

Unicode range (P5949)

P5949

{{Infobox}}, ...

ISO English name

name (P2561)

P2561

Crosscheck only

Modules

Data module

module:Unicode data

/subp

§ Functions overview

HTML named entities

module:Numcr2namecr

/subp

More templates

All subpages of {{ISO_15924}}

{{lang}} connection (§ Indicating writing script)

Template data

This is the TemplateData for this template used by TemplateWizard, VisualEditor and other tools. See a monthly parameter usage report for Template:ISO 15924 script codes and related Unicode data in articles based on its TemplateData.

TemplateData for ISO 15924 script codes and related Unicode data

No description.

Template parameters[Edit template data]
Parameter		Description	Type	Status
1	`1`	no description	Unknown	optional

Background: How is this table composed

Note that a script is not a language. A single script, like the Latin alphabet, is used in many languages. Unicode is only about scripts, not about languages that use that script. Still there may be nuances, like the English versus Polish language in using accents on letters.

Step 1: ISO defines a script

ISO defines and publishes a script in the ISO 15924 list. It defines the Alpha-4 code (Aaaa-Zzzz), the Numeric code (000-999), and the formal Name for each accepted script. Currently there are some 160 scripts defined in this list. Included are scripts like "Mathematical notation (Zmth)" and "Code for undetermined script (a.k.a. Common, Zyyy)". The list is formally maintained and published by ISO, and practically by the Unicode Consortium office. It is published on the Unicode website. Technically, the list is file iso15924.txt.

Step 2: Unicode attaches an Alias name

Then, Unicode (not ISO) maintains a list of Alias script names right next to the ISO-defined scripts, for each script Unicode has encoded. The Alias name is an English name for that script.

So the ISO alpha-4 code gets a unique Alias name by Unicode: Mymr:ISO Name=Myanmar (Burmese), Alias=Myanmar.^[1] These Alias names are also present in the definition file iso15924.txt.

Step 3: Usage by Unicode

From that list, Unicode can translate any alpha4-code into the Alias name of the script, and reverse. Unicode does not use the formal ISO name.

A script name is used in the Unicode Name of a character: "U+05BF ֿ HEBREW POINT RAFE".

Per character

In the Unicode database, Unicode adds one single appropriate alpha-4 code to every individual script character. So every letter, punctuation, number and so of a script get that code. Characters used by multiple scripts, such as the period (.), have script code "Zyyy" (Common). The "script" codes for Mathematical and Symbol are not used by Unicode; symbols and mathematical characters have the property script="Unknown".

Then, in the file Scripts.txt, Unicode publishes the Alias script name per character (possibly by a range of characters). A part of that file looks like:

...
0591..05BD    ; Hebrew # Mn  [45] HEBREW ACCENT ETNAHTA..HEBREW POINT METEG
05BE          ; Hebrew # Pd       HEBREW PUNCTUATION MAQAF
05BF          ; Hebrew # Mn       HEBREW POINT RAFE
05C0          ; Hebrew # Po       HEBREW PUNCTUATION PASEQ
05C1..05C2    ; Hebrew # Mn   [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
05C3          ; Hebrew # Po       HEBREW PUNCTUATION SOF PASUQ
...

This datafile defines which scripts are present in Unicode, and what script is at a certain code point.

In a block

Given a block range of code points, then which scripts are present in that block? See {{Unicode blocks}}: this table is constructed by signaling every script that is present as a block (once).

There is no secure relation between a script name and a block name. Some scripts are in a single block, but other scripts are spread amongst several blocks.

v t e ISO 15924 templates
General	ISO 15924 Unicode `{{Infobox writing system}}`
ISO-defined	`{{ISO 15924 code}}` `{{ISO 15924 name}}` `{{ISO 15924 number}}`
Unicode	{{Unicode Alias script names}} ({{Unicode merged into-scripts}}) `{{ISO 15924 script codes and related Unicode data}}`
Wikipedia (enwiki)	`{{ISO 15924/wp-article}}` (label) `{{ISO 15924/script-example-character}}` `{{ISO 15924/wp-category}}`
Wikidata	`{{ISO 15924/qid}}`
Userboxes	Userboxes/Writing systems `{{User iso15924}}` `{{User iso15924/category-intro}}`
Technical	`{{R from ISO 15924 code}}`
Category:ISO 15924 templates All templates starting with ISO 15924 All templates starting with User iso15924 (Userbox)

v t e Unicode templates
General	{{Unicode navigation}} {{Uncommon Unicode notice}}
Inline	{{Unichar}} {{U+}} {{UTF-8}} {{UTF-8toScalar}} {{UTF-16}}
Character properties	Bidi Class General Category Hexadecimal digit Numeric Type Whitespace
Code points	Planes Private Use Area Unicode blocks
Scripts	{{ISO 15924 script codes and related Unicode data}}
CJK-specific	{{CJK ideographs in Unicode}} {{CJKV}} {{Unihan}} {{Lang-zh}}
Unicode charts Unicode templates

Template:ISO 15924 script codes and related Unicode data/header

Contents

Usage

ISO 15924 templates

Template data

Background: How is this table composed

Step 1: ISO defines a script

Step 2: Unicode attaches an Alias name

Step 3: Usage by Unicode

Per character

In a block

See also