This commit is contained in:
Kovid Goyal 2025-04-13 10:34:45 +05:30
parent d30354e9c8
commit 423bdfc5f3
No known key found for this signature in database
GPG key ID: 06BC317B515ACE7C

View file

@ -407,28 +407,28 @@ character. To do this, characters are divided into various classes, as
described by the rules below, in order of decreasing priority:
.. note::
Notation: :code:`[start, stop, step]` means the integeres from :code:`start`
Notation: :code:`[start, stop, step]` means the integers from :code:`start`
to :code:`stop` in increments of :code:`step`. When the step is not
specified, it defaults to one.
#. Regional indicators: 26 characters starting at :code:`0x1F1E6`. These all
#. *Regional indicators*: 26 characters starting at :code:`0x1F1E6`. These all
have width 2
#. Doublewidth: Parse `EastAsianWidth.txt
#. *Doublewidth*: Parse `EastAsianWidth.txt
<https://www.unicode.org/Public/UCD/latest/ucd/EastAsianWidth.txt>`__ from
the Unicode standard. All characters marked :code:`W` or :code:`F` have
width two. All characters in the following ranges have width two *unless*
they are marked as :code:`A` in :code:`EastAsianWidth.txt`: :code:`[0x3400,
0x4DBF], [0x4E00, 0x9FFF], [0xF900, 0xFAFF], [0x20000, 0x2FFFD], [0x30000, 0x3FFFD]`
#. Wide emoji: Parse `emoji-sequences.txt
#. *Wide emoji*: Parse `emoji-sequences.txt
<https://www.unicode.org/Public/emoji/latest/emoji-sequences.txt>`__ from
the Unicode standard. All :code:`Basic_Emoji` have width two unless they are
followed by :code:`FE0F` in the file. The leading copdepoints in all
:code:`RGI_Emoji_Modifier_Sequence` and :code:`RGI_Emoji_Tag_Sequence` have width two.
All codepoints in :code:`RGI_Emoji_Flag_Sequence` have width two.
#. Marks: These are all zero width characters. They are characters with Unicode
#. *Marks*: These are all zero width characters. They are characters with Unicode
categories whose first letter is :code:`M` or :code:`S`. Additionally,
characters with Unicode category: :code:`Cf`. Finally, they include
all modifier codepoints from :code:`RGI_Emoji_Modifier_Sequence` in the