Skip to content

Commit c95edb1

Browse files
jensmaurertkoeppe
authored andcommitted
[index] Add UTF-8, UTF-16, UTF-32 to the index
1 parent d2ce09d commit c95edb1

File tree

8 files changed

+39
-3
lines changed

8 files changed

+39
-3
lines changed

source/basic.tex

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2989,6 +2989,7 @@
29892989
This information is given for the convenience of users of this document and
29902990
does not constitute an endorsement by ISO or IEC of this product.
29912991
\end{footnote}
2992+
\indextext{UTF-8}%
29922993
UTF-8 encoding form
29932994
and is composed of a contiguous sequence of
29942995
bits,

source/compatibility.tex

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -177,6 +177,7 @@
177177
Required for new features.
178178
\begin{itemize}
179179
\item
180+
\indextext{UTF-8}%
180181
The \keyword{char8_t} keyword is added to differentiate
181182
the types of ordinary and UTF-8 literals\iref{lex.string}.
182183
\item
@@ -225,6 +226,7 @@
225226
\end{codeblock}
226227

227228
\diffref{lex.literal}
229+
\indextext{UTF-8}%
228230
\change
229231
Type of UTF-8 string and character literals.
230232
\rationale
@@ -681,6 +683,7 @@
681683
\end{codeblock}
682684

683685
\diffref{ostream.inserters.character}
686+
\indextext{UTF-8}%
684687
\change
685688
Overload resolution for ostream inserters used with UTF-8 literals.
686689
\rationale
@@ -1961,12 +1964,15 @@
19611964
The type of a \grammarterm{string-literal} is changed
19621965
from ``array of \tcode{char}''
19631966
to ``array of \tcode{const char}''.
1967+
\indextext{UTF-8}%
19641968
The type of a UTF-8 string literal is changed
19651969
from ``array of \tcode{char}''
19661970
to ``array of \tcode{const char8_t}''.
1971+
\indextext{UTF-16}%
19671972
The type of a UTF-16 string literal is changed
19681973
from ``array of \textit{some-integer-type}''
19691974
to ``array of \tcode{const char16_t}''.
1975+
\indextext{UTF-32}%
19701976
The type of a UTF-32 string literal is changed
19711977
from ``array of \textit{some-integer-type}''
19721978
to ``array of \tcode{const char32_t}''.

source/declarations.tex

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5352,6 +5352,9 @@
53525352
\indextext{initialization!character array}
53535353

53545354
\pnum
5355+
\indextext{UTF-8}%
5356+
\indextext{UTF-16}%
5357+
\indextext{UTF-32}%
53555358
An array of ordinary character type\iref{basic.fundamental},
53565359
\keyword{char8_t} array,
53575360
\keyword{char16_t} array,

source/future.tex

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1961,6 +1961,9 @@
19611961
\end{itemize}
19621962

19631963
\pnum
1964+
\indextext{UTF-8}%
1965+
\indextext{UCS-2}%
1966+
\indextext{UTF-32}%
19641967
For the facet \tcode{codecvt_utf8}\indexlibraryglobal{codecvt_utf8}:
19651968
\begin{itemize}
19661969
\item
@@ -1974,6 +1977,9 @@
19741977
\end{itemize}
19751978

19761979
\pnum
1980+
\indextext{UTF-16}%
1981+
\indextext{UCS-2}%
1982+
\indextext{UTF-32}%
19771983
For the facet \tcode{codecvt_utf16}\indexlibraryglobal{codecvt_utf16}:
19781984
\begin{itemize}
19791985
\item
@@ -1989,6 +1995,8 @@
19891995
\end{itemize}
19901996

19911997
\pnum
1998+
\indextext{UTF-8}%
1999+
\indextext{UTF-16}%
19922000
For the facet \tcode{codecvt_utf8_utf16}\indexlibraryglobal{codecvt_utf8_utf16}:
19932001
\begin{itemize}
19942002
\item
@@ -2456,8 +2464,11 @@
24562464
\pnum
24572465
The following class template specializations are required
24582466
in addition to those specified in~\ref{locale.codecvt}.
2467+
\indextext{UTF-8}%
2468+
\indextext{UTF-16}%
24592469
The specialization \tcode{codecvt<char16_t, char, mbstate_t>}
24602470
converts between the UTF-16 and UTF-8 encoding forms, and
2471+
\indextext{UTF-32}%
24612472
the specialization \tcode{codecvt<char32_t, char, mbstate_t>}
24622473
converts between the UTF-32 and UTF-8 encoding forms.
24632474

source/iostreams.tex

Lines changed: 9 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13228,11 +13228,17 @@
1322813228
so no conversion from \keyword{wchar_t} value type arguments or to \tcode{wchar_t}
1322913229
value type return values is performed.
1323013230
\end{note}
13231-
\item \keyword{char8_t}: The encoding is UTF-8. The method of conversion
13231+
\item
13232+
\indextext{UTF-8}%
13233+
\keyword{char8_t}: The encoding is UTF-8. The method of conversion
1323213234
is unspecified.
13233-
\item \keyword{char16_t}: The encoding is UTF-16. The method of conversion
13235+
\item
13236+
\indextext{UTF-16}%
13237+
\keyword{char16_t}: The encoding is UTF-16. The method of conversion
1323413238
is unspecified.
13235-
\item \keyword{char32_t}: The encoding is UTF-32. The method of conversion
13239+
\item
13240+
\indextext{UTF-32}%
13241+
\keyword{char32_t}: The encoding is UTF-32. The method of conversion
1323613242
is unspecified.
1323713243
\end{itemize}
1323813244

source/lex.tex

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -364,6 +364,9 @@
364364
shall be one greater than the value of the previous.
365365
The ordinary and wide literal encodings are otherwise
366366
\impldef{ordinary and wide literal encodings}.
367+
\indextext{UTF-8}%
368+
\indextext{UTF-16}%
369+
\indextext{UTF-32}%
367370
For a UTF-8, UTF-16, or UTF-32 literal,
368371
the UCS scalar value
369372
corresponding to each character of the translation character set

source/locales.tex

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1650,6 +1650,9 @@
16501650
convert the implementation-defined native character set.
16511651
\tcode{codecvt<char, char, mbstate_t>} implements a degenerate conversion;
16521652
it does not convert at all.
1653+
\indextext{UTF-8}%
1654+
\indextext{UTF-16}%
1655+
\indextext{UTF-32}%
16531656
The specialization \tcode{codecvt<char16_t, char8_t, mbstate_t>}
16541657
converts between the UTF-16 and UTF-8 encoding forms, and
16551658
the specialization \tcode{codecvt} \tcode{<char32_t, char8_t, mbstate_t>}

source/strings.tex

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -363,6 +363,7 @@
363363
the built-in operators \tcode{=}, \tcode{==}, and \tcode{<} respectively.
364364

365365
\pnum
366+
\indextext{UTF-8}%
366367
The member \tcode{eof()} returns an
367368
\impldef{return value of \tcode{char_traits<char8_t>::eof}} constant
368369
that cannot appear as a valid UTF-8 code unit.
@@ -5763,6 +5764,7 @@
57635764
the next multibyte character (including any shift sequences).
57645765
If the function determines
57655766
that the next multibyte character is complete and valid,
5767+
\indextext{UTF-8}%
57665768
it determines the values of the corresponding UTF-8 code units and then,
57675769
if \tcode{pc8} is not a null pointer,
57685770
stores the value of the first (or only) such code unit
@@ -5810,6 +5812,7 @@
58105812
If \tcode{s} is a null pointer, equivalent to
58115813
\tcode{c8rtomb(buf, u8'$\backslash$0', ps)}
58125814
where \tcode{buf} is an internal buffer.
5815+
\indextext{UTF-8}%
58135816
Otherwise, if \tcode{c8} completes a sequence of valid UTF-8 code units,
58145817
determines the number of bytes needed
58155818
to represent the multibyte character (including any shift sequences),

0 commit comments

Comments
 (0)