TOP Unicode Encoding RFC code point

std

RFC3629 UTF-8, a transformation format of ISO 10646 (STD: 63)

encoding

B0B1B2B3B4B5Unicodebytesbitsbit pattern(code point)
00-7F
0xxxxxxx
-----U+00 … U+7F170xxx xxxx
C2-DF
110y yyyy
80-BF
10xx xxxx
----U+0080 … U+07FF2110000 0yyy yyxx xxxx
E0-EF
1110 zzzz
80-BF
10yy yyyy
80-BF
10xx xxxx
---U+0800 … U+FFFF316zzzz yyyy yyxx xxxx
F0-F7
1111 0aaa
80-BF
10zz zzzz
80-BF
10yy yyyy
80-BF
10xx xxxx
--U+010000 … U+1FFFFF421000a aazz zzzz yyyy yyxx xxxx
F8-FB80-BF80-BF80-BF80-BF-U+00200000 … U+03FFFFFF526
FC-FD80-BF80-BF80-BF80-BF80-BFU+04000000 … U+7FFFFFFF626

管理人/副管理人のみ編集できます