TOP Unicode Encoding RFC code point
RFC3629 UTF-8, a transformation format of ISO 10646 (STD: 63)
B0 | B1 | B2 | B3 | B4 | B5 | Unicode | bytes | bits | bit pattern(code point) |
00-7F 0xxxxxxx | - | - | - | - | - | U+00 … U+7F | 1 | 7 | 0xxx xxxx |
C2-DF 110y yyyy | 80-BF 10xx xxxx | - | - | - | - | U+0080 … U+07FF | 2 | 11 | 0000 0yyy yyxx xxxx |
E0-EF 1110 zzzz | 80-BF 10yy yyyy | 80-BF 10xx xxxx | - | - | - | U+0800 … U+FFFF | 3 | 16 | zzzz yyyy yyxx xxxx |
F0-F7 1111 0aaa | 80-BF 10zz zzzz | 80-BF 10yy yyyy | 80-BF 10xx xxxx | - | - | U+010000 … U+1FFFFF | 4 | 21 | 000a aazz zzzz yyyy yyxx xxxx |
F8-FB | 80-BF | 80-BF | 80-BF | 80-BF | - | U+00200000 … U+03FFFFFF | 5 | 26 | |
FC-FD | 80-BF | 80-BF | 80-BF | 80-BF | 80-BF | U+04000000 … U+7FFFFFFF | 6 | 26 |
最新コメント