0
0
mirror of https://github.com/ezyang/htmlpurifier.git synced 2024-12-22 08:21:52 +00:00

Remove incorrect information about bit-size

UTF-8 is a variable-width encoding that uses octets, UTF-16
is a variable-width encoding that uses 16-bit words, and
UCS-2 is an obsolete fixed-width encoding that doesn't not
support characters beyond the BMP. Explaining this would be
unwieldly, so we just removed the information.

See also: http://www.reddit.com/info/6mlqc/comments/c04aold

Signed-off-by: Edward Z. Yang <edwardzyang@thewritingpot.com>
This commit is contained in:
Edward Z. Yang 2008-06-24 22:12:56 -04:00
parent 7727cea112
commit 85fb192d93

View File

@ -118,9 +118,8 @@ there are now many character encodings floating around.</p>
see a page on the web, chances are it's encoded in one see a page on the web, chances are it's encoded in one
of these encodings.</li> of these encodings.</li>
<li><strong>Unicode-based encodings</strong> implement the <li><strong>Unicode-based encodings</strong> implement the
Unicode standard and include UTF-8, UCS-2 and UTF-16. Unicode standard and include UTF-8, UTF-16 and UTF-32/UCS-4.
They go beyond 8-bits (the first two are variable length, They go beyond 8-bits and support almost
while the second one uses 16-bits), and support almost
every language in the world. UTF-8 is gaining traction every language in the world. UTF-8 is gaining traction
as the dominant international encoding of the web.</li> as the dominant international encoding of the web.</li>
</ul> </ul>