

Standard, Unicode 3.1.1, defines 102,655 characters, Six bytes for each character, theoretically allowing 2 21 possibleĬharacters - that's well over one million characters. Possible characters, Unicode uses (depending upon encoding) anywhere from one to

Where the original ASCII fontĮncoding uses only one byte for each character, allowing only 256 Permits millions of separate characters to be referenced: enough for all theĪlphabets, syllabaries, logographic and mixed scripts used by modern readersĪs well as a large number of ancient scripts. Unicode is a universal standard for character encoding that XML and HTML) the most widely used method of representing rich text documents in electronic The ISO standard for text markup, SGML, was first adopted in 1981 and is today (in the forms of Heretofore most ISO standards have had useful lives measured in decades for instance, World Wide Web Consortium as the standard method of encoding text for World Wide Web documents. Unicode is a universal standard maintained by the International Standards OrganizationĪnd the international Unicode Consortium, a standard which has been adopted by the internation ▣ Home | ◈ Contents | △ Section | ◁ Previous | Next ▷įor the World Wide Web D R A F T Why Unicode? Unicode Polytonic Greek for the World Wide Web
