Japan imported the game to China ---- "China's film did not the Japanese code set ----" cause garbled
In order not to mess ---- "---- installed language packs" can be displayed properly in Japanese
United Unicode ----- "supports all languages worldwide
1. unicode, 2-4 bytes, has a collection of 136,690 characters (there are more than 20,000 Chinese characters), and has been constantly expanding. . . . .
2. The language can also be used with each global conversion, unicode ---- "gbk and so around 1990
1.1980 years when a lot of software development based gb2312, then turned to unicode substantially equal to the weight to push
2. Unicode = English, gbk = Chinese, there is no strong demand for all into Unicode
Global computer makers factory support Unicode, most software programs support Unicode
The windows on Chinese computers are gbk coding
Unicode brings a new problem, no problem in the large memory space, or if you want to transfer exists on the hard drive, twice as big waste of space inefficiency +
Unicode for transmission + storage, made a utf-8 (full name: Unicode Transformation Format, academic name UTF), a total of three versions, utf - 8, utf - 16, utf - 32
UTF - 8: Use 1,2,3,4 bytes represent all characters; preferentially use a character, can not meet the increase is 1 byte up to 4 bytes, 1 byte in English, European Languages accounted for two, East Asia accounted for 3, and other special characters occupy 4 bytes
character | ACSI | Unicode | UTF -8 |
A | 01000001 | 00000000 01000001 | 01000001 |
in | x | 01001110 00101101 | 11100100 10111000 10101101 |
py 2 = ACSII
py 3 = Unicode
Before write programs need to declare what is encoded with