Encoding: ASCII, Unicode, UTF-8

A computer can only process numbers directly. Any other kind of data, such as text, has to be converted into numbers first. In the early days only uppercase and lowercase letters, digits, and some special symbols were encoded, 128 characters in total (codes 0 to 127); this is the familiar ASCII code. Data is generally stored in bytes, and a byte is 8 bits, so it can hold values up to 255, which is more than enough for ASCII. As computers became popular, more and more languages had to be represented on them, and the original encoding was clearly not enough. Unicode was born to solve this: it unifies the characters of all languages into a single set of codes.
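As a small illustration of the paragraph above, the following Python sketch shows the numbers behind a few characters. The helper names `ascii_codes` and `is_ascii` are made up for this example and are not part of any library.

```python
def ascii_codes(text):
    """Return the list of byte values for an ASCII-only string."""
    return list(text.encode("ascii"))


def is_ascii(text):
    """Return True if every character has a code in the range 0..127."""
    return all(ord(ch) < 128 for ch in text)


# Every ASCII character fits in one byte (0..255).
print(ascii_codes("Hello"))

# A Chinese character is outside ASCII, but Unicode still gives it a number.
print(is_ascii("A"), is_ascii("\u4e2d"))
print(hex(ord("\u4e2d")))
```

Calling `"\u4e2d".encode("ascii")` would raise `UnicodeEncodeError`, which is exactly the limitation that Unicode was created to remove.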
Unicode seemed to solve the problem, but it introduced a new one. An ASCII character needs only one byte (8 bits), while a Unicode character usually takes 2 bytes, so text that is mostly English takes about twice the space to store and transmit, which is very wasteful. This is why UTF-8 exists. UTF-8 is a variable-length encoding: each character is encoded in a number of bytes that depends on its code, 1 to 4 bytes under the current standard (the original design allowed up to 6). English letters take 1 byte and Chinese characters typically take 3 bytes, so encoding with UTF-8 saves space.
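The size difference can be checked with a short Python sketch. It uses `utf-16-le` as a stand-in for the "2 bytes per character" Unicode form mentioned above (an assumption made for this example), and the function name `byte_lengths` is hypothetical.

```python
def byte_lengths(text):
    """Return (UTF-8 size, UTF-16 size without byte order mark) in bytes."""
    return len(text.encode("utf-8")), len(text.encode("utf-16-le"))


samples = {
    "English letter": "A",
    "accented letter": "\u00e9",
    "Chinese character": "\u4e2d",
    "emoji": "\U0001F600",
}

for label, ch in samples.items():
    utf8_len, utf16_len = byte_lengths(ch)
    print(f"{label}: UTF-8 = {utf8_len} bytes, UTF-16 = {utf16_len} bytes")

# For plain English text, UTF-8 produces the same bytes as ASCII.
print("Hello".encode("utf-8") == "Hello".encode("ascii"))
```

For English text UTF-8 needs half the space of the 2-byte form, while a Chinese character costs 3 bytes instead of 2, so the saving depends on what the text contains.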

Origin blog.csdn.net/DOUBLE121PIG/article/details/90577213