2-02 The evolution of character encodings

  • GB2312, also known as the GB code, was issued by China's State Administration of Standards and took effect on May 1, 1981. It is used throughout mainland China, and Singapore uses it as well. It defines 7,445 graphic characters in total, of which 6,763 are Chinese characters.
  • GBK 1.0 was released in 1995. GBK can represent traditional and simplified characters at the same time and is compatible with GB2312. It contains 21,003 Chinese characters in total, including all the Chinese characters in the CJK (Chinese, Japanese, Korean) set.
  • GB18030 was released in 2000 as an extension of GBK. It covers Chinese, Japanese, Korean, and the scripts of China's ethnic minorities, includes 27,484 Chinese characters, and is compatible with the GBK and GB2312 character sets.
  • BIG5: the standard Traditional Chinese character set used in Taiwan. It uses double-byte encoding, contains 13,053 Chinese characters in total, and was introduced in 1984.
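The compatibility described above can be checked with a small Python sketch. It assumes Python's built-in codecs named "gb2312", "gbk", "gb18030", and "big5"; the sample strings are chosen only for illustration and are not from the original notes.

```python
# Sample text: both characters are simplified and defined in GB2312.
simplified = "汉字"
# A traditional-form character, used here on the assumption that GB2312 lacks it.
traditional = "體"

# GBK and GB18030 are compatible with GB2312, so the bytes should match.
gb2312_bytes = simplified.encode("gb2312")
gbk_bytes = simplified.encode("gbk")
gb18030_bytes = simplified.encode("gb18030")
same_bytes = gb2312_bytes == gbk_bytes == gb18030_bytes
print("GB2312 / GBK / GB18030 agree:", same_bytes, gb2312_bytes.hex())

# GB2312 cannot encode the traditional character; GBK and Big5 can.
try:
    traditional.encode("gb2312")
    gb2312_has_traditional = True
except UnicodeEncodeError:
    gb2312_has_traditional = False
print("GB2312 can encode the traditional character:", gb2312_has_traditional)

gbk_traditional = traditional.encode("gbk")
big5_traditional = traditional.encode("big5")
print("GBK bytes: ", gbk_traditional.hex())
print("Big5 bytes:", big5_traditional.hex())
```

Each Chinese character takes two bytes in all of these encodings here, but GBK and Big5 assign their byte values independently, which is why text written in one cannot simply be read as the other.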

To solve the problem that the different encodings used in each country were not interoperable, the ISO standards organization stepped in.

  • Unicode: an international standard character set that assigns every character in every language a unique code, so that text can be exchanged across languages and platforms. Unicode originally specified that every character and symbol is represented with at least 16 bits (2 bytes), giving 2 ** 16 = 65536 possible codes; the current standard extends the code space up to U+10FFFF.
  • UTF-8: a variable-length encoding of Unicode that saves space. Instead of using at least 2 bytes for everything, it sorts characters and symbols into classes: ASCII characters are stored in 1 byte, European characters in 2 bytes, and East Asian characters in 3 bytes (rarer characters outside the Basic Multilingual Plane take 4 bytes).
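The byte counts above can be verified with a short Python sketch. The three sample characters are assumptions picked for illustration, one per class mentioned in the list.

```python
# Number of distinct values that fit in 16 bits (2 bytes).
code_points_16bit = 2 ** 16
print("16-bit code values:", code_points_16bit)

# One illustrative character per class (hypothetical samples).
samples = {
    "ASCII letter": "A",
    "European letter": "\u00e9",   # e with acute accent
    "East Asian character": "中",
}

# Length of each character once encoded as UTF-8.
utf8_lengths = {label: len(ch.encode("utf-8")) for label, ch in samples.items()}

for label, ch in samples.items():
    print(f"{label} {ch!r}: U+{ord(ch):04X} -> {utf8_lengths[label]} byte(s) in UTF-8")
```

Because ASCII text stays at one byte per character, an existing ASCII file is already valid UTF-8, which is a large part of why UTF-8 became the common choice.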

 

The default encoding on Chinese-language versions of Windows is GBK.

The default encoding on macOS and Linux is UTF-8.
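To see which default applies on a given machine, and what happens when the two defaults meet, a minimal Python sketch follows. The reported value depends on the platform and on Python's settings, so no particular output is assumed; the sample string is illustrative only.

```python
import locale
import sys

# Encoding the platform prefers for text; typically cp936 (GBK) on
# Chinese Windows and UTF-8 on macOS/Linux, but this can vary.
platform_default = locale.getpreferredencoding(False)
print("locale preferred encoding:", platform_default)

# Encoding Python 3 uses for str.encode() / bytes.decode() with no argument.
print("Python str default:", sys.getdefaultencoding())

# Bytes written as UTF-8 but read back as GBK do not round-trip.
text = "中文"
utf8_bytes = text.encode("utf-8")
right = utf8_bytes.decode("utf-8")
wrong = utf8_bytes.decode("gbk", errors="replace")
print("decoded as UTF-8:", right)
print("decoded as GBK:  ", wrong)
```

Passing an explicit encoding when opening files, for example open(path, encoding="utf-8"), avoids depending on whichever default the platform happens to have.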

 

Origin www.cnblogs.com/echo-kid-coding/p/11132197.html