openKylin realizes comprehensive support of the new national standard system level!

In the recently released version 1.0 of openKylin, support for the new national standard GB 18030-2022 has been realized from the underlying library to the application program . Text files conforming to the GB 18030-2022 specification can be viewed and edited normally, and the national standard Arial series fonts are built in. The GB 18030-2022 internal code input method has been added to complete the full support for the new national standard GB 18030-2022 .

Today, I will take you to learn more about what GB18030-2022 is, why it is supported and what openKylin has done.

 

1. What is GB18030-2022

The full name of GB18030 is "Information Technology Chinese Coded Character Set", which is a national standard for super large Chinese coded character set independently developed by my country. This standard specifies the binary codes of Chinese characters and other commonly used characters and their corresponding glyphs or graphics. GB18030-2022 released on July 19, 2022 is its latest version (referred to as the new national standard), and will be officially implemented on August 1, 2023.

Compared with the old standard GB18030-2005, the GB18030-2022 standard has added more than 17,000 Chinese characters, including a total of 87,887 Chinese characters and 228 Chinese radicals. The new national standard has greatly expanded the number of Chinese characters, especially the number of rare characters, which basically meets the needs of processing rare characters in scenarios such as names of people, places, and ancient books.

In addition, GB18030-2022 provides good compatibility with the old Chinese coding standards, and the new national standard is compatible with GBK and GB2312 in vocabulary and coding.

GB18030-2022 stipulates three implementation levels, level 3 supported by openKylin is the highest implementation level - supporting all Chinese characters in the new national standard.

 

2. Why do you need to support GB18030-2022

According to the GB18030-2022 national standard document, the new national standard is applicable to all "technical products with the information processing and exchange functions of Chinese and other text and graphic characters", including computers, operating systems, input methods and other software and hardware products. From the perspective of scope of application, the openKylin operating system needs to support the new national standard.

 

3. Work done to support GB18030-2022

In order to fully and accurately support the GB18030-2022 standard, a comprehensive modification from the underlying basic library to the application program is required to achieve this goal. The modules involved are shown in the figure below:

 

Driven by the GBCharactersEncoding SIG group of the openKylin community , the openKylin 1.0 version has completed relevant modifications to the text editor (pluma), GTK3 framework, harfbuzz and glibc libraries , and realized complete support for GB18030-2022.

Take from open source and give back to open source. In the process of supporting the new national standard, openKylin submitted a PR to the upstream pluma to solve the problem that the pluma text editor cannot set the font name of non-ASCII characters .

 

And submitted 2 PRs to the upstream GTK to solve the problem that the CSS parser in GTK3 cannot correctly recognize the font name of non-ASCII characters .

At the same time, in order to support the new national standard characters of the input method, GBCharactersEncoding SIG and InputMethod SIG jointly developed the GB18030 input method , which enables users to input any new national standard characters through the national standard internal code. So far, openKylin version 1.0 has fully supported the new national standard, including text input, text display and other aspects .
 

 

4. Subsequent plans

Of course, supporting the new national standard is inseparable from the support and cooperation of all parties. To this end, the openKylin community GBCharactersEncoding SIG will also cooperate with all parties in the industry to further promote the implementation of the new national standard, including but not limited to the following aspects:

  • Promote third-party applications to support the new national standard
  • Promote WPS, Qi Anxin browser, Sogou input method and other mainstream applications to support the new national standard.
  • Improve the Qt GUI framework's support for the new national standard

In the future, GBCharactersEncoding SIG will continue to follow up the work related to the national standard character encoding, and strive to become an authoritative open source organization for the implementation of the national standard. All enthusiasts are welcome to join GBCharactersEncoding SIG and contribute to the informatization processing of national standard characters with us.

  • GBCharactersEncoding SIG home page address:
  • https://gitee.com/openkylin/community/tree/master/sig/GBCharactersEncoding

Guess you like

Origin www.oschina.net/news/250040