[Turn] to match Chinese characters Regular Expressions

Chinese characters matching the regular expression: [/ u4e00- / u9fa5]

Here are a few major non-English language characters range (found on google):

2E80 ~ 33FFh: CJK symbol area. Host Kangxi radical, auxiliary CJK Radicals, phonetic symbols, Japanese kana, note Korean, Japanese and Korean symbols, punctuation, tape, or with Rune numbers, month, and combinations of Japanese kana, units in No., month, date, and time.

3400 ~ 4DFFh: CJK Extension A Ideographs area, receiving a total of 6,582 Japanese and Korean characters.

4E00 ~ 9FFFh: CJK ideograms area, receiving a total of 20,902 Japanese and Korean characters.

A000 ~ A4FFh: Yi text area, housing the South China Yi text and roots.

AC00 ~ D7FFh: combination of Korean alphabet word area, makes up the notes housed in Korean text.

F900 ~ FAFFh: CJK Compatibility Ideographs area, receiving a total of 302 Japanese and Korean characters.

FB00 ~ FFFDh: text region forms, in combination accommodating Latin characters, Hebrew, Arabic, Japanese and Korean straight punctuation, small symbols, half-width symbols, full-width symbols.

For example, in Japan and South Korea need to match all non-symbolic character, the regular expression should be ^ [/ u3400- / u9FFF] + $
Theoretically yes, but I just copied to msn.co.ko a Korean he discovered a fundamental right , strange
and then had a copy msn.co.jp 'お', nor the line ..

And then extended to the ^ [/ u2E80- / u9FFF] + $, so it touches are passed, this should be the match CJK regular expressions, including traditional Chinese Province of Taiwan we are still blind to use

The regular expression for the Chinese, it should be ^ [/ u4E00- / u9FFF] + $, and forums were often brought ^ [/ u4E00- / u9FA5] + $ very close

Note that the forum say ^ [/ u4E00- / u9FA5] + $ This is designed to match the regular expression simplified Chinese, traditional Chinese characters are actually there, I tested with a tester under the 'People's Republic of China ', but also through, of course, ^ [/ u4E00- / u9FFF] + $ is the same result

 

Guess you like

Origin www.cnblogs.com/dajianshi/p/12166951.html