Integrating analyzers
Integrating the IK Chinese analyzer
For ES installation, see the blog post "Elastic Search Getting Started".
- Download the IK analyzer plugin, choosing the zip package that matches your ES version. This guide uses elasticsearch-analysis-ik-7.6.0.zip as an example.
- Upload the zip archive to the server (/home/monk/Download/elasticsearch-analysis-ik-7.6.0.zip) and extract it into the ES plugin directory (/apps/elasticsearch-7.6.0/plugins/elasticsearch-analysis-ik-7.6.0), as shown:
unzip /home/monk/Download/elasticsearch-analysis-ik-7.6.0.zip -d /apps/elasticsearch-7.6.0/plugins/elasticsearch-analysis-ik-7.6.0/
- Restart ES.
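Because the plugin zip has to correspond to the ES version, it can help to compare the two before extracting. The following is a minimal sketch, not part of the original steps: the helper names (zip_version, es_version, check_plugin_version) are hypothetical, and ES_URL assumes a node reachable on localhost:9200 without authentication.

```shell
#!/usr/bin/env bash
# Assumption: ES listens here; override ES_URL for another host or port.
ES_URL="${ES_URL:-http://localhost:9200}"

# Extract the version from a file name such as
# elasticsearch-analysis-ik-7.6.0.zip (everything after the last "-").
zip_version() {
  local name
  name="$(basename "$1" .zip)"
  echo "${name##*-}"
}

# Read the "number" field from the root endpoint of a running node.
# Requires curl and a reachable ES instance.
es_version() {
  curl -s "$ES_URL" | sed -n 's/.*"number" *: *"\([^"]*\)".*/\1/p'
}

# Hypothetical helper: succeeds only when both versions are identical.
check_plugin_version() {
  [ "$(zip_version "$1")" = "$(es_version)" ]
}
```

With a node running, something like check_plugin_version /home/monk/Download/elasticsearch-analysis-ik-7.6.0.zip && echo "versions match" could be run before the unzip step.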
Integrating the Pinyin analyzer
- Download the Pinyin analyzer plugin, choosing the zip package that matches your ES version. This guide uses elasticsearch-analysis-pinyin-7.6.0.zip as an example.
- Upload the zip archive to the server (/home/monk/Download/elasticsearch-analysis-pinyin-7.6.0.zip) and extract it into the ES plugin directory (/apps/elasticsearch-7.6.0/plugins/elasticsearch-analysis-pinyin-7.6.0), as shown:
unzip /home/monk/Download/elasticsearch-analysis-pinyin-7.6.0.zip -d /apps/elasticsearch-7.6.0/plugins/elasticsearch-analysis-pinyin-7.6.0/
- Restart ES.
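Both plugins follow the same pattern: the zip is extracted into a directory under plugins/ named after the archive. A minimal sketch of that pattern follows. The function names are hypothetical and the paths are the ones used in this guide; elasticsearch-plugin is the CLI tool shipped in the ES bin directory.

```shell
#!/usr/bin/env bash
# Derive the target plugin directory from the zip name.
#   $1 = path to the plugin zip, $2 = ES home directory
plugin_dir_for() {
  echo "$2/plugins/$(basename "$1" .zip)"
}

# Hypothetical wrapper around the unzip step shown for IK and Pinyin.
install_plugin() {
  local target
  target="$(plugin_dir_for "$1" "$2")"
  mkdir -p "$target" && unzip -o "$1" -d "$target"
}

# After the restart, list the plugins found in the ES installation.
#   $1 = ES home directory
list_plugins() {
  "$1/bin/elasticsearch-plugin" list
}
```

For example, install_plugin /home/monk/Download/elasticsearch-analysis-pinyin-7.6.0.zip /apps/elasticsearch-7.6.0 would perform the same extraction as the command above, and list_plugins /apps/elasticsearch-7.6.0 should then show both plugin directories.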
Verifying the integration
- Output of the default analyzer
- Output of the IK analyzers
  - ik_max_word: splits the text at the finest granularity. For example, "中华人民共和国国歌" (the national anthem of the People's Republic of China) is split into "中华人民共和国, 中华人民, 中华, 华人, 人民共和国, 人民, 人, 民, 共和国, 共和, 和, 国国, 国歌", exhausting all possible combinations. Suitable for Term Query.
  - ik_smart: splits the text at the coarsest granularity. For example, "中华人民共和国国歌" is split into "中华人民共和国, 国歌". Suitable for Phrase queries.
- Output of the Pinyin analyzer
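One way to compare the analyzers listed above is the ES _analyze API, which returns the tokens an analyzer produces for a given text. The sketch below is an illustration under assumptions: ES_URL points at an unauthenticated local node, the helper names are hypothetical, and the JSON body is built naively, so the text must not contain double quotes or backslashes. The analyzer names standard, ik_max_word, ik_smart and pinyin are the ones registered by ES and by the two plugins.

```shell
#!/usr/bin/env bash
# Assumption: ES listens here; override ES_URL for another host or port.
ES_URL="${ES_URL:-http://localhost:9200}"

# Build the JSON request body for _analyze.
#   $1 = analyzer name, $2 = text (no quotes or backslashes; not escaped here)
analyze_body() {
  printf '{"analyzer":"%s","text":"%s"}' "$1" "$2"
}

# Send the body to the _analyze endpoint and print the token list.
analyze() {
  curl -s -X POST "$ES_URL/_analyze?pretty" \
       -H 'Content-Type: application/json' \
       -d "$(analyze_body "$1" "$2")"
}

# Example calls against a running node:
#   analyze standard    "中华人民共和国国歌"
#   analyze ik_max_word "中华人民共和国国歌"
#   analyze ik_smart    "中华人民共和国国歌"
#   analyze pinyin      "中华人民共和国国歌"
```

If a plugin did not load, the request for its analyzer should return an error instead of tokens, which makes this a quick check that the integration worked.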