Solr install, use, configure Chinese tokenizer - Code World

Solr install, use, configure Chinese tokenizer

Others 2022-04-22 04:53:39 views: 0

Original address: https://blog.csdn.net/yuruixin_china/article/details/80037873

Solr is a search server built on the Lucene search library. It wraps Lucene in an enterprise-level application framework and provides a complete solution for clustering and index optimization.

Solr can run standalone inside a Servlet container such as Jetty or Tomcat. Indexing is simple: send an XML document describing the fields and their content to the Solr server with an HTTP POST, and Solr adds, deletes, or updates index entries according to that document. Searching only requires an HTTP GET request; the client then parses the results, which Solr returns in XML, JSON, or other formats, and lays out the page itself. Solr does not provide tools for building a UI, but it does provide an admin interface where you can inspect Solr's configuration and running state.
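The POST-to-index and GET-to-search flow described above can be sketched in Python as follows. This is only an illustration under assumptions: the field names `id` and `title` are hypothetical, the core name `gxl_core` is the one created in the installation steps of this article, and the `/update` and `/select` paths are Solr's standard handlers on a default install. `post_update` needs a running Solr and is not called here.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

SOLR_BASE = "http://127.0.0.1:8983/solr"  # address used in this article
CORE = "gxl_core"  # core created in the installation steps of this article


def build_add_xml(doc):
    """Build the <add><doc>...</doc></add> XML body for a single document."""
    add = ET.Element("add")
    doc_el = ET.SubElement(add, "doc")
    for name, value in doc.items():
        field = ET.SubElement(doc_el, "field", name=name)
        field.text = str(value)
    return ET.tostring(add, encoding="unicode")


def build_select_url(query, wt="json"):
    """Build the GET URL for a search; wt selects the response format."""
    params = urllib.parse.urlencode({"q": query, "wt": wt})
    return f"{SOLR_BASE}/{CORE}/select?{params}"


def post_update(xml_body):
    """POST the XML body to the update handler and commit (requires a running Solr)."""
    req = urllib.request.Request(
        f"{SOLR_BASE}/{CORE}/update?commit=true",
        data=xml_body.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(req) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    # Hypothetical document with illustrative field names.
    print(build_add_xml({"id": "1", "title": "hello solr"}))
    print(build_select_url("title:hello"))
```

Whether a given field name is accepted depends on the core's schema, so adjust the fields to match your own configuration.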

Installation

1. Download address: http://archive.apache.org/dist/lucene/solr/
(The server is overseas, so the download may be slow. This article uses Solr 6.1.0.)
2. After the download completes, unzip the archive
3. Start Solr
4. Visit the Solr admin interface
http://127.0.0.1:8983/solr/#/
5. Create a core (comparable to a database in MySQL; one Solr service can host multiple cores)

solr create -c gxl_core

6. Enter the core you just created and test the word segmentation

7. The analyzers that ship with Solr cannot segment Chinese text into meaningful words, so you need to add the Chinese analyzer IKAnalyzer

a. Put the IK jar into the solr-6.1.0\server\solr-webapp\webapp\WEB-INF\lib directory
b. Modify the managed-schema file in the core's conf directory (Solr 6.x names it managed-schema, with no .xml extension) and add the following code inside the schema tag

<!-- IKAnalyzer-->
  <fieldType name="text_ik" class="solr.TextField">
    <analyzer class="org.wltea.analyzer.lucene.IKAnalyzer"/>
  </fieldType>

c. Restart Solr

solr restart -p 8983
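After the restart it can be useful to confirm that the core loaded with the new field type. One way, sketched here in Python, is to query Solr's CoreAdmin STATUS action. The response shape assumed in `core_is_loaded` (a `status` object keyed by core name, empty when the core is not loaded) is an assumption about the JSON output, so verify it against your own server. `fetch_status` needs a running Solr and is not called here.

```python
import json
import urllib.parse
import urllib.request

SOLR_BASE = "http://127.0.0.1:8983/solr"  # address used in this article


def build_status_url(core):
    """Build the CoreAdmin STATUS URL for one core."""
    params = urllib.parse.urlencode({"action": "STATUS", "core": core, "wt": "json"})
    return f"{SOLR_BASE}/admin/cores?{params}"


def core_is_loaded(status_response, core):
    """Return True if the parsed STATUS response has a non-empty entry for the core.

    Assumes the response looks like {"status": {"<core>": {...}}}.
    """
    return bool(status_response.get("status", {}).get(core))


def fetch_status(core):
    """Fetch and parse the STATUS response (requires a running Solr)."""
    with urllib.request.urlopen(build_status_url(core)) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

If the core fails to load, for example because the IK jar is missing from the lib directory, the Solr log and the admin interface are the places to look for the underlying error.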

After the steps above, test the word segmentation again to see the effect of the new text_ik field type.
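The same segmentation test can also be run over HTTP instead of through the admin page. The sketch below builds a request to Solr's field analysis handler for the `text_ik` field type defined above. It assumes the handler is reachable at `/analysis/field` on the core, as on a default install; the sample text is a hypothetical input. The response is returned as raw JSON text because its exact structure is not described in this article.

```python
import urllib.parse
import urllib.request

SOLR_BASE = "http://127.0.0.1:8983/solr"  # address used in this article


def build_analysis_url(core, field_type, text):
    """Build a field analysis URL that runs `text` through `field_type`."""
    params = urllib.parse.urlencode({
        "analysis.fieldtype": field_type,
        "analysis.fieldvalue": text,
        "wt": "json",
    })
    return f"{SOLR_BASE}/{core}/analysis/field?{params}"


def fetch_analysis(core, field_type, text):
    """Return the raw JSON analysis response (requires a running Solr)."""
    with urllib.request.urlopen(build_analysis_url(core, field_type, text)) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    # Hypothetical sample sentence; any Chinese text works.
    print(build_analysis_url("gxl_core", "text_ik", "中华人民共和国"))
```

Running `fetch_analysis` once with `text_ik` and once with a built-in field type should make the difference in segmentation visible.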
