Lucene full-text search engine

Lucene began as a sub-project of the Apache Software Foundation's Jakarta project. It is an open source full-text search toolkit rather than a complete full-text search engine: it provides the architecture for one, including a complete query engine and indexing engine and part of a text analysis engine (for two Western languages, English and German). Its purpose is to give software developers a simple, easy-to-use toolkit for adding full-text search to a target system, or for building a complete full-text search engine on top of it.

Lucene offers a simple yet powerful API for full-text indexing and searching, and it is a mature, free, open source tool in the Java development environment. It has for years been one of the most popular free Java information retrieval libraries. An information retrieval library is related to a search engine, but the two should not be confused.
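To make "full-text indexing and searching" concrete, here is a minimal sketch of an inverted index in plain Java. It is an illustration only, not Lucene code: the class name TinyInvertedIndex and its methods are made up for this example, and the tokenizer is a crude stand-in for a real analyzer.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeSet;

// Toy inverted index: maps each lowercased word to the ids of the documents containing it.
// Hypothetical class for illustration; it does not reflect Lucene's API or index format.
class TinyInvertedIndex {
    private final Map<String, TreeSet<Integer>> postings = new HashMap<>();
    private final List<String> docs = new ArrayList<>();

    // Split on anything that is not a letter or digit (a crude stand-in for an analyzer).
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Store the document and record its id under every token it contains.
    int add(String text) {
        int id = docs.size();
        docs.add(text);
        for (String token : tokenize(text)) {
            postings.computeIfAbsent(token, k -> new TreeSet<>()).add(id);
        }
        return id;
    }

    // Return the ids of documents containing the word, in ascending order.
    List<Integer> search(String word) {
        TreeSet<Integer> ids = postings.get(word.toLowerCase(Locale.ROOT));
        return ids == null ? new ArrayList<>() : new ArrayList<>(ids);
    }
}
```

The point of the structure is that a search looks up one map entry instead of scanning every document, which is the basic idea behind an index-based retrieval library.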

 

Download Luke (luke-5.3.0-luke-release) to inspect the generated index files.

 

Basic operations:

Add documents

Delete documents

Modify documents

Document field weighting (boosting)
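The four basic operations can be sketched with a toy in-memory store. In Lucene itself, adding, deleting, and updating are exposed on IndexWriter (addDocument, deleteDocuments, updateDocument), and an update is a delete followed by an add; the sketch below mirrors that behaviour. Everything else here is an assumption made for illustration: the class TinyDocStore, its method names, and the scoring rule (occurrence count times a per-field weight) are hypothetical and much simpler than Lucene's real scoring.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Toy document store keyed by an id. Hypothetical; not Lucene's API.
class TinyDocStore {
    // id -> (field name -> field text)
    private final Map<String, Map<String, String>> docs = new LinkedHashMap<>();
    // field name -> weight multiplier; fields without an entry weigh 1.0
    private final Map<String, Double> fieldWeights = new HashMap<>();

    void add(String id, Map<String, String> fields) {
        docs.put(id, new HashMap<>(fields));
    }

    // Returns true if a document with this id existed and was removed.
    boolean delete(String id) {
        return docs.remove(id) != null;
    }

    // An update is modelled as a delete followed by an add.
    void update(String id, Map<String, String> fields) {
        delete(id);
        add(id, fields);
    }

    void setFieldWeight(String field, double weight) {
        fieldWeights.put(field, weight);
    }

    int size() {
        return docs.size();
    }

    // Score = sum over fields of (occurrences of the word in the field) * (field weight).
    double score(String id, String word) {
        Map<String, String> fields = docs.get(id);
        if (fields == null) {
            return 0.0;
        }
        String needle = word.toLowerCase(Locale.ROOT);
        double total = 0.0;
        for (Map.Entry<String, String> e : fields.entrySet()) {
            int count = 0;
            for (String token : e.getValue().toLowerCase(Locale.ROOT).split("\\W+")) {
                if (token.equals(needle)) {
                    count++;
                }
            }
            total += count * fieldWeights.getOrDefault(e.getKey(), 1.0);
        }
        return total;
    }
}
```

Raising the weight of a field such as the title makes matches in that field count for more, which is the intent of field weighting.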

 

Advanced usage:

Search for a specific term

Query expressions: QueryParser

Paging implementation
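Paging usually comes down to slicing a ranked result list. A minimal sketch in plain Java follows, assuming the hits are already sorted by relevance; the class Pager and its methods are hypothetical helpers, not part of Lucene.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical paging helper; works on any already-ranked list of hits.
class Pager {
    // Return the items on a 1-based page, or an empty list if the page is out of range.
    static <T> List<T> page(List<T> rankedHits, int pageNumber, int pageSize) {
        if (pageNumber < 1 || pageSize < 1) {
            throw new IllegalArgumentException("pageNumber and pageSize must be >= 1");
        }
        // Use long arithmetic so a large page number cannot overflow int.
        long start = (long) (pageNumber - 1) * pageSize;
        if (start >= rankedHits.size()) {
            return new ArrayList<>();
        }
        int end = (int) Math.min(start + pageSize, rankedHits.size());
        return new ArrayList<>(rankedHits.subList((int) start, end));
    }

    // Number of pages needed to show totalHits results (totalHits assumed non-negative).
    static int pageCount(int totalHits, int pageSize) {
        return (totalHits + pageSize - 1) / pageSize;
    }
}
```

With a real index, the same arithmetic would be applied to the array of scored hits returned by a search that asked for at least pageNumber * pageSize results.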

Other query methods

Query a range of terms with TermRangeQuery; query a range of numbers with NumericRangeQuery; match terms that begin with a given string with PrefixQuery; combine several queries with BooleanQuery.
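What each of these query types matches can be sketched with plain Java predicates. This is a conceptual illustration under stated assumptions, not Lucene code: the class TinyQueries, the Book type, and the field names are hypothetical, and the sketch compares whole field values, whereas Lucene matches against indexed terms.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical predicates that mimic the intent of range, prefix, and combined queries.
class TinyQueries {
    // Made-up record type used only for this sketch.
    static final class Book {
        final String title;
        final int price;

        Book(String title, int price) {
            this.title = title;
            this.price = price;
        }
    }

    // Inclusive lexicographic range on the title (the idea behind a term range query).
    static Predicate<Book> titleRange(String lower, String upper) {
        return b -> b.title.compareTo(lower) >= 0 && b.title.compareTo(upper) <= 0;
    }

    // Inclusive numeric range on the price (the idea behind a numeric range query).
    static Predicate<Book> priceRange(int min, int max) {
        return b -> b.price >= min && b.price <= max;
    }

    // Titles that start with the given string (the idea behind a prefix query).
    static Predicate<Book> titlePrefix(String prefix) {
        return b -> b.title.startsWith(prefix);
    }

    // Return the titles of all books matching the query, in input order.
    static List<String> titles(List<Book> books, Predicate<Book> query) {
        List<String> out = new ArrayList<>();
        for (Book b : books) {
            if (query.test(b)) {
                out.add(b.title);
            }
        }
        return out;
    }
}
```

Combination queries then correspond to composing predicates, for example titlePrefix("java").and(priceRange(35, 60)) for "must match both", with or and negate covering the optional and excluded cases.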

 

Chinese-language support:

Chinese word segmentation: smartcn

Search result highlighting
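Highlighting means wrapping each match in the result text with marker tags. A minimal sketch in plain Java follows; it is not Lucene's highlighter. The class TinyHighlighter and its parameters are hypothetical, it matches raw substrings case-insensitively rather than analyzed terms, and it does not escape HTML in the input.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical highlighter: wraps every case-insensitive occurrence of a term in pre/post tags.
class TinyHighlighter {
    static String highlight(String text, String term, String pre, String post) {
        if (term.isEmpty()) {
            return text;
        }
        // Pattern.quote treats the term literally, so characters like '.' are not regex syntax.
        Pattern p = Pattern.compile(Pattern.quote(term),
                Pattern.CASE_INSENSITIVE | Pattern.UNICODE_CASE);
        Matcher m = p.matcher(text);
        StringBuilder out = new StringBuilder();
        int last = 0;
        while (m.find()) {
            // Copy the text before the match, then the match wrapped in the tags.
            out.append(text, last, m.start()).append(pre).append(m.group()).append(post);
            last = m.end();
        }
        out.append(text, last, text.length());
        return out.toString();
    }
}
```

For example, highlight("Lucene and lucene", "lucene", "<b>", "</b>") wraps both occurrences while keeping their original capitalization.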

 

 

 ================================================

 

Source code:

Indexer.java builds the index files;

Searcher.java is the search example.
