Lucene began as a sub-project of the Jakarta project at the Apache Software Foundation and is now a top-level Apache project. It is an open source full-text search toolkit rather than a complete full-text search engine: it provides a complete query engine and indexing engine, plus part of a text analysis engine (for two Western languages, English and German). Its purpose is to give software developers a simple, easy-to-use toolkit for adding full-text search to a target system, or for building a complete full-text search engine on top of it. Lucene offers a simple yet powerful API for full-text indexing and searching. It is a mature, free tool in the Java development environment and has for years been the most popular free Java information retrieval library. An information retrieval library is related to a search engine, but the two should not be confused.
Download Luke (luke-5.3.0-luke-release) to view the generated index files.
Basic operations:
Add a document
Delete a document
Update a document
Boost (weight) a document field
Advanced application:
Search for a specific term
Query expressions: QueryParser
Paging implementation
Other query types
Term range query: TermRangeQuery; numeric range query: NumericRangeQuery; prefix (starts-with) query: PrefixQuery; combined query: BooleanQuery
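The paging item in the list above can be sketched without any Lucene dependency. A common approach with Lucene is to fetch the top page * pageSize hits and then slice out the requested page from the ranked results (for example from `TopDocs.scoreDocs`). The sketch below shows only that slicing arithmetic on a plain list; the class and method names (`Paging`, `pageBounds`, `pageOf`) are hypothetical and are not taken from the original Searcher.java.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: slices one page out of an already ranked hit list.
class Paging {

    // Returns {start, end} for a 1-based page number; end is exclusive.
    // Both values are clamped to totalHits so out-of-range pages are empty.
    static int[] pageBounds(int page, int pageSize, int totalHits) {
        if (page < 1 || pageSize < 1) {
            throw new IllegalArgumentException("page and pageSize must be >= 1");
        }
        int start = Math.min((page - 1) * pageSize, totalHits);
        int end = Math.min(start + pageSize, totalHits);
        return new int[] {start, end};
    }

    // Copies the hits that belong to the requested page.
    static <T> List<T> pageOf(List<T> rankedHits, int page, int pageSize) {
        int[] bounds = pageBounds(page, pageSize, rankedHits.size());
        return new ArrayList<>(rankedHits.subList(bounds[0], bounds[1]));
    }

    public static void main(String[] args) {
        // Stand-in for ranked search results (assumed data, for illustration).
        List<String> hits = Arrays.asList(
                "doc0", "doc1", "doc2", "doc3", "doc4", "doc5", "doc6");
        System.out.println(pageOf(hits, 2, 3)); // [doc3, doc4, doc5]
        System.out.println(pageOf(hits, 3, 3)); // [doc6]
    }
}
```

With a real index, the same bounds would be applied to the hit array returned by the searcher. For deep pages, re-running the query for ever larger result sets becomes expensive, so cursor-style paging (Lucene provides `IndexSearcher.searchAfter`) may be a better fit.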
Chinese application:
Chinese word segmentation: smartcn
Search result highlighting
================================================
Source code:
Indexer.java builds the index files;
Searcher.java runs the example searches;