Elasticsearch formally introduced
- Elasticsearch function, doing
- Elasticsearch of application scenarios, can play a role in any place
- Elasticsearch characteristics with other similar things in different places where
Function, doing
-
Distributed search engine and data analysis engine
Search: In Baidu, the site of the station search, retrieval of IT systems
Data analysis: Electric's Web site, the last seven business days before this commodity toothpaste sales of the top 10 of what; news sites, one month before the recent visits rankings news section 3 of which are
Distributed, search, data analysis
-
Full-text search, structured retrieval, data analysis
Full Text Search: I would like to find the product name containing toothpaste goods,
select * from products where product_name like "%牙膏%"
Structured Search: I want to find a product classified as of supplies of goods which are,
select * from products where category_id='日化用品'
Partial matching, auto-complete, search for error correction, searching recommendation
Data Analysis: We analyze the number of products available in each commodity classification,
select category_id,count(*) from products group by category_id
-
Massive data in near real-time processing
Distributed: ES huge amounts of data can be automatically distributed to multiple servers to store and retrieve up
The sea of data processing: distributed later, you can use a large number of servers to store and retrieve data, naturally handle huge amounts of data can be achieved
Near real-time: retrieving data takes 1 hour (this is not a near real-time, off-line batch processing, batch-processing); data search and analysis at the second level
With distributed / mass data opposite: lucene, stand-alone applications, can only be used on a single server can only handle a single server can handle the amount of data
Applicable scene, can play a role in any place
foreign
-
Wikipedia, similar to Baidu Encyclopedia, toothpaste, toothpaste Wikipedia, full-text search, highlighting, search Recommended
-
The Guardian (foreign news sites)
Similarly Sohu news, log user behavior (click, browse, favorites, comments) + social network data (related to certain views on the news), data analysis, to the author of each news article, let him know that his article public feedback (good, bad, hot, junk, contempt, worship)
-
Stack Overflow (abroad program exception discussion forum)
IT problems, program error, submit up, someone will answer and discuss with you, full text search, search-related questions and answers, program error, it will paste the error message to go inside, there is no corresponding search of answers
-
GitHub (open source management), search of billions of rows of code
-
Electricity supplier site, search for a product
-
Log data analysis, logstash acquires log, ES complex data analysis (ELK technology, elasticsearch + logstash + kibana)
-
Commodity price monitoring website
Users set the threshold price of a commodity, when below this threshold, a notification message is sent to the user, for example, subscribe to monitor toothpaste, Colgate toothpaste if the Family set below 50 dollars to notice me, I would buy
-
BI system, Business Intelligence, Business Intelligence.
For example, there is a large shopping Group, BI, analyze the composition constitutes a certain amount of consumption of the users of regional trends last three years as well as user groups, output related to the number of reports, xxx area, the last three years, showing the amount of annual consumption 100% growth, and user groups 85% senior white-collar, open a new mall. ES perform data analysis and mining, Kibana data visualization
domestic
Site Search (electricity supplier, recruitment, portals, etc.), IT systems search (OA, CRM, ERP, etc.), data analysis (ES popular use of a scene)
Feature
-
It can be used as a large-scale distributed clusters (hundreds of servers) technology
PB-level data processing, services large companies; you can also run on a single machine, a small service company
-
Elasticsearch not a new technology
Mainly to full-text search, and distributed data analysis techniques, combined together, before the formation of a unique ES;
lucene (full-text search), commercial data analysis software (is there), distributed database (mycat)
-
For users, it is out of the box
Very simple, as small and medium sized applications directly deployed about three minutes ES, as the system can be used in a production environment, the amount of data, the operation is not too complicated
-
Database functionality in many areas of the face is not enough (the transaction, as well as a variety of online transaction type of operation)
Special features, such as full-text search, a synonym, relevance ranking, complex data analysis, near-real-time processing of massive data; Elasticsearch as a complement to traditional database, the database provides a lot of features that are not can not provide
GIHUB:https://github.com/zq99299/note-book/blob/master/docs/elasticsearch-core/index.md