ElasticSearch的Ingest节点 - 代码天地

ElasticSearch的Ingest节点

其他 2018-05-12 07:37:10 阅读次数: 3

ElasticSearch的ingest节点用来在真正对文档进行索引之前做预处理。

所有的节点都是默认支持ingest的，任何节点都可以处理ingest请求，也可以创建一个专门的Ingest nodes。可以通过在elasticsearch.yml文件中添加如下配置关闭节点上的ingest功能：

node.ingest: false

为了在真正对文档进行索引之前对文件进行预处理，通过定义包含了多个process的pipeline来实现。每个process实现了对文档的某种转换，如移除某个字段，重命名某个字段等。

要使用某个pipeline，只需要在请求中简单的指定pipeline的id就可以了：

PUT my-index/_doc/my-id?pipeline=my_pipeline_id
{
  "foo": "bar"
}

可以通过ingest API来定义pipeline

PUT _ingest/pipeline/my-pipeline-id
{
  "description" : "describe pipeline",
  "processors" : [
    {
      "set" : {
        "field": "foo",
        "value": "bar"
      }
    }
  ]
}

其他pipeline操作，simulate是指对请求的文档进行同时操作

GET _ingest/pipeline/my-pipeline-id

DELETE _ingest/pipeline/my-pipeline-id

//对下面的dcos进行pipeline操作，pipeline是该simulate请求里面提供的
POST _ingest/pipeline/_simulate
{
  "pipeline" : {
    // pipeline definition here
  },
  "docs" : [
    { "_source": {/** first document **/} },
    { "_source": {/** second document **/} },
    // ...
  ]
}

//对下面的dcos进行pipeline操作，pipeline是已经存在的
POST _ingest/pipeline/my-pipeline-id/_simulate
{
  "docs" : [
    { "_source": {/** first document **/} },
    { "_source": {/** second document **/} },
    // ...
  ]
}

pipeline里面主要包含2部分，一部分是描述，另外就是process。

process有多种： append, Convert ,Data, Data Index Name, Fail,Foreach,Grok,Gsub,Join,Json,KV, Lowercase, Remove, Rename, Script,Set,Split,Sort,Trim, Uppercase , Dot Expander, URL Decode, 用户也可以定制自己的process，但定制的process需要安装到所有节点上。

猜你喜欢

转载自my.oschina.net/u/2449787/blog/1635255

ElasticSearch的Ingest节点

【Elasticsearch】es Ingest 节点

Elasticsearch的ETL利器——Ingest节点

Elasticsearch Ingest-Attachment

Elasticsearch：language ingest processor

Elasticsearch: Ingest pipelines学习

Elasticsearch：language ingest processor - 7.6

Elasticsearch：使用 Elasticsearch ingest pipeline 丰富数据

elasticsearch ingest-attachment 对于 word、pdf等文件内容的索引

Elasticsearch：使用 ingest pipeline 来管理索引名称

Elasticsearch核心技术与实战学习笔记 52 | Ingest Pipeline & Painless Script

centos docker 安装elasticsearch、ik分词器、ingest-attachment

使用Elasticsearch进行word，excel，PDF的全文检索 windows实现超完整（ingest-attachment实现）

【ElasticSearch】ElasticSearch 单个节点监控

ElasticSearch 节点管理

elasticsearch 重启节点

elasticsearch之节点重启

Elasticsearch - 安装（单节点）

elasticsearch 节点重启问题

Elasticsearch 节点发现

四、ElasticSearch节点启动

ElasticSearch集群节点详解

ElasticSearch--节点的类型

ElasticSearch 2 的节点调优（ElasticSearch性能）

【elasticsearch】elasticsearch集群更换节点操作

ElasticSearch单节点部署步骤

elasticsearch如何安全重启节点

Elasticsearch - 单机多节点集群

elasticsearch 内存溢出,节点崩溃

elasticsearch简述与单节点安装

今日推荐

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

“百模大战”必有一战 | 2024中国“百模大战”竞争格局分析

最强开源大模型 Llama 3 上架 Gitee AI

虽然老乡鸡开源的不是代码，但背后的原因却让人很暖心

富文本编辑器 Quill 2.0 重磅发布，特性、可靠性与开发者体验大幅提升

周排行

SVN同步出现问题

解决 nginx 出现 413 Request Entity Too Large 的问题

第一节区块链服务BaaS的总体架构以及基本模块设计的一种方案

ITeye 2013年度盘点——社区赠书书单

IDEA / git 和github 的新手使用教程史上最简单的 IntelliJ IDEA 教程史上最简单的 GitHub 教程

测试工程方法：测试用例设计综合策略

Spark优化(三)：对多次使用的RDD进行持久化

使用STM32 ST-LINK Utility 设置读保护后不能运行

exgcd 解同余方程ax=b(%n)

Android使用脚本进行多渠道打包

每日归档

更多

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)

2024-04-17(5)

2024-04-16(70)

2024-04-15(42)

2024-04-14(0)

2024-04-13(119)