Connecting Kettle to a Hadoop Big Data Cluster

  Kettle provides a connection configuration module for Hadoop clusters. Before configuring a "Hadoop cluster" connection, some preparation is required: copy the relevant configuration files from the cluster into the corresponding directory under Kettle, replacing the files that are already there.

  1. Required configuration files: core-site.xml, hdfs-site.xml, yarn-site.xml, mapred-site.xml, hbase-site.xml, and hive-site.xml. The first four are required to connect to Hadoop; the last two depend on specific requirements, and if they are needed they must also be copied from the cluster.
  2. File storage path: E:\kettle-8.2\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\cdh514. The last level of this directory depends on the big data platform in use; simply replace the files in this directory with the configuration files copied from the cluster (see the directory sketch after this list).
  3. Then click "Tools" in the upper-left corner of Kettle -> "Hadoop Distribution" -> select the type that matches the server's big data platform, as shown in the figure below.
[Figure: Hadoop Distribution selection dialog]
  4. After the preparation work is complete, right-click "Hadoop cluster" -> "New". The specific configuration values are determined by the software running in the cluster, as shown in the figure below.
[Figure: Hadoop cluster connection configuration dialog]
  5. After the configuration is complete, click "Test". If the result shown in the figure below appears, the connection is working.
[Figure: successful connection test results]
Each service configured according to the actual setup of the big data platform is now connected (provided those services are all running). Once the connection is configured, it can be selected directly by the relevant components in subsequent steps; a standalone connectivity check is sketched after this list.
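
For reference, the target layout described in steps 1 and 2 might look like the following; the cdh514 folder name will differ for other platforms:

```
E:\kettle-8.2\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\cdh514\
├── core-site.xml      (required)
├── hdfs-site.xml      (required)
├── yarn-site.xml      (required)
├── mapred-site.xml    (required)
├── hbase-site.xml     (only if HBase is needed)
└── hive-site.xml      (only if Hive is needed)
```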
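
As an optional sanity check independent of Kettle, the same site files can be loaded with the Hadoop client API to confirm that HDFS is reachable. This is a minimal sketch, assuming the Hadoop client libraries (matching the cluster version) are on the classpath and that core-site.xml and hdfs-site.xml sit in the working directory; the class name is chosen here for illustration:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

// Hypothetical helper class; assumes core-site.xml and hdfs-site.xml
// (the same files copied into the Kettle plugin directory) are available locally.
public class HdfsConnectionCheck {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Load the cluster's site files explicitly instead of relying on the classpath.
        conf.addResource(new Path("core-site.xml"));
        conf.addResource(new Path("hdfs-site.xml"));

        // FileSystem.get() connects using fs.defaultFS from core-site.xml.
        try (FileSystem fs = FileSystem.get(conf)) {
            System.out.println("Connected to: " + conf.get("fs.defaultFS"));
            // Listing the root directory proves the NameNode is reachable.
            for (FileStatus status : fs.listStatus(new Path("/"))) {
                System.out.println(status.getPath());
            }
        }
    }
}
```

If this listing succeeds but the test in Kettle fails, the problem is more likely in the distribution selection or the connection dialog than in the site files themselves.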

Source: blog.csdn.net/AnameJL/article/details/115315437