Using the Hadoop distributed cache

Introduction

DistributedCache is a mechanism provided by the Hadoop framework. Before a job runs, it distributes the files the job specifies to the machines where the job's tasks will execute, and it manages those cached files for the duration of the job.
The cached content lives in files on HDFS; each node fetches its local copy from the file's HDFS path.

Steps for usage

1. Add files to the distributed cache

First define the cache path:

String cacheFile = "hdfs://xxxx";

You can attach an alias with "#"; the part after "#" is the name the file will be visible under in the task's working directory:

cacheFile = cacheFile + "#alias";

Then register the file with the job in the main (driver) method, after which it can be used in the map phase:

// Cache a jar on the classpath of the task's node
job.addArchiveToClassPath(archive);
// Cache an ordinary file on the classpath of the task's node
job.addFileToClassPath(file);
// Cache an archive (compressed file) in the task node's working directory
job.addCacheArchive(uri);
// Cache an ordinary file in the task node's working directory
job.addCacheFile(uri);
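The "#alias" travels as the fragment of the cache file's URI; Hadoop creates a symlink with that name in the task's working directory. A minimal stdlib-only sketch of how the local link name follows from the URI (the HDFS host and paths here are made up for illustration):

```java
import java.net.URI;
import java.net.URISyntaxException;

public class CacheUriDemo {
    // The local link name for a cache URI: the fragment after '#'
    // if present, otherwise the file name itself.
    static String linkName(URI uri) {
        if (uri.getFragment() != null) {
            return uri.getFragment();
        }
        String path = uri.getPath();
        return path.substring(path.lastIndexOf('/') + 1);
    }

    public static void main(String[] args) throws URISyntaxException {
        URI plain = new URI("hdfs://nn:8020/data/dict.txt");
        URI aliased = new URI("hdfs://nn:8020/data/dict.txt#dict");
        System.out.println(linkName(plain));   // dict.txt
        System.out.println(linkName(aliased)); // dict
    }
}
```

So with the alias set, the mapper can open the file simply by the short name, regardless of its full HDFS path.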

2. Read from the distributed cache

  @Override
  protected void setup(Mapper<LongWritable, Text, Text, Text>.Context context)
          throws IOException, InterruptedException {
      super.setup(context);
      // Read the cached file's contents directly via its alias, which
      // Hadoop symlinks into the task's working directory.
      BufferedReader br = new BufferedReader(new FileReader("alias"));
      // ... read lines from br here, then close it ...
      br.close();
  }
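A common pattern is to load the cached file into an in-memory map in setup() and look entries up during map(). The sketch below mirrors that read logic using only the standard library, with a temp file standing in for the "alias" symlink and a hypothetical tab-separated dictionary format:

```java
import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Map;

public class CacheReadDemo {
    // Load a tab-separated dictionary file into a map, mirroring what
    // a Mapper's setup() would do with the cached alias file.
    static Map<String, String> loadDict(String path) throws IOException {
        Map<String, String> dict = new HashMap<>();
        try (BufferedReader br = new BufferedReader(new FileReader(path))) {
            String line;
            while ((line = br.readLine()) != null) {
                String[] parts = line.split("\t", 2);
                if (parts.length == 2) {
                    dict.put(parts[0], parts[1]);
                }
            }
        }
        return dict;
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for the "alias" symlink in the task working directory.
        Path f = Files.createTempFile("dict", ".txt");
        Files.write(f, "1001\tBeijing\n1002\tShanghai\n".getBytes());
        Map<String, String> dict = loadDict(f.toString());
        System.out.println(dict.get("1001")); // Beijing
    }
}
```

Loading once in setup() rather than per record keeps the lookup cost out of the inner map() loop, which is the main reason to use the cache for small side tables.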

Origin blog.csdn.net/sc9018181134/article/details/104054235