Understanding InputFormat

Programming Languages, 2018-05-11 18:33:31

The first step of a MapReduce job is to read from the file system and parse the input into individual key/value pairs.

This is the job of the InputFormat subclasses. InputFormat has two core abstract methods: getSplits and createRecordReader.

1> getSplits: the Javadoc says "split the set of input files for the job. Each {@link InputSplit} is then assigned to an individual {@link Mapper} for processing". In other words, it reads the input files and divides the raw data into InputSplits.

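Each InputSplit is handled by one map process. Reading the source of the getSplits method in FileInputFormat shows that:

one file can be divided into one or more InputSplits;

more generally, with the default configuration the split size equals the block size, so a file that occupies N blocks yields roughly N InputSplits, and therefore the same number of map tasks.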
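
The split arithmetic described above can be sketched in plain Java, without any Hadoop dependency. The formula max(minSize, min(maxSize, blockSize)) and the 1.1 slack factor follow FileInputFormat; the class name SplitSketch, the 128 MB block size, and the 300 MB file are assumptions chosen for illustration, and the real method also deals with block locations and non-splittable files, which this sketch leaves out.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-alone sketch of the split-size logic; not Hadoop code.
class SplitSketch {
    // The last split may be up to 10% larger than splitSize.
    static final double SPLIT_SLOP = 1.1;

    // With default min/max settings this returns the block size.
    static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    // Returns {offset, length} pairs covering one file of the given length.
    static List<long[]> getSplits(long fileLength, long splitSize) {
        List<long[]> splits = new ArrayList<>();
        long remaining = fileLength;
        while ((double) remaining / splitSize > SPLIT_SLOP) {
            splits.add(new long[] {fileLength - remaining, splitSize});
            remaining -= splitSize;
        }
        if (remaining != 0) {
            // Whatever is left becomes the final split.
            splits.add(new long[] {fileLength - remaining, remaining});
        }
        return splits;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;
        // Assumed values: 128 MB blocks, minSize 1, maxSize Long.MAX_VALUE.
        long splitSize = computeSplitSize(128 * mb, 1L, Long.MAX_VALUE);
        for (long[] s : getSplits(300 * mb, splitSize)) {
            System.out.println("offset=" + s[0] / mb + "MB length=" + s[1] / mb + "MB");
        }
    }
}
```

For the assumed 300 MB file this yields three splits (128 MB, 128 MB, 44 MB), one per block. A 130 MB file would yield a single split, because of the slack factor, which is why the "one split per block" rule is only approximate.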

2> createRecordReader: creates the reader that parses an InputSplit into key/value pairs.
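
To make the key/value parsing concrete, here is a minimal sketch in plain Java of what a line-oriented record reader produces: the key is the byte offset where a line starts and the value is the line's text. The names LineRecordSketch, Record, and readRecords are hypothetical. The sketch only handles '\n' line endings and ignores lines that straddle split boundaries, which a real record reader has to handle.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-alone sketch; it does not use the Hadoop API.
class LineRecordSketch {
    // One record: key = byte offset of the line start, value = line text.
    static final class Record {
        final long key;
        final String value;

        Record(long key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    // Turns the bytes of one split into (offset, line) records.
    static List<Record> readRecords(byte[] data) {
        List<Record> records = new ArrayList<>();
        int lineStart = 0;
        for (int i = 0; i <= data.length; i++) {
            boolean atEnd = (i == data.length);
            // Emit a record at each newline, and at the end of the data
            // if a final line without a trailing newline is pending.
            if (atEnd ? lineStart < i : data[i] == '\n') {
                String line = new String(data, lineStart, i - lineStart, StandardCharsets.UTF_8);
                records.add(new Record(lineStart, line));
                lineStart = i + 1;
            }
        }
        return records;
    }

    public static void main(String[] args) {
        byte[] data = "hello\nworld\n".getBytes(StandardCharsets.UTF_8);
        for (Record r : readRecords(data)) {
            System.out.println(r.key + "\t" + r.value);
        }
    }
}
```

For the input "hello\nworld\n" the sketch emits the records (0, "hello") and (6, "world"). The long key and string value here play the roles that LongWritable and Text play in a Hadoop job.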

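A map task is static (the unit of work defined for a split), whereas a map process is dynamic (the running instance that executes a task).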

Why are the Mapper's input types k1, v1 LongWritable and Text?

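Because the job's default input format class, TextInputFormat, fixes its type parameters as <LongWritable, Text>: the key is the byte offset of a line within the file, and the value is the content of that line.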

Reposted from liyunqiangyq.iteye.com/blog/2200379