Hadoop案例:数据压缩

在Driver类中添加以下代码即可:

1.在mapper输出端压缩

 Configuration conf = new Configuration();
 Job job = Job.getInstance(conf);
// 开启 map 端输出压缩
conf.setBoolean("mapreduce.map.output.compress", true);
// 设置 map 端输出压缩方式
conf.setClass("mapreduce.map.output.compress.codec", 
BZip2Codec.class,CompressionCodec.class);

2.在reducer输出端压缩 

// 设置 reduce 端输出压缩开启
FileOutputFormat.setCompressOutput(job, true);
// 设置压缩的方式
 FileOutputFormat.setOutputCompressorClass(job, BZip2Codec.class); 

Guess you like

Origin blog.csdn.net/baidu_41833099/article/details/121798680