Data skew issues (1): data skew in MapReduce & Hive

A summary of data skew
http://www.alidata.org/archives/2109

A brief analysis of data skew in Hadoop
http://my.oschina.net/leejun2005/blog/100922

http://blog.csdn.net/longshenlmj/article/details/17304437
[Expert translation series] Hadoop (14) MapReduce performance tuning: reducing the performance cost of data skew
http://www.cnblogs.com/datacloud/p/3601624.html?utm_source=tuicool

Hive optimization methods and usage tips
http://blog.csdn.net/wisgood/article/details/17301843

An analysis of the scenarios where MAPJOIN can be used in Hive
http://blog.csdn.net/jiedushi/article/details/7662819
Parameters related to data skew in Hive's hive-default.xml configuration file:
<property>
  <name>hive.map.aggr</name>
  <value>true</value>
  <description>Whether to use map-side aggregation in Hive Group By queries</description>
</property>

<property>
  <name>hive.groupby.skewindata</name>
  <value>false</value>
  <description>Whether there is skew in data to optimize group by queries</description>
</property>
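
The same two parameters can usually be overridden per session instead of editing the configuration file. The following is only a minimal sketch: the table `page_views` and the column `user_id` are hypothetical names used for illustration and do not come from the original post. With `hive.map.aggr=true`, Hive performs partial aggregation on the map side; with `hive.groupby.skewindata=true`, the GROUP BY is planned as two MapReduce jobs, where the first spreads rows randomly across reducers for partial aggregation and the second merges the partial results by the actual group-by key.

```sql
-- Session-level overrides of the two parameters shown above.
SET hive.map.aggr=true;            -- partial aggregation on the map side
SET hive.groupby.skewindata=true;  -- two-stage GROUP BY for skewed keys

-- Hypothetical query: page_views and user_id are assumed names.
SELECT user_id, COUNT(*) AS cnt
FROM page_views
GROUP BY user_id;
```

Whether the extra job pays off depends on how skewed the keys really are, so it is worth comparing run times with the setting on and off.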

Reposted from niub.iteye.com/blog/2192035