http://www.alidata.org/archives/2109
A Brief Analysis of Data Skew in Hadoop
http://my.oschina.net/leejun2005/blog/100922
http://blog.csdn.net/longshenlmj/article/details/17304437
[Expert Translation Series] Hadoop (14) MapReduce Performance Tuning: Reducing the Performance Cost of Data Skew
http://www.cnblogs.com/datacloud/p/3601624.html?utm_source=tuicool
Hive Optimization Methods and Usage Tips
http://blog.csdn.net/wisgood/article/details/17301843
An Analysis of the Scenarios Where MAPJOIN Can Be Used in Hive
http://blog.csdn.net/jiedushi/article/details/7662819
Data-skew-related parameter settings in Hive's hive-default.xml configuration file:
<property>
  <name>hive.map.aggr</name>
  <value>true</value>
  <description>Whether to use map-side aggregation in Hive Group By queries</description>
</property>
<property>
  <name>hive.groupby.skewindata</name>
  <value>false</value>
  <description>Whether there is skew in data to optimize group by queries</description>
</property>
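To make the effect of hive.groupby.skewindata more concrete, here is a rough Python sketch of the two-stage aggregation that Hive is generally described as using when the flag is set to true: rows are first scattered randomly across reducers for partial aggregation, and the partial results are then regrouped by key. This is a toy simulation, not Hive code; the function name `two_stage_count`, the reducer count, and the sample keys are all assumptions made for illustration.

```python
import random
from collections import Counter


def two_stage_count(keys, num_reducers=4, seed=0):
    """Simulate a skew-tolerant GROUP BY key / COUNT(*) in two stages.

    Hypothetical helper for illustration only; it does not call Hive.
    Returns the final counts and the number of rows each stage-1 reducer saw.
    """
    rng = random.Random(seed)

    # Stage 1: distribute rows randomly, so a hot key is spread over
    # all reducers instead of landing on a single one.
    partials = [Counter() for _ in range(num_reducers)]
    for key in keys:
        partials[rng.randrange(num_reducers)][key] += 1

    # Stage 2: route the (much smaller) partial counts by key hash,
    # so each key is finalized on exactly one reducer.
    finals = [Counter() for _ in range(num_reducers)]
    for partial in partials:
        for key, count in partial.items():
            finals[hash(key) % num_reducers][key] += count

    # Collect the per-reducer results into one answer.
    merged = Counter()
    for final in finals:
        merged.update(final)

    stage1_load = [sum(partial.values()) for partial in partials]
    return merged, stage1_load


# One hot key dominates the input, as in a typical skewed GROUP BY.
sample_keys = ["hot"] * 9000 + ["a", "b", "c"] * 100
counts, load = two_stage_count(sample_keys)
print(counts["hot"], load)
```

With a single-stage plan, every "hot" row would be sent to the same reducer. In the sketch, the first stage spreads those rows roughly evenly, and the second stage only has to merge a handful of partial counts per key. The price is an extra job, which is presumably why the parameter defaults to false.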