[Technology] big data applications work twelve | Hadoop Comprehensive job

The requirements of the job from: https://edu.cnblogs.com/campus/gzcc/GZCC-16SE2/homework/3339

Foreword

This work is carried out on the basis of "reptile big job" on the "reptile big job", the data I mainly pull hook net python job recruitment information crawling, finally got the 2641 data there is a name as lagoupy.xls in. The task of the job mainly in the following five points:

1. Upload a csv file operations generated large reptiles to HDFS

2. CSV file is a text file pre-generate Untitled

3. hdfs text file to import into the final of the Hive data warehouse

4. view and analyze data in the Hive

5. Hive on large reptiles operations generated data analysis, write a blog describing your analysis and analytical results. (Analysis of more than 10)

1. Upload a csv file operations generated large reptiles to HDFS

 

2. CSV file is a text file pre-generate Untitled

 

3. hdfs text file to import into the final of the Hive data warehouse

 

4. view and analyze data in the Hive

 

5. Hive on large reptiles operations generated data analysis, write a blog describing your analysis and analytical results. (Analysis of more than 10)

 

Guess you like

Origin www.cnblogs.com/bhuan/p/11002102.html
Recommended