Hadoop: data cleaning test

Test requirements:

1. Data cleaning: clean the data as required, then import the cleaned data into a Hive database.
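The cleaning step can be sketched roughly as follows. The post does not show the actual data layout or cleaning rules, so everything here is an assumption for illustration: comma-separated input, a fixed field count, the hypothetical helpers `clean_rows` and `to_tsv`, and tab-separated output meant for a Hive table declared with `FIELDS TERMINATED BY '\t'`.

```python
import csv
import io


def clean_rows(raw_text, expected_fields):
    """Yield cleaned rows: fields trimmed, malformed or incomplete rows dropped.

    Hypothetical helper; the real rules depend on the assignment's data.
    """
    for row in csv.reader(io.StringIO(raw_text)):
        fields = [f.strip() for f in row]
        # Skip rows with the wrong number of fields or with any blank field.
        if len(fields) != expected_fields or any(f == "" for f in fields):
            continue
        yield fields


def to_tsv(rows):
    """Join cleaned rows as tab-separated lines (assumed Hive table delimiter)."""
    return "\n".join("\t".join(r) for r in rows)


# Made-up sample input: one good row, one short row, one row with a blank field.
sample = "1, alice ,10\n2,bob\n3,,7\n4,carol,12\n"
cleaned = to_tsv(clean_rows(sample, expected_fields=3))
print(cleaned)
```

The resulting text could then be written to a file and loaded into Hive, provided the table's declared delimiter matches the one used here.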

Before cleaning:

 

After cleaning:

 

Data uploaded to Hive:

 

It is very easy to go wrong with Hive right at the start. The version I installed was either too new or too old, and only after uninstalling and reinstalling several times did it finally work. MySQL was the same: the jar package did not match at first, and it ran properly only after a matching one was installed. The username and password in the hive-site configuration file are the same username and password used to connect to the local database from Navicat. The first part is done!
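To check which credentials Hive uses for its MySQL metastore, the relevant entries can be read out of the configuration file. This is a minimal sketch: the `javax.jdo.option.*` property names are the standard Hive metastore settings, but the URL, username, and password values below are placeholders rather than the author's actual configuration, and `metastore_settings` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Placeholder hive-site.xml fragment; substitute the real file's contents.
HIVE_SITE = """<configuration>
  <property>
    <name>javax.jdo.option.ConnectionURL</name>
    <value>jdbc:mysql://localhost:3306/hive</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionUserName</name>
    <value>hive_user</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionPassword</name>
    <value>hive_password</value>
  </property>
</configuration>"""


def metastore_settings(xml_text):
    """Return the javax.jdo.option.* properties, keyed by their short names."""
    root = ET.fromstring(xml_text)
    prefix = "javax.jdo.option."
    settings = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name and name.startswith(prefix):
            settings[name[len(prefix):]] = prop.findtext("value")
    return settings


settings = metastore_settings(HIVE_SITE)
# These two values should match the login used in Navicat.
print(settings["ConnectionUserName"], settings["ConnectionPassword"])
```

If the values printed here differ from the account that works in Navicat, Hive will likely fail to connect to the metastore database.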

 

 


Origin www.cnblogs.com/zmh-980509/p/11854411.html