A Simple Explanation for Big Data Engineers

The data engineer builds the Hadoop ecosystem under the Linux system (cloudera is the biggest exporter similar to the Linux Red Hat), stores the user's transaction or behavior information through HDFS (distributed file system) and other user data files, and then stores the user data files through Hbase. An engineer who stores data (similar to NoSQL), calculates data through Mapreduce (parallel computing framework), analyzes data through hiv or pig (data analysis platform), and finally reproduces data according to user needs. 

Guess you like

Origin http://43.154.161.224:23101/article/api/json?id=325788486&siteId=291194637