Java development infrastructure of big data base

Big Data to a lot of people feel that the highly specialized, complicated operation, a technical "tall" in. Big data talent shortage, some people see opportunity to bring big data, big data would like to learn through professional training, so much data from zero to learn what content it

 

First, the basic 0 learning big data requires Java foundation

Java: one of the most developed demand programming language, can engage in web development, desktop application design, game development, Android development background, full-stack development. It can be said that most big data base programming language, one is the nature of big data is to calculate the mass data storage and query, background development is easily accessible to application scenarios large amount of data access; the second is the natural advantages of Java, many big data components are written in Java.

White want to learn a large zero-based data, you need to learn to start from the basics of Java, the Java language can be used as the first introductory language. In general, learn JavaSE on it, but it can grasp JavaEE would be even nicer.

JavaSE: variables, loops, if like; object-oriented; I / O input and output (HDFS will be used); a reflective, generic, MR query; JDBC: operating a relational database (RDBMS) (HIVE will be used).

 

Second, the large data need to learn Linux system administration, Shell programming design, Maven

Linux is a free to use and free dissemination of Unix-like operating system, is a POSIX and UNIX-based multi-user, multi-tasking, multi-threading and multi-CPU support for the operating system. Shell is a program written in C, it is the user to use Linux bridge; Shell is both a command language is a programming language. Shell is a command interpreter, the command interpretation and execution of the program entered by the user. Maven is a project management tool, you can build information management through a short description of the project, project management reports and documentation. If you are interested in big data development, want the system to learn big data , you can join the big data exchange technology to learn buttoned Junyang: 522 189 307 , welcome to add, to understand course descriptions, access to learning resources.

Big Data technologies tend to run in a Linux environment, large distributed cluster data (Hadoop, Spark) are built on multiple Linux systems, execute commands on a cluster of Linux are in a terminal window type ...... think data related work in large, need to have Linux operating systems, and command.

Linux: Linux installation, CentOS, the directory structure (file type, permissions, user commands), Linux commands, basic operations -SQL basis: Select (focus) insertupdatedelete (Hive, SparkSQL will be used) and so on; the ultimate goal of big data: Use SQL analysis of large data.

Shell: variables, IF conditional statement, the shell functions, Case conditional statements, and the while loop until type loops, for loops and select cycle, cycle control and status conditions and return values, and so an array of the shell.

Maven: Maven large data automated build , deployment tools, including installation and configuration deployment, Maven repository, MavenPOM and so on. Garmisch Valley, Big Data training institutions.

Published 123 original articles · won praise 0 · Views 4925

Guess you like

Origin blog.csdn.net/mnbvxiaoxin/article/details/104227131