What Big Data platform? What are the features? How to build a big data platform?

Big Data platform is designed to meet enterprise requirements for data generated.
Big Data Platform:

It refers to the massive data storage, computing and continuous real-time streaming data computing scenario-based set of infrastructure. A typical series including Hadoop, Spark, Storm, Flink and Flume / Kafka and other clusters.

Can be used both open-source platform can also be used Huawei, star rings and other commercial grade solution that can be deployed on private cloud, it can also be deployed in the public cloud.
What Big Data platform?  What are the features?  How to build a big data platform?

Big data platform features:

1, mass data receiving

Using the storage and computing power of the computer cluster. Not only it has expanded in performance, and its ability to handle large streams of data into a corresponding increase.

2, fast

Column binding database schema (relative to non-traditional parallel processing based database rows) large-scale parallel processing techniques and the use, not only significantly improve the performance (typically about 100 to 1000), it may also be implemented in the lower and more transparent pricing .

In the process of getting started big data have met learning, industry, the lack of systematic learning path, learning systems planning, you are welcome to join my big learning data exchange skirt: 251 956 502, skirt documents have my years of study manual sorting of large data , development tools, PDF document with a book, you can download yourself.

3, compatible with traditional tools

Ensure that the platform has been certified to be compatible with traditional tools.

4, use Hadoop

Hadoop big data has become a major platform in the field. Hadoop use as a high durability and effective platform for lightweight data management.

5, support for data scientists

Scientists have data in enterprise IT in the higher influence and importance of fast, efficient, easy to use and widely deployed platform for big data can help narrow the distance between business people and technical experts.

6, provides data analysis functions

To ensure that not only supports large data platform ready in a few seconds and loads the data, also supported the establishment of forecasting model uses advanced algorithms, easy deployment model for in-database scoring. While allowing data scientists to use existing statistical packages and preferred language.

Better platform for Big Data:

There cloud Ali, Tencent, Baidu, Huawei and star ring.

Ali cloud big data platform more technical, more complete product;

Tencent partial analysis of large data products, products and solutions below normal;

Baidu large data products also relatively complete, in addition to many biased marketing solutions;

Huawei's products optimized solutions based on the needs of industrial customers;

Star ring is very characteristic of the product, but the research and development capabilities and market weak.

How to build a big data analytics platform?

General steps:

1, Linux system installation

2, a distributed computing platform / mounting assembly

Most of the current use of the distributed system is open source Hadoop series

3, data import

Data Import Tool Sqoop

4, data analysis

Data analysis generally comprises two phases: preprocessing data analysis and data modeling.

This process data preprocessing might use Hive SQL, Spark QL and Impala.

Analysis of data modeling is best to use Spark

5, and outputs the result visualization API

Visualization of results for display by the general formula or part of the original data.

Guess you like

Origin blog.51cto.com/14296550/2427713