September 17, 2019

Four features of big data:

1, the mass of

For example, IDC's recent report predicted that by 2020, global data volume will expand 50 times. Currently, large-scale data still is an indicator of changing the size range of a single set of data from tens to several TB PB. In short, data storage 1 PB will require twenty thousand PC with 50GB hard drive. In addition, various sources can produce unexpected data.

2, diversity

Increase the diversity of the data it was mainly due to new multi-structured data, and include network logs, social media, Internet search, phone call records and sensor networks, and other data types cause.

3, high speed

It describes a high-speed data is created and the moving speed. In the era of high-speed network, high-speed computer processor-based server and software performance optimization, to create real-time data streams has become a popular trend. Enterprises not only need to know how to quickly create data, you must also know how quickly processed, analyzed and returned to the user, in order to meet their real-time requirements.

4, volatility

Big data has a multilayer structure, which means that the data will show a large and varied forms and types. Compared with traditional business data, big data there is irregular and vague characteristics, resulting in difficult if not impossible to use traditional application software for analysis. Traditional business data format has evolved over time standard, it can be identified standard business intelligence software. At present, companies face the challenge of dealing with complex and tap the value from data presented in various forms.

Big Data three characteristics:

The first feature is a data type variety. Including weblogs, audio, video, pictures, location information, and so many types of data processing capability of the data put forward higher requirements.

The second feature is the relatively low value of density data. With such a wide range of application of things, information perception everywhere, a flood of information, but a lower density value, how to complete the value of the data through powerful machine algorithm more quickly, "purification" is the big data era pressing problem.

The third characteristic is the processing speed, high timeliness requirements. This is distinguished from conventional large data mining data most significant feature.

Guess you like

Origin www.cnblogs.com/hk1830/p/11564220.html