What is big data? An introduction to big data that even a beginner can understand!

Disclaimer: This is an original article by the blogger, released under the CC 4.0 BY-SA license. If you reproduce it, please attach the original source link and this statement.
Original link: https://blog.csdn.net/weixin_43893397/article/details/102696612

1. What is big data?

Literal meaning:

  • Large amounts of data, massive data
  • The data we handle day to day is usually measured in units such as T, G, and M (a song is about 4 MB; 1024 M = 1 G, 1024 G = 1 T). Big data generally deals with data at the PB level and above, which has to be stored, analyzed, and computed on.
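To make the unit arithmetic above concrete, here is a minimal Python sketch. It assumes the binary convention used in the text (each unit is 1024 times the previous one) and the article's rough figure of 4 MB per song; the helper names `to_bytes` and `humanize` are made up for this illustration.

```python
# Units in ascending order; each one is 1024 times the previous (as in the text).
UNITS = ["B", "KB", "MB", "GB", "TB", "PB", "EB", "ZB"]


def to_bytes(value, unit):
    """Convert a value in the given unit to bytes."""
    return value * 1024 ** UNITS.index(unit)


def humanize(num_bytes):
    """Format a byte count using the largest unit that keeps the value below 1024."""
    value = float(num_bytes)
    for unit in UNITS:
        if value < 1024 or unit == UNITS[-1]:
            return f"{value:.1f} {unit}"
        value /= 1024


song = to_bytes(4, "MB")                  # the article's example song size
songs_per_pb = to_bytes(1, "PB") // song  # how many such songs fit in 1 PB

print(f"1 PB holds about {songs_per_pb:,} songs of 4 MB each")
print(humanize(to_bytes(200, "PB")))
```

Under these assumptions a single petabyte holds a few hundred million songs, which gives a feel for why PB-scale data needs different tooling than a personal hard drive.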

Professional explanation:

  • Data sets whose size far exceeds the processing capability of existing common database software and tools

A more professional definition:

  • Big data refers to data sets that cannot be captured, managed, and processed with conventional software tools within a given time frame. They are massive, fast-growing, and diverse information assets that require new processing models in order to deliver stronger decision-making power, insight discovery, and process optimization.
  • It mainly addresses two problems: the storage of massive data, and the analysis and computation of massive data

2. The characteristics of big data - 4V (Volume, Velocity, Variety, Value)

2.1 Volume (large amount)

  • Put simply: there is a lot of data
  • To date, all the printed material humanity has produced amounts to about 200 PB, and everything humankind has ever said amounts to about 5 EB. A typical personal computer's hard drive currently has TB-level capacity, while the data held by some large enterprises is approaching the EB level


2.2 Velocity (high speed)

  • Data grows quickly
  • This is the most significant feature that distinguishes big data from traditional data mining. According to IDC's "Digital Universe" report, global data volume was expected to reach 35.2 ZB by 2020. Faced with data on this scale, the efficiency of data processing is the lifeblood of an enterprise
  • Tmall Double 11: turnover exceeded 10 billion yuan within 2 minutes and 5 seconds of opening; total turnover for the day: 213.5 billion yuan
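As a rough back-of-the-envelope check on the Double 11 figures quoted above, the following Python sketch converts them into yuan per second. It treats "over 10 billion yuan" as exactly 10 billion, so the opening rate is only a lower bound, and it assumes the daily total is spread over a full 24 hours.

```python
# Figures quoted in the article (yuan).
opening_turnover = 10_000_000_000   # "over 10 billion", taken here as a lower bound
opening_seconds = 2 * 60 + 5        # 2 minutes and 5 seconds
day_turnover = 213_500_000_000      # total for the whole day
day_seconds = 24 * 60 * 60          # assumption: a full 24-hour day

opening_rate = opening_turnover / opening_seconds  # yuan per second at the opening
day_rate = day_turnover / day_seconds              # average yuan per second over the day
peak_ratio = opening_rate / day_rate               # how much hotter the opening is

print(f"opening: {opening_rate:,.0f} yuan/s")
print(f"whole day: {day_rate:,.0f} yuan/s")
print(f"opening is about {peak_ratio:.1f}x the daily average")
```

The point of the comparison is that the load is far from uniform: a system sized for the daily average would be overwhelmed in the first minutes, which is why processing speed matters so much.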

2.3 Variety (diversity)

  • Structured data, semi-structured data, and unstructured data
  • This diversity means data can be divided into structured and unstructured data. Compared with the structured, mostly text-based data that was convenient to store in traditional databases, there is more and more unstructured data, including web logs, audio, video, pictures, and location information. These many types of data place higher demands on data processing capability
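The three kinds of data listed above can be illustrated with a small Python sketch. All sample values (the order rows, the user record, the log line) are invented for this example, and the web log is treated as unstructured text, following the article's classification.

```python
import csv
import io
import json
import re

# Structured: fixed columns, every row has the same fields (like a database table).
structured = "order_id,amount\n1001,25.50\n1002,13.00\n"
rows = list(csv.DictReader(io.StringIO(structured)))
total = sum(float(row["amount"]) for row in rows)

# Semi-structured: self-describing, but fields can be nested or missing.
semi_structured = '{"user": "alice", "tags": ["music", "video"], "device": {"os": "android"}}'
record = json.loads(semi_structured)
os_name = record.get("device", {}).get("os")

# Unstructured (per the article): a raw web log line with no schema;
# useful fields have to be dug out with pattern matching.
log_line = '203.0.113.7 - - [11/Nov/2018:00:02:05 +0800] "GET /item/42 HTTP/1.1" 200'
match = re.search(r'"(\w+) (\S+) HTTP', log_line)
method, path = match.groups() if match else (None, None)

print(total, os_name, method, path)
```

Each step needs a different tool (a CSV reader, a JSON parser, a regular expression), which is a small-scale version of the "higher demands on data processing capability" mentioned above.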

2.4 Value (low value density)

  • Massive data is highly valuable overall
  • Value density is inversely proportional to the total amount of data. How to quickly "purify" the valuable data out of the mass is a problem that urgently needs solving in the big data era.
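A toy Python sketch of the inverse relationship described above: if the number of valuable records stays fixed while the total grows, the value density shrinks in proportion. The record layout, the `purchase` flag, and the helper names are hypothetical and exist only for this illustration.

```python
def purify(records, is_valuable):
    """Keep only the records considered valuable (the "purification" step)."""
    return [r for r in records if is_valuable(r)]


def value_density(records, is_valuable):
    """Fraction of records that are valuable; 0.0 for an empty data set."""
    if not records:
        return 0.0
    return len(purify(records, is_valuable)) / len(records)


def make_records(total, valuable=100):
    """Build `total` fake click records, of which the first `valuable` led to a purchase."""
    return [{"id": i, "purchase": i < valuable} for i in range(total)]


def is_purchase(record):
    return record["purchase"]


densities = [value_density(make_records(n), is_purchase) for n in (1_000, 10_000, 100_000)]
print(densities)
```

In this sketch the same 100 useful records become ten times harder to find each time the data set grows tenfold, which is why fast filtering matters more as volume increases.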

3. What does big data mainly do?

Main functions:

  • Fast querying of massive data
  • Storage of massive data (large data volumes, large single files)
  • Fast computation over massive data (compared with conventional tools)
  • Real-time computation over massive data (results available immediately)
  • Data mining (uncovering valuable data that had not been discovered before)

4. Big data application scenarios

[Five figures from the original post; images not available]

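5. What are the future prospects for big data?

Big data technology is currently in the early stage of real-world application. Judging from both the development trend of big data itself and the development of the industry, the outlook for big data is good, for the following specific reasons:

  • First: big data itself can create more value. Big data technologies revolve around turning data into value, which will open up a vast market; the key point is that data itself will empower the entire information society. As big data applications are put into practice, the value of big data will gradually become apparent. Big data technology is already fairly widely used in the Internet sector.
  • Second: big data drives the development of science and technology. Its influence is felt not only in the Internet sector but also in finance, education, healthcare, and many other fields. Big data also plays an important role in artificial intelligence research and development, especially in machine learning, computer vision, and natural language processing, and it is becoming the foundation of an intelligent society.
  • Third: a big data industry chain is gradually taking shape. After several years of development, big data has formed a fairly complete industry chain covering data collection, organization, transmission, storage, analysis, presentation, and application. Many companies have joined this chain and it has reached a certain scale; as big data continues to develop, the scale of the related industries should expand further.
  • Fourth: the industrial Internet will drive the adoption of big data. The Internet is currently in transition from the consumer Internet to the industrial Internet, which will use big data, the Internet of Things, artificial intelligence, and other technologies to empower traditional industries. The industrial Internet has a great deal of room to grow, and big data is one of its focal points: whether big data can take root in traditional industries bears on the progress of the industrial Internet. In the industrial Internet stage, therefore, big data will gradually, and inevitably, be put into practice.

From the analysis above, there is still considerable room for growth in the big data field, and the talent gap is currently large, so from an employment perspective, learning big data now is a good choice.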

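6. Summary

What is big data?

  • Literal meaning: large amounts of data, massive data
  • Data sets whose size far exceeds the processing capability of existing common database software and tools

What are the characteristics of big data?

  • Volume: the amount of data is large
  • Variety: structured data, semi-structured data, and unstructured data
  • Velocity: data grows quickly
  • Value: massive data is highly valuable, though its value density is low

What can big data do?

  1. Fast querying of massive data
  2. Storage of massive data (large data volumes, large single files)
  3. Fast computation over massive data (compared with conventional tools)
  4. Real-time computation over massive data (results available immediately)
  5. Data mining (uncovering valuable data that had not been discovered before)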

Big data outlook:

Well, yes, very good!

That's all for this article. See you next time! If you liked it, please give it a like and a follow.
