Introduction [Lecture] big data: Big Data Technology Principles and Applications

 

 

1.1 Big Data era

 

First Wave: before and after 1980, the personal computer, the birth of the company: IBM

Second Wave: before and after 1995, the popularity of the Internet

Third Wave: things, cloud computing, large data

 

Lead to big data era

factor:

Technical Support:

1. Storage: storage capacity is increasing (personal data continues to increase, more and more corporate data)

2. Calculation: CPU processing capacity has increased significantly

3. Network: network bandwidth is increasing

Through three stages: 

The first stage: stage operating systems

The second stage: UGC stage

Phase III: Phase Perception systems (large-scale popularization of the Internet of Things)

 

Big Data development process:

 

 

 

1.2 Big Data concepts and the impact

Characteristics: 4V

 

 

 1. The amount of data:

Big Data Moore's Law: more than 50% growth per year

2. Diversity:

Structured data and unstructured data

3. Fast of:

The processing speed is very fast; many enterprise applications require second-level decision-making

4. Low density value:

 

Impact of big data:

1. The fourth paradigm

The first: Experimental

The second: Theory

Third: computing

Fourth: data

2. In terms of way of thinking, subvert the traditional way of thinking

1) rather than the whole sample sampling

2) rather than the precise efficiency (the first one basis)

3) rather than a causal correlation

 

1.3 Application of Big Data

TV drama Toupai: big data analysis software

Google predict flu: real-time control of citizens search for relevant information

 

1.4 The key technology of large data

Five levels: data collection, (core) data storage and management, data processing and analysis , data privacy and security

 

Two core: distributed storage, distributed processing

 

Distributed storage: Google behalf of the company

 

Big data calculation mode:

Different computing model requires the use of different products

 

1. batch computing :( real time is not good enough)

Representative products: MapReduce, Spark

2. Flow calculation:

Real-time processing, real-time response

3. FIG calculated:

Efficient processing of the data structure of FIG.

4. Analysis calculated query (Query Interactive):

Representatives: Hive, etc.

 

 

 

1.5 big data cloud computing, networking

 

cloud computing:

1. Cloud computing to solve the problem:

Two core: distributed storage, distributed processing

2. Typical features:

Virtualization, multi-user

3. The concept:

As a service to provide very low-cost IT resources to users over the network

4. Advantages:

Companies do not need self-built IT infrastructure, cloud resources can be rented

The three modes: public cloud, hybrid cloud, private cloud

Public cloud: for all users

Private cloud: for the enterprise

Hybrid cloud: part for himself, partly to outside

6. The three kinds of cloud services:

1.IaaS:基础设施即服务

将基础设施作为服务出租

2.PaaS:平台即服务

 用提供的平台资源开发、部署软件

3.SaaS:软件即服务

典型案例:云财务软件

 

7.云计算关键技术:

1.多租户

2.虚拟化:虚拟机、虚拟专用网(VPN)

3.分布式存储

4.分布式计算

 

8.云计算数据中心

数据中心包含有大量刀片服务器

 

数据中心建设位置:

地质构造稳定、天气凉爽

 

慕课链接:https://www.icourse163.org/learn/XMU-1002335004?tid=1003965001#/learn/announce

Guess you like

Origin www.cnblogs.com/musecho/p/10991125.html