"Big Data technologies" Principles and Applications, Second Edition - Chapter Big Data Overview
Others
2019-12-22 02:34:50
views: null
1.2 Big Data concept
- Big amount of data
- Many data types
- Processing speed
- Low density value
1.3 impact of Big Data
- Changes gone from experiment to theory to calculate and then data
- Changes in thinking
- Full sample rather than sample
- Efficiency rather than precise
- Relevant and not causal
1.6 Large data calculation mode
- Batch computing, primarily in large-scale data for batch processing. MapReduce for large data sets (1TB) parallel computing. Spark is a low-latency cluster for large data sets distributed computing system, much faster than MapReduce.
- Flow calculations, data stream or data flow refers to a set of infinite series of dynamic data on the number and distribution of time, it must be calculated in real time given by way of second response. Commercial-grade platform: Streams, StreamBase; second category is open source computing platform, Storm, Yahoo, S4, Spark Streaming
- Calculation FIG. FIG Pregel achieve parallel processing system, mainly for graph traversal, shortest path, the PageRank calculation, there are other Giraph, GraphX, PowerGraph, GoldenOrb, Hama
- Analysis calculation, need to provide real-time or near real-time response, Google's Dremel, Impala, Hive, Cassandra
Large data cloud 1.8
- Cloud computing consists of three typical service mode, IaaS (infrastructure services, ie computing resources and storage), PaaS (Platform as a Service), SaaS (software as a service)
- Public cloud, private cloud, hybrid cloud
- Cloud computing key technologies include: virtualization technology, distributed storage, distributed computing, multi-tenant.
- Things extensions thereof is connected to the Internet, he uses the local network or the Internet and other communication technologies the sensors, controllers, machines, people and linked together in a new way, and objects formed, was connected to the object, information technology and remote management control.
Origin www.cnblogs.com/tsruixi/p/12078843.html