hadoop first day - 4. Large Data Analysis System

Fourth, the big data analysis system

  1. SUMMARY
    according to data transfer processes, each of the data analysis module is connected to form a large data system. Module comprising:
         - data collection (collection)
         - data storage
         - data calculation
         - Data Analysis
         - Application Data

          Also based on the timeliness of the data, generated from the time interval between the particular application, it is divided into off-line calculation, real-time calculation.
               - off-line calculation (processing): historical data processing, analysis-oriented in the past, called batch (batch) processing.
               - real-time calculation (processing): the current generated in real time data processing, called the flow (stream) type processing.
          The so-called timeliness is people can accept as a standard.

  1. Web site traffic log data analysis system
    system meaning: to help webmasters, operations personnel, extension workers and other real-time access to the Web traffic information, and traffic from the source, site content, site visitors based on site characteristics to provide data analysis aspects. Helping to improve website traffic, improve website user experience for visitors to settle down becomes more members or customers extract maximum revenue from fewer inputs.

Guess you like

Origin blog.csdn.net/qq_28178795/article/details/92076191