58 city architects with your handwriting Kafka stream processing technology

Kakfa Introduction

What is Kafka

Kafka was originally LinkedIn internal internal infrastructure system. It is considered a workflow platform, you can publish on Kafka and subscription streaming data, and save them up for processing. But we use Kafka, the most is to use it as a messaging system, similar to ActiveMQ, RabbitMQ and so on. But Kafka with these traditional messaging system also has many differences, these differences make it different from the messaging system.

  • Kafka is a distributed system to a cluster (support free expansion) is run. (We always called distributed message queue)
  • Kafka can be used to store data, the length of time the data is stored is defined by your own (fault-tolerant persistent stored). And further provided that the data stored in Kafka, you can read repeatedly.
  • Streaming will enhance the level of data processing to a new level.

The traditional message system, only used for passing messages.
Kafka can also be seen as a real-time version of Hadoop (which is also designed one of Kafka's original intention). Hadoop to store and process large amounts of data files on a regular basis, and Kafka and persistent store and process large data streams. Hadoop is mainly used in the data analysis, and Kafka because of its low latency characteristics more suitable used in the core business, when business event occurs, Kafka can make the appropriate response to these events. Meanwhile kafka can also be compared and ETL, because they are good at moving data.

Kafka belongs to the news release system - a subscription model messaging system. The message sender does not send messages directly to the recipients in a message, but the message is first classified (topic), and then publish messages to the message system. Select message recipients subscribe message type (topic), then the system can be received from the message the message is subscribed.

Instructor Profile

58 city architects with your handwriting Kafka stream processing technology

Kafka's application

58 city architects with your handwriting Kafka stream processing technology

Kafka message processing flow

58 city architects with your handwriting Kafka stream processing technology

Kafka important configuration information

58 city architects with your handwriting Kafka stream processing technology

Kafka's combat command

58 city architects with your handwriting Kafka stream processing technology

Kafka combat - functional programming and lambda expressions

58 city architects with your handwriting Kafka stream processing technology

to sum up

Kafka is exploding. More than a third of Fortune 500 companies use Kafka. These companies include the top ten tourist companies, the top ten banks in seven, eight of the top ten insurance companies in the top ten in nine telecommunications companies, and more. LinkedIn, Microsoft and Netflix use Kafka day (1,000,000,000,000) processing one trillion messages. Kafka for realtime data stream, a large collection of data or analyzed in real time (or both). Kafka used in conjunction with the memory in order to provide reliable services microstructure, it can be used to provide events to the CEP (Complex Event flow system) and IoT / IFTTT automation systems.

Guess you like

Origin blog.51cto.com/13904503/2415683