Understanding unbounded and bounded data streams

  • Overview

    Any kind of data is produced as a stream of events. Credit card transactions, sensor measurements, machine logs, and user interactions on a website or mobile application are all generated as streams.

    Data can be processed as unbounded or bounded streams.

  • Unbounded streams

    Unbounded streams have a start but no defined end. They do not terminate and provide data as it is generated.

    Unbounded streams must be continuously processed, i.e., events must be handled promptly after they have been ingested.

    It is not possible to wait for all input data to arrive because the input is unbounded and will not be complete at any point in time.

    Processing unbounded data often requires that events are ingested in a specific order, such as the order in which events occurred, to be able to reason about result completeness.
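
    The event-at-a-time style described above can be sketched in plain Python. This is only an illustration, not Flink's API: the source `sensor_events` and the helper `running_average` are hypothetical names, and a generator stands in for a real stream source.

```python
import itertools
from typing import Iterator, Tuple


def sensor_events() -> Iterator[Tuple[int, float]]:
    """Hypothetical unbounded source: yields (timestamp, value) forever."""
    for ts in itertools.count():
        yield ts, float(ts % 5)


def running_average(events: Iterator[Tuple[int, float]]) -> Iterator[float]:
    """Emit an updated average after every event, without waiting for the end."""
    total = 0.0
    for n, (_, value) in enumerate(events, start=1):
        total += value
        yield total / n


# The source never terminates, so we only inspect the first few results here.
first_results = list(itertools.islice(running_average(sensor_events()), 5))
print(first_results)
```

    Each result is available as soon as its event arrives. Calling `list(...)` on the source without `islice` would never return, which is the practical meaning of "the input will not be complete at any point in time".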

  • Bounded streams

    Bounded streams have a defined start and end.

    Bounded streams can be processed by ingesting all data before performing any computations.

    Ordered ingestion is not required to process bounded streams because a bounded data set can always be sorted.

    Processing of bounded streams is also known as batch processing.
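
    For contrast, a minimal sketch of the bounded case, again in plain Python rather than Flink's API. The function `process_bounded` and the sample `batch` are made up for illustration. Because the whole data set is available, it can be sorted before any computation starts, so arrival order does not matter.

```python
from typing import List, Tuple


def process_bounded(events: List[Tuple[int, float]]) -> List[float]:
    """Ingest everything, sort by timestamp, then compute running totals."""
    ordered = sorted(events, key=lambda e: e[0])  # possible only because the input is finite
    totals: List[float] = []
    total = 0.0
    for _, value in ordered:
        total += value
        totals.append(total)
    return totals


# Events arrive out of order; sorting restores event-time order.
batch = [(3, 30.0), (1, 10.0), (2, 20.0)]
result = process_bounded(batch)
print(result)
```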

  • References

  1. Apache Flink Docs

Reposted from blog.csdn.net/The_Time_Runner/article/details/115295631