【Introduction to Apache S4】

Apache S4 is a general-purpose, distributed, scalable, fault-tolerant, pluggable platform for handling infinite data streams of contacts. Apache S4 bridges the gap between complex proprietary systems and batch-oriented open source computing platforms. Our goal is to develop a high-performance computing platform that hides the complexity inherent in parallel processing systems from application programming.

Apache S4 is already used in Yahoo's systems to process thousands of search queries per second.



 

S4 is a general-purpose, distributed, scalable, fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continuous unbounded streams of data.

 

S4 motivation

S4 fills the gap between complex proprietary systems and batch-oriented open source computing platforms. We aim to develop a high performance computing platform that hides the complexity inherent in parallel processing system from the application programmer.

 



 

 

S4  implementation

The core platform is written in Java. The implementation is modular and pluggable, and S4 applications can be easily and dynamically combined for creating more sophisticated stream processing systems.

Guess you like

Origin http://10.200.1.11:23101/article/api/json?id=326856752&siteId=291194637