Evolution process from traditional relational database engine to parallel database engine

Author: Zen and the Art of Computer Programming

1 Introduction

The database has always been the infrastructure for enterprise-level application development. It provides a series of functional interfaces that can help businesses quickly develop, deploy, and maintain systems. As the originator of relational databases, the SQL language was already one of the most important data interaction languages ​​for Internet companies at that time. With the rapid development of information technology and the revolutionary development of software programming technology, relational databases are gradually approaching the edge of "outdated". In the 1990s, existing relational databases such as Oracle and DB2 entered the commercial stage, but in order to cope with more complex application scenarios, people proposed a new database theory, namely the era of massive data processing (big data) The current relational database is not suitable for the status quo, so a new database model - NoSQL database is needed. The design concept of NoSQL database absorbs the characteristics of distributed systems, cloud computing, large-scale data sets, dynamic query, etc., and adopts fragmentation mechanism, replication technology, master-slave cluster, etc. Optimizations for relational databases have been improved. At the same time, in order to make better use of massive data, NoSQL database also proposes a storage solution based on column storage and distributed file system, which enables data to be expanded horizontally and increases fault tolerance. There are already many well-known NoSQL products on the market, such as MongoDB, Cassandra, etc. These databases will change the pattern of traditional relational databases in the near future.

This article discusses the history and evolution of Apache Cassandra, a representative product of NoSQL. Cassandra is a high-reliability, high-availability distributed database. Its core feature is to achieve high data availability through replication technology, and supports automatic balance and data automatic scheduling, effectively avoiding single point of failure. However, for a new thing, many people don't know how to choose a database. Therefore, this article will first introduce the development history of NoSQL and its market demand. Then introduce the design concept, implementation and key technologies of Cassandra. Finally, relevant performance analysis is carried out through practical cases. Throughout the process, we will explain the significance of NoSQL technology in the field of big data

Guess you like

Origin blog.csdn.net/universsky2015/article/details/132681700