In-depth understanding of the transaction log of Apache Spark Delta Lake

In-depth understanding of the transaction log of Apache Spark Delta Lake

The transaction log is the key to Delta Lake understanding, because it is run through many of the most important features of the universal module, including ACID transactions, extensible metadata handling, time travel (time travel) and so on. In this article we will discuss the transaction log (Transaction Log) what is it at the file level is how it works, and how it reads and writes problem of providing elegant solutions for multiple concurrent.

What transaction log (Transaction Log) is

Delta Lake transaction log (also known as DeltaLog) is executed every time a transaction table on Delta Lake orderly records. Specific forms of the following:

[email protected]:/tmp/delta-table/_delta_log|
⇒  ll
total 280
-rw-r--r--  1 yangping.w

Guess you like

Origin yq.aliyun.com/articles/719415