The use of Kettle's "increase check sequence" (check whether the data has changed)

  I personally use the “add check sequence” function less frequently. It is mainly used to detect whether the data has changed. It can be checked according to different types, such as MD5, CRC32 and other check methods. Here, MD5 is used as an example. Let’s talk about the role of MD5 verification. In terms of files, if MD5 verification is added during file transmission, it can be judged whether the file has changed during transmission according to the verification value, and the correctness of the data can be said to be file. The digital fingerprint guarantees uniqueness. It can be judged by comparing the MD5 check value before and after data transmission. If you still have any doubts, you can check the principle of some check data methods such as MD5, and it will be easy to understand.
  For example, a simple business scenario, for example, a user uploads a file, and MD5 is used to generate a check value for the file when uploading. At the data receiving end, an MD5 check value is also generated for the file, and the two are checked. By comparing the verification value, you can see whether the file has changed during transmission. If there is a change, it will return to a transmission failure state, and then upload again. At the same time, you can also see the security of data transmission.

1. Core object -> conversion -> add check sequence, drag "add check sequence" to the conversion page, as shown in the figure below
Insert picture description here
2. Configure parameters, select verification type -> select verification result type -> Enter the verification result field -> select the verification field. It is important to note that the verification output field must be entered. If there is no entry, an error "At least one step has not been initialized" will be reported, as shown in the figure below
Insert picture description here
. 3. View the result data and
Insert picture description here
pass the result value as shown in the figure below It can be seen that an MD5 check value is generated for each field .
  The main purpose here is to talk about the role of "adding a check sequence", because when I first saw this functional operator, I didn’t know what this functional operator actually did. It was only by looking at some information that I could understand its role. Since I use less, I can't cite too many business scenarios. Friends who have questions about this function operator, I hope this article can help you.

Guess you like

Origin blog.csdn.net/AnameJL/article/details/115207232