Flink provides a distributed cache, similar to hadoop, that allows users to easily read local files in parallel functions and put them in the taskmanager node to prevent repeated task pulls. The working mechanism of this cache is as follows: the program registers a file or directory (local or remote file system, such as hdfs or s3), registers the cache file through ExecutionEnvironment and gives it a name. When the program is executed, Flink automatically copies the file or directory to the local file system of all taskmanager nodes, which will only be executed once. Users can find the file or directory by this specified name, and then access it from the local file system of the taskmanager node.