Gpload use of the loading and unloading of data GreenPlum

The tool-readable Greenplum gpload Greenplum external tables and parallel file server (gpfdist or gpfdists) to load data. It is based on parallel processing table disposed outside the file and allows the user to configure their data formats in a single configuration file, and the external table definition or gpfdists gpfdist provided. Gpload using tools, the need to write gpload control file, the control file is a file format yaml, as shown below:
Gpload use of the loading and unloading of data GreenPlum
and then performing gpload loading operation, as follows:
Gpload use of the loading and unloading of data GreenPlum
Description control file:
. 1, gpload control file, the same level parameter be sure to maintain a consistent indentation;
2, "-" followed by a space must be, if ":" keep up the parameter value, then the ":" followed by a space must be also;
3, gpload control file can also write SQL statements, such as the figure above, prior to loading (before) truncate the table, after the completion of loading statistics (after) collecting table;
4, parameters for gpfdist also be specified in the control file, such as the above-described specified MAX_LINE_LENGTH, port parameters .
References:
1, gpload official manual .
2, Greenplum Chinese manual

Guess you like

Origin blog.51cto.com/candon123/2411154