When you download a dataset from a GitHub code repository, you often get files with names like 7z.001.001. These exist because the full archive is too large, so it was split evenly into n volumes (001, 002, 003, ...) during compression. A single 00x file cannot be decompressed on its own, because it is only one fragment of the archive. All volumes must first be merged back into one file, and that file can then be decompressed.
The steps for Linux and Windows are described separately below.
1. Linux:
① Merge the files:
Go to the folder that contains the volumes and run a command of the form
cat {filename1} {filename2} {filenameN} > {output filename}
Taking the archive ShapeNetP2M.7z as an example:
cat ShapeNetP2M.7z-001.001 ShapeNetP2M.7z-002.002 ShapeNetP2M.7z-003.003 ShapeNetP2M.7z-004.004 > ShapeNetP2M.7z
The merged file ShapeNetP2M.7z is created in the current directory. The volumes must be listed in numerical order.
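Listing every volume by hand gets tedious when there are many parts. The following is a minimal sketch, not part of the original instructions, that merges all volumes sharing a prefix by relying on the shell's sorted glob expansion. The helper name merge_volumes and the demo.7z placeholder files are assumptions made for illustration, and the sketch assumes the volume numbers are zero-padded so that sorted order equals part order.

```shell
set -eu

# Hypothetical helper: concatenate every file named "<prefix>-*" into <output>.
merge_volumes() {
    prefix=$1
    output=$2
    # Glob expansion is sorted, so -001.001 comes before -002.002.
    # Assumption: part numbers are zero-padded, otherwise the order is wrong.
    set -- "$prefix"-*
    if [ ! -e "$1" ]; then
        echo "no volumes found for prefix: $prefix" >&2
        return 1
    fi
    cat "$@" > "$output"
}

# Demo with tiny placeholder volumes in a temporary directory.
demo_dir=$(mktemp -d)
printf 'AAA' > "$demo_dir/demo.7z-001.001"
printf 'BBB' > "$demo_dir/demo.7z-002.002"
printf 'CCC' > "$demo_dir/demo.7z-003.003"

merge_volumes "$demo_dir/demo.7z" "$demo_dir/demo.7z"
merged=$(cat "$demo_dir/demo.7z")
echo "$merged"   # prints AAABBBCCC
rm -rf "$demo_dir"
```

For the ShapeNetP2M example, the equivalent call would be merge_volumes ShapeNetP2M.7z ShapeNetP2M.7z, run from the folder that contains the volumes.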
② Decompress the 7z file:
The format is 7zr e {file path}, for example:
7zr e "ShapeNetP2M.7z"
This yields a file with the .tgz suffix. That file is still compressed and needs one more decompression step.
③ Decompress the tgz file:
Run the command below. The argument after -C is the target folder to extract into, and ./ means the current folder. If -C is omitted, the files are extracted into the current folder by default.
tar zxvf {filename}.tgz -C ./
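To make the role of -C concrete, here is a small sketch under stated assumptions: it builds a throwaway sample.tgz (a stand-in for the file produced by the 7z step), lists its contents, and extracts it into a separate folder. The names work_dir, sample.txt, and sample.tgz are placeholders. Note that tar does not create the -C target folder, so it has to exist before extraction.

```shell
set -eu

# Scratch area so the demo does not touch real data.
work_dir=$(mktemp -d)
mkdir "$work_dir/src" "$work_dir/out"
printf 'hello\n' > "$work_dir/src/sample.txt"

# Build a tiny .tgz standing in for the one produced by the 7z step.
tar -czf "$work_dir/sample.tgz" -C "$work_dir/src" sample.txt

# List the archive contents without extracting anything.
tar -tzf "$work_dir/sample.tgz"

# Extract into the target folder; the folder must already exist.
tar -xzf "$work_dir/sample.tgz" -C "$work_dir/out"

extracted=$(cat "$work_dir/out/sample.txt")
echo "$extracted"
rm -rf "$work_dir"
```

Listing with -t before extracting is a cheap way to check whether the archive unpacks into a single top-level folder or scatters files into the target.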
Done!
2. Windows:
① Open the cmd command line and cd to the folder that contains the volumes.
② Run the command:
copy /b filename.7z* filename.7z
Replace filename with the name of your own archive. Only the prefix needs to change.
This produces the merged file filename.7z, which can then be decompressed.