[python] Processing .csv files: extract specific columns from all .csv files under a directory, deduplicate, and save as a new .csv file


Enterprise 2023-07-29 01:07:03

Raw data:

After processing:

Solution:

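import os
import pandas as pd

path = r"D:\xxx\数据"  # root directory that holds the .csv files
frames = []
for dirpath, dirnames, filenames in os.walk(path):
    for filename in filenames:
        if not filename.lower().endswith(".csv"):
            continue  # skip anything that is not a .csv file
        # Read the file with pandas
        data = pd.read_csv(os.path.join(dirpath, filename))  # read all data in the file
        x = data[['x', 'y', 'z']]  # keep only the x, y, z columns
        print(x)
        frames.append(x)

# Writing inside the loop would overwrite userid.csv once per input file,
# so the rows are combined first and written a single time.
combined = pd.concat(frames, ignore_index=True)
a = combined.drop_duplicates(subset=['x', 'y', 'z'], keep='first', inplace=False)  # deduplicate
print(a)
a.to_csv(r'./userid.csv', sep=",")  # save as a new file; userid.csv is the file name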
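
The same steps can be wrapped in a reusable function, which makes them easier to test on a small directory before pointing them at real data. The following is only a sketch under a few assumptions: the function name `collect_unique_rows` and its parameters are hypothetical (not from the original post), every .csv file is assumed to contain all of the requested columns, and rows from all files are combined before deduplication.

```python
import os

import pandas as pd


def collect_unique_rows(root, columns):
    """Read `columns` from every .csv under `root`; return the de-duplicated rows.

    Hypothetical helper for illustration, not part of the original post.
    """
    frames = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in sorted(filenames):
            if not filename.lower().endswith(".csv"):
                continue  # ignore non-CSV files
            full_path = os.path.join(dirpath, filename)
            # usecols limits parsing to the requested columns;
            # indexing with [columns] puts them in the requested order.
            frames.append(pd.read_csv(full_path, usecols=columns)[columns])
    if not frames:
        # No .csv files found: return an empty frame with the expected columns
        return pd.DataFrame(columns=columns)
    combined = pd.concat(frames, ignore_index=True)
    # keep="first" retains the first occurrence of each repeated row
    unique = combined.drop_duplicates(subset=columns, keep="first")
    return unique.reset_index(drop=True)
```

A call such as `collect_unique_rows(path, ["x", "y", "z"]).to_csv("userid.csv", index=False)` would then write the result once. Passing `index=False` leaves the pandas row index out of the output file; drop that argument if the index column is wanted.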


Origin: blog.csdn.net/weixin_61745097/article/details/128359181
