Data format for the folder in the folder txt, txt and all the files in the original classification, centralized merge to excel in. Results as shown:
import os
import os.path
import pandas as pd
filedir = 'C:/Users/Administrator/Desktop/数据集'
filenames=os.listdir(filedir)
content=[]
result=[]
for i in filenames:
filedir = 'C:/Users/Administrator/Desktop/数据集' +"/"+str(i)
filenames=os.listdir(filedir)
for filename in filenames:
filepath = filedir+'/'+filename
with open(filepath,'r',encoding='utf-8') as file:
file=file.readlines()
content.append(file)
result.append(i)
df=pd.DataFrame()
df["content"]=content
df['result']=result
df.to_excel('data.xlsx')
Download dataset: https://download.csdn.net/download/weixin_42342968/12162455