Python爬取PDF中的表格

 使用Python爬取PDF中的表格

import pdfplumber
import pandas as pd

zmx_pdf = pdfplumber.open("D:/engineering space/raw file/pdf/prope_zmx.pdf")
page_2 = zmx_pdf.pages[3]
table_1 = page_2.extract_table()
df_1 = pd.DataFrame(table_1)
# list_1 = np.array(table_1)
# list_1 = list_1.tolist()
print(df_1)
# df_1.to_excel('D:/engineering space/raw file/pdf/test2.xlsx')

 

猜你喜欢

转载自blog.csdn.net/joshua_shi_t/article/details/121132646