The beginning of training pandas for data processing - Code World

The beginning of training pandas for data processing

Others 2020-03-12 00:38:06 views: null

import urllib.request;
from pandas import DataFrame;
from pandas import Series;
from bs4 import BeautifulSoup;

response = urllib.request.urlopen('file:///F:/python/untitled1/core/do_data/2month.html');
html = response.read();
soup = BeautifulSoup(html,"html.parser")
trs = soup.find_all('tr')
ths = trs[0].find_all('th');

index_d = []
for th in ths:
    index_d.append(th.getText())
data = DataFrame(columns=index_d)
print(index_d)

for tr in trs :
    tds = tr.find_all('td')
    td_datas = []
    for td in tds:
        td_datas.append(td.getText())
    if len(td_datas) != 0:
        data=data.append(
            Series(
                td_datas,
                index=index_d
            ), ignore_index=True
        )

print(len(data))

str2s = []

for i in range(len(data["股票全码"])):
    str2 =str(data["股票全码"][i])
    str2 = str2.replace("SZ","0|")
    str2 = str2.replace("SH","1|")
    str2 = str2 + " |" + The Data [ " limit time " ] [i] + "  " + the Data [ " historical reasons limit " ] [i] + "  " + the Data [ " limit the reasons the election " ] [i] 
    str2s.append (str2) 

the Data [ " new new " ] = str2s 
Data = data.drop_duplicates (Subset = [ ' ticker ' ], = Keep ' Last ' , InPlace = False)
 Print (len (Data)) 
DF2 = Data [ " new new " ].
values
#print(type(df2))

file = open('data.txt', 'w')
file.writelines("\n".join(df2));
file.close()

Guess you like

Origin www.cnblogs.com/rongye/p/12466584.html

The beginning of training pandas for data processing

【Pandas】①Pandas Data Processing Basics

Data processing pandas

Pandas text data processing

Pandas data processing (a)

pandas basic data processing

Pandas data conversion processing

Reptiles data processing data processing pandas

Pandas Data Processing | Some usages of Datetime in Pandas!

Pandas | 17 missing data processing

Pandas classification (category) data processing

pandas processing mongodb data 01

[python] pandas processing of data tables

Pandas missing data processing Daquan

Pandas data cleaning and feature processing

Getting Started with Pandas Data Processing

pandas Data Analysis - Processing fill in missing data

Processing data table pandas characters and date data

Big Data Processing Training: Big Data Processing Process

Image data extraction processing pandas textrank

Python efficient processing of test data using Pandas

Python data processing library pandas advanced course

--- pandas cross table and data processing PivotTable

Pandas duplicate data and null value processing

22 Python Pandas data processing basics

Summary of commonly used functions in pandas data processing

Pandas data processing | apply() function usage guide!

Pandas text data processing and time series

Pandas advanced processing-data discretization

Python pandas simple application data processing

Recommended

Ranking

Linux关机和重启详解（shutdown、halt、poweroff、reboot、init）

Netty work notes 0007---NIO's three core component relationships

Knife4j tutorial

2021.10.29，内容:什么时候用接口和抽象类

How to solve the problem that changing the memory frequency causes the computer to become unusable?

SpringMVC Tutorial - Controller

linux learning skills -Linux 25 transport Vega paid special privileges and facl extension

Financial quarterly report evaluation report data automatic generation 1.0

Agile Development Series - The Values of Agile Development

scrapy achieve browsercookie Middleware

Daily

More

2024-05-19(0)

2024-05-18(31)

2024-05-17(6)

2024-05-16(23)

2024-05-15(5)

2024-05-14(9)

2024-05-13(8)

2024-05-12(28)

2024-05-11(32)

2024-05-10(34)