GBK encoding error in a web crawler - Code World

Others 2019-05-27 12:52:37

Source code:

'''Baidu Tieba crawler; the content differs from page to page'''

from urllib import request
from urllib import parse

# Common variables
base_url = "https://tieba.baidu.com/f?kw="
headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64; rv:6.0) Gecko/20100101 Firefox/6.0'}

# Build the URL (encode first, then concatenate, then request)
tb_name = input("Please enter the Tieba name: ")
key = parse.quote(tb_name)
url = base_url + key

print(url)

# Three steps
# Build the request object, wrapping the request headers
req = request.Request(url, headers=headers)
# Send the request with urlopen
res = request.urlopen(req)
# Get the response
html = res.read().decode('utf-8')

# print(html)

# Save the file
with open('贴吧.txt', 'w') as f:
    f.write(html)
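
The URL-building step in the listing can be checked on its own, without sending a request. This is a minimal sketch, assuming 美女吧 as a sample Tieba name; `build_url` is a hypothetical helper name that does not appear in the original script.

```python
from urllib import parse

BASE_URL = "https://tieba.baidu.com/f?kw="


def build_url(tb_name):
    """Percent-encode the Tieba name (UTF-8 by default) and append it to the base URL."""
    return BASE_URL + parse.quote(tb_name)


url = build_url("美女吧")
print(url)  # https://tieba.baidu.com/f?kw=%E7%BE%8E%E5%A5%B3%E5%90%A7
```

`parse.quote` encodes the text as UTF-8 before percent-escaping it, so each Chinese character becomes three `%XX` groups.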

While crawling the data, the following error occurred:

Please enter the Tieba name: 美女吧
https://tieba.baidu.com/f?kw=%E7%BE%8E%E5%A5%B3%E5%90%A7
Traceback (most recent call last):
  File "D:/AID1812/Spider/day01/05_Baidu Tieba practice.py", line 29, in <module>
    f.write(html)
UnicodeEncodeError: 'gbk' codec can't encode character '\U0001f236' in position 166141: illegal multibyte sequence
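
The failure can be reproduced without any network access. This is a minimal sketch, assuming only that the page text contained the character named in the traceback: U+1F236 is an emoji with no mapping in GBK, so the `gbk` codec rejects it while UTF-8 encodes it.

```python
ch = "\U0001f236"  # the character reported in the traceback

# UTF-8 encodes this character as 4 bytes.
utf8_bytes = ch.encode("utf-8")

# GBK has no mapping for this character, so encoding raises.
try:
    ch.encode("gbk")
    gbk_failed = False
except UnicodeEncodeError as exc:
    gbk_failed = True
    print(exc.encoding, "|", exc.reason)
```

Writing to a text file whose encoding is GBK performs the same encoding step internally, which is why the error surfaces at `f.write(html)` rather than at the download.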

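Solution:

Add encoding="utf-8" to the open() call. When no encoding is given, open() falls back to the system's locale encoding, which on a Chinese-language Windows system is GBK, and GBK cannot represent characters such as the emoji '\U0001f236' found in the page.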

# Save the file
with open('贴吧.txt', 'w', encoding='utf-8') as f:
    f.write(html)
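
To see the fix in isolation, here is a minimal sketch that writes a string containing the same emoji to a temporary file with an explicit encoding and reads it back. The sample text and the temporary path are assumptions for illustration; the real script writes the downloaded `html` instead.

```python
import os
import tempfile

sample = "贴吧 \U0001f236"  # stand-in for the downloaded page text

path = os.path.join(tempfile.mkdtemp(), "tieba.txt")

# An explicit encoding makes the result independent of the system locale.
with open(path, "w", encoding="utf-8") as f:
    f.write(sample)

# Read the file back with the same encoding to confirm nothing was lost.
with open(path, encoding="utf-8") as f:
    restored = f.read()

print(restored == sample)  # True
```

If the output file has to stay in GBK for some reason, open() also accepts an `errors` argument such as `errors="replace"`, which substitutes `?` for characters the codec cannot encode; that loses data, so UTF-8 is usually the safer choice.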

Origin: www.cnblogs.com/tianxiong/p/10929704.html