lxml extracts html tag content, tostring () cannot display Chinese solution - Code World

lxml extracts html tag content, tostring () cannot display Chinese solution

Others 2020-04-18 08:56:19 views: null

from lxml import etree
import requests


response = requests.get('https://www.baisu.com/).text
tree = etree.HTML(response)
strs = tree.xpath( "//body")
strs = strs[0]
 strs = (etree.tostring(strs)) # 不能正常显示中文
strs = (etree.tostring(strs, encoding = "utf-8", pretty_print = True, method = "html")).decode("gbk") # 可以正常显示中文
print (strs)

Chai Shen Blog Expert

Published 150 original articles · praised 149 · 810,000 views

His message board concerns

Guess you like

Origin blog.csdn.net/chaishen10000/article/details/103168859

lxml extracts html tag content, tostring () cannot display Chinese solution

pdfminert extracts PDF Chinese content

UE4 cannot display Chinese solution

a tag cannot download Chinese files

How to display html content in applet

laravel removes the html tag display

Redis Chinese display problem solution

IDEA cannot display the services solution

Uniapp's Swiper cannot display content

Solve the problem that the Linux system cannot display Chinese

Solve the problem that ubuntu cannot display Chinese pinyin

pygame cannot display Chinese on the screen (solved)

Character set reason cannot display Chinese normally

Character set reason cannot display Chinese normally

javascript content html <b> tag inside

[HTML] input tag prompts to add content

The linux terminal can display Chinese, but cannot input Chinese.

Chinese garbled html code solution

Content with tags, display problems in HTML, JSTL

When the content attribute is Chinese, the html page is garbled

Batch bat file display Chinese garbled solution

Matplotlib drawing does not display Chinese (solution)

Manim Chinese display problem error solution

Solution to Eastwood Chinese display garbled problem

Get the src and href content of the img tag and a tag from the html text

AutoCAD cannot display dotted line solution

AutoCAD cannot display dotted line solution

The solution to the problem that the computer cannot display WiFi

The <img> tag image of the Html page cannot be displayed

The <img> tag image of the Html page cannot be displayed

Recommended

TIOBE May list: Fortran “resurrected” into Top 10

GCC 14.1 released

Ranking

B. Little Girl and Game【1300 / 回文字符串博弈论】

CIKERS Shane 20190613

"Javascript advanced programming" study notes - the constructor and prototype

beeline hiveserver2 start

springboot - Automatically backup mysql data every day

Data Storage Full Solution--Detailed Persistence Technology

Detailed Explanation of Spring Web MVC DispatcherServlet—Official Original

TCP / IP protocol layers structure and function

Command type literal pos: unknown； Fallback type literal pos: unknown] with root cause

Design of multifunctional curtain controller with indoor anti-theft alarm

Daily

More

2024-05-08(18)

2024-05-07(34)

2024-05-06(6)

2024-05-05(0)

2024-05-04(18)

2024-05-03(8)

2024-05-02(0)

2024-05-01(4)

2024-04-30(36)

2024-04-29(5)