python爬百度文库课件 - 代码天地

python爬百度文库课件

其他 2018-09-17 16:00:22 阅读次数: 0

库:re;selenium;requests

源码：

from selenium import webdriver
import re
import requests

def open_img(items):
    for item in items:
        item = re.sub('&','&',item)
        rsp =requests.get(item)
        yield rsp.content

url ='https://wenku.baidu.com/view/4e3d35d969eae009581becd5.html?from=search'　　　　#可修改成别的ppt网址
browser =webdriver.Chrome()
try:
    browser.get(url)
    html =browser.page_source
    pattern =re.compile('<div class="ppt-page-item.*?src="(.*?)".*?>',re.S)
    items =re.findall(pattern,html)
    n =0
    for i in open_img(items):
        with open('%d.jpeg'%n,'wb') as file:
            file.write(i)·
            n +=1
            print('第%d张图片下载完成'%n)

finally:
    browser.close()
input()

猜你喜欢

转载自www.cnblogs.com/vvlj/p/9662534.html

python爬百度文库课件

python——百度文库爬取

Python + selenium 爬取百度文库Word文本

Python实现的爬取百度文库功能

python爬取百度文库所有内容

Python爬取百度文库doc文档

python+requests爬取百度文库ppt

Python3爬取百度文库数据

爬取百度文库文章

百度文库

利用Python进行百度文库内容爬取（一）

python3爬虫(2):使用Selenium爬取百度文库word文章

Python3爬虫-selenium爬取百度文库

Python爬取 vip百度文库,再也不用为下载卷苦恼了

python+selenium爬取百度文库不能下载的word文档

python 利用selenium爬取百度文库的word文章

python爬虫爬取百度文库txt以及ppt资料

二十一、Python爬取百度文库word文档内容

利用Python进行百度文库内容爬取（二）——自动点击预览全文并爬取

仿百度文库

百度文库－ PS

百度文库破解

百度文库爬虫

python爬虫实战：下载百度文库文档

Python3网络爬虫(九)：使用Selenium爬取百度文库word文章

仿百度文库功能心得

复制百度文库中的内容

概率证明及百度文库的实现

百度文库 -- 下载免费

js 模拟百度文库评分

今日推荐

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

周排行

浏览器对同一域名进行请求的最大并发连接数

React Hook之自定义Hook

【转】MyBatis缓存机制

-Java-泛型

自动化测试常用脚本-发送邮件

LeetCode#859: Buddy Strings

java、Python处理字符串

第二篇の博客

Hadoop伪分布式环境安装

SQL Server进阶（十一）临时表、表变量

每日归档

更多

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)