python入门18网络爬虫 - 代码天地

python入门18网络爬虫

其他 2019-01-28 02:50:18 阅读次数: 0

一基本概念
作用：①私人定制一个搜索引擎
②获取更多数据源
③搜索引擎优化

组成：① 控制节点
②爬虫节点
③资源库（存储爬虫爬取的响应数据）

选择爬虫 – 强大的爬虫Scrapy,以及成熟高效的scrapy-redis分布式策略

1.urllib库的基本使用

import urllib
response = urllib.request.urlopen(" 此处输入网址 ")
print(response.read())

urlopen一般接受3个参数：
urlopen(url,data,timeout)
第一个url参数是URL，第二个参数data是访问URL时要传送的数据，第三个参数timeout是设置超时时间
第二，三个参数是可以不传送的

2.模拟POST登陆网站

import urllib.request
import urllib.parse
values={"username":  ,"password":  }
data = urllib.parse.urlencode(values)
url="   "
request=urllib.request.Request(url,data)
response =urllib.request.urlopen(request)
print(response.read())

3.urllib库的高级用法

import urllib.request
import urllib.urllib.parse
url="  "
user_agent = "  "
values={'username':' ','password':' '}
headers={'User-Agent' : user_agent}
data = urllib.parse.urlencode(values)
request =urllib.request.Request(url,data,headers)
response=urllib.request.urlopen(request)
print(response.read())

猜你喜欢

转载自blog.csdn.net/qq_35076836/article/details/82974323

python入门18网络爬虫

Python3网络爬虫入门

Python3.6网络爬虫

Python爬虫一一网络爬虫简介

Python3网络爬虫快速入门实战解析

tensorflow入门笔记(十九)python3网络爬虫(下)

tensorflow入门笔记(十八)python3网络爬虫(中)

tensorflow入门笔记(十七)python3网络爬虫（上）

Python3网络爬虫：Scrapy入门之使用ImagesPipline下载图片

0302网络爬虫

python3网络爬虫(抓取文字信息)

Python 3网络爬虫开发实战 PDF

Python3网络爬虫(一)

《python3网络爬虫开发实战》--Scrapy

《Python3网络爬虫开发实战》教程

Python3网络爬虫工具安装（Mac）

《Python 3网络爬虫开发实战》

Python3网络爬虫-请求库的安装

Python3网络爬虫-基本库使用

Python3网络爬虫开发实战

Python3网络爬虫实战-30、PyQuery

Python3网络爬虫——环境配置

Python 3网络爬虫开发实战书籍

python3网络爬虫开发实战pdf

Python3网络爬虫基本操作(一)

什么是Python3网络爬虫？

Python3网络爬虫：Scrapy入门实战之爬取动态网页图片

Ubuntu18网络配置

Python3网络爬虫之requests动态爬虫：拉钩网

Python之父强烈推荐，Python3网络爬虫开发实战，爬虫入门必看书籍，豆瓣评分9.2

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

周排行

Python环境安装与基础语法（1）——计算机基础知识

IMU预积分

ADAS中的LDW、FCW、BSD、LCA、ACC、AEB、APA、DMS代表的含义

B站笔试两道题

skyeye arm 硬件虚拟机环境的搭建

Web前端静态页面示例

数组-合并排序数组 II-简单

springcloud之版本问题启动报错

面向对象-------------匿名对象(六)

输入URL到页面呈现中间发生了什么？

每日归档

更多

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)