scrapy设置proxy

scrapy设置代理

- 在setting中添加代理
# 设置代理
PROXIES = [
    {'ip_port':'xxx.xxx.x.xx(ip地址):xxxx(端口号)','user_passwd':'username:passwd(付费代理)'}
]
- 自定义中间件
import random,base64
from MinYan.settings import PROXIES
class RandomProxy(object):
    def process_request(self,request,spider):
        proxy = random.choice(PROXIES)
        if proxy['user_passwd'] is None:
            request.meta['proxy'] = "http://" + proxy['ip_port']
        else:
            base64_userpasswd = base64.b64encode(proxy['user_passwd'])
            request.headers['Proxy-Authorization'] = 'basic' + base64_userpasswd
            request.meta['proxy'] = "http://" + proxy['ip_port']
- 在setting中开启DOWNLOADER_MIDDLEWARES
发布了22 篇原创文章 · 获赞 0 · 访问量 445

猜你喜欢

转载自blog.csdn.net/qq_43227756/article/details/104522659