爬虫保存cookies时重要的两个参数(ignore_discard和ignore_expires)的作用

代码如下:(由于临时做的实例采用登录云打码平台的cookies)

import requests
from lxml.html import etree
from fake_useragent import UserAgent
from http.cookiejar import LWPCookieJar
from ydm import shibie


session = requests.Session()
session.cookies = LWPCookieJar(filename='ydm.txt')

try:
    session.cookies.load('ydm.txt', ignore_discard=True, ignore_expires=True)
    print('本地cookie加载完成')
except Exception as e:
    print('')

ua = UserAgent()
headers = {
    'headers': ua.random,
}


def get_captcha_image():
    content = session.get('http://www.yundama.com/index/captcha', headers=headers).content
    with open('captcha.jpg', 'wb') as f:
        f.write(content)


def login():
    get_captcha_image()

    # 验证码保存到本地以后,对验证码进行识别
    result = shibie('captcha.jpg')

    print('识别结果:', result)

    response = session.get('http://www.yundama.com/index/login?username=gaohairui&password=gao12345&utype=1&vcode={}'.format(result))

    print(response.text)

    session.cookies.save(ignore_discard=True, ignore_expires=True)

if __name__ == '__main__':
    login()


不难看出最后的保存的两个参数:ignore_discard和ignore_expires

 session.cookies.save(ignore_discard=True, ignore_expires=True)

两个参数的作用:

    官方的解释:

ignore_discard: save even cookies set to be discarded.
ignore_expires: save even cookies that have expired.The file is overwritten if it already exists

    不难看出两个参数的实际作用是:ignore_discard的意思是即使cookies将被丢弃也将它保存下来,ignore_expires的意思       是如果cookies已经过期也将它保存并且文件已存在时将覆盖,在这里,我们将这两个全部设置为True。运行之后,cookies       将被保存到ydm.txt文件中。

    查看一下内容:


    















猜你喜欢

转载自blog.csdn.net/qq_33472765/article/details/80738923