python构建随机ip请求头以及headers


上文我们提到已经搭建了一个 ip池了,那么我们该如何调用呢

获取随机ip

方法一(mysql)

如果你把ip存取到了数据库(mysql等),你可以调用这个方法

def getip():
    db = pymysql.connect(
                  host="****",#数据库
                  port=3306,
                  user="****",#用户
                  passwd="****",#密码
                  db="****",#表名
                  charset='utf8'
                  )
    cursor= db.cursor()    
    sql='SELECT * FROM IP地址  AS t1  JOIN (SELECT ROUND(RAND() * ((SELECT MAX(useid) FROM `IP地址`)-(SELECT MIN(useid) FROM IP地址))+(SELECT MIN(useid) FROM IP地址)) AS useid) AS t2 WHERE t1.useid >= t2.useid ORDER BY t1.useid LIMIT 1'
    #IP地址是你存取IP地址的表
    cursor.execute(sql)
    results = cursor.fetchall()
    db.commit()              
    return results[0][2]               

    db.close()

方法二(文档)

如果你把ip存取到了文档(可以从数据库导出),你可以调用这个方法

def getipd():
    workbook = xlrd.open_workbook("ip地址.xlsx")
    worksheet = workbook.sheet_by_index(0)
    ncols = worksheet.cell_value(random.randint(*, ****), 2)#**代表你要调用的范围,具体你得看你文档数据分布
    return ncols

获取随机headers

这里只是列出几个随机headers,你也可以加多几个进去

def getheaders():
    userAgentList = [
            'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.77 Safari/537.36',
            'Mozilla/5.0 (Windows NT 6.2; WOW64) AppleWebKit/537.36 (KHTML like Gecko) Chrome/44.0.2403.155 Safari/537.36',
            'Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2228.0 Safari/537.36',
            'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2227.1 Safari/537.36',
            'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2227.0 Safari/537.36',
            'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2227.0 Safari/537.36',
            'Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2226.0 Safari/537.36',
            'Mozilla/5.0 (Windows NT 6.4; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2225.0 Safari/537.36',
            'Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2225.0 Safari/537.36',
            'Mozilla/5.0 (Windows NT 5.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2224.3 Safari/537.36',
            'Mozilla/5.0 (Windows NT 10.0) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/40.0.2214.93 Safari/537.36',
            'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/37.0.2062.124 Safari/537.36',
            'Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/37.0.2049.0 Safari/537.36',
            'Mozilla/5.0 (Windows NT 4.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/37.0.2049.0 Safari/537.36',
            'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1985.67 Safari/537.36',
            'Mozilla/5.0 (Windows NT 5.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1985.67 Safari/537.36',
            'Mozilla/5.0 (X11; OpenBSD i386) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1985.125 Safari/537.36'
            
        ]
    return random.choice(userAgentList)

重点

现在两个方法写好了,那么该如何调用

response = requests.get(网址,headers={
    
    'User-Agent':getheaders()},proxies={
    
    'http':getipd()})

用这个方法就可以构造随机ip请求头了

猜你喜欢

转载自blog.csdn.net/qq_43337502/article/details/108071146
今日推荐