版权声明:本文为博主原创文章,未经博主允许不得转载。 https://blog.csdn.net/qq_33335577/article/details/79123861
本人用的是Python2.7。不废话直接上代码
# -*- coding:utf-8 -*-
import urllib2
import re
url = "http://www.fenxiangdashi.com/aiqiyi/94659.html";
headers = {'User-Agent':
"Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:57.0) Gecko/20100101 Firefox/57.0"
};
request = urllib2.Request(url=url,headers =headers);
request = urllib2.urlopen(request);
html = request.read();##读取网页源码
##创建正则表达式
pattren = re.compile("账号:.*? 密码:.*?<br/>");
##正则表达式与源码进行匹配
result = pattren.findall(html);
print '一共匹配到%s个爱奇一艺帐号' %(len(result));
for string in result:
print string.replace("<br/>","");
运行截图: