python+selenium+PIL+tesseract验证码识别 - 代码天地

python+selenium+PIL+tesseract验证码识别

其他 2018-05-23 17:11:45 阅读次数: 2

一段简单的验证码识别，不过tesseract验证码识别很差，试了十几次只成功过两次，对结果不满意就当是学会一样新的技术吧

 1 from selenium import webdriver
 2 from time import sleep
 3 import unittest
 4 from PIL import Image
 5 from PIL import ImageEnhance
 6 import pytesseract
 7 driver=webdriver.Firefox()
 8 url="https://passport.baidu.com/?getpassindex"
 9 driver.get(url)
10 driver.maximize_window()
11 driver.save_screenshot(r"E:\aa.png")  #截取当前网页，该网页有我们需要的验证码
12 imgelement = driver.find_element_by_xpath(".//*[@id='forgotsel']/div/div[3]/img")
13 #imgelement = driver.find_element_by_id("code")  #定位验证码
14 location = imgelement.location  #获取验证码x,y轴坐标
15 print (location)
16 size=imgelement.size  #获取验证码的长宽
17 print(size)
18 coderange=(int(location['x']),int(location['y']),int(location['x']+size['width']),
19            int(location['y']+size['height'])) #写成我们需要截取的位置坐标
20 i=Image.open(r"E:\aa.png") #打开截图
21 frame4=i.crop(coderange)  #使用Image的crop函数，从截图中再次截取我们需要的区域
22 frame4.save(r"E:\frame4.png")
23 i2=Image.open(r"E:\frame4.png")
24 imgry = i2.convert('L')   #图像加强，二值化，PIL中有九种不同模式。分别为1，L，P，RGB，RGBA，CMYK，YCbCr，I，F。L为灰度图像
25 sharpness =ImageEnhance.Contrast(imgry)#对比度增强
26 i3 = sharpness.enhance(3.0)  #3.0为图像的饱和度
27 i3.save("E:\\image_code.png")
28 i4=Image.open("E:\\image_code.png")
29 text=pytesseract.image_to_string(i4)#使用image_to_string识别验证码
30 print (text)

code

猜你喜欢

转载自www.cnblogs.com/mtfan01/p/9077760.html

python+selenium+PIL+tesseract验证码识别

Python+selenium+pil+tesseract实现自动识别验证码

Python+Selenium+PIL+Tesseract真正自动识别验证码进行一键登录

Python - PIL-pytesseract-tesseract验证码识别

python 做验证码识别 tesseract

Mac python Tesseract 验证码识别

python使用tesseract识别验证码

python selenium PIL破解滑动验证码

python 爬虫 pytesseract 验证码识别：认识Tesseract

python使用tesseract-ocr完成验证码识别

python利用Tesseract识别验证码的方法

python爬虫中用Tesseract识别图形验证码

python+tesseract-orc 简单的验证码识别

OpenCv-Python-Tesseract验证码识别

selenium+pil截取验证码

使用Tesseract 识别验证码

Tesseract做图片验证码识别

Selenium识别验证码

PIL+selenium+Tesseract 实现验证码的登录（山东大学图书馆）

python利用selenium库识别点触验证码

使用python+selenium做验证码识别

python+selenium识别图片验证码

[Python自动化]selenium之验证码识别

吴裕雄--天生自然python学习笔记：python 用 Tesseract 识别验证码

selenium 验证码登录之Tesseract-OCR 安装

python selenium 验证码

Python爬虫教程-29-验证码识别-Tesseract-OCR

python下以api形式调用tesseract识别图片验证码

python+pillow+pytesseract+Tesseract-OCR验证码识别[转]

tesseract-orc训练结合python3图像识别验证码

今日推荐

Linus “吃狗粮”最积极！

开源日报 | Winamp播放器即将开源；生成式AI之战升级第二轮；Linus“吃狗粮”最积极；AI进入泡沫前期；吴泳铭为阿里云带来了什么？

NetBSD 禁止提交由 AI 生成的代码

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

周排行

SVN服务端安装在阿里云

实战 | 相机标定

webpack核心概念

note20——》只要肯低头吃苦，人生就会有救

PAT甲级 1062 Talent and Virtue （25 分）排序

NG Toolset开发笔记--5GNR Resource Grid（26）

如何对待上司

oracle命令

第9章 STL迭代器

logstash使用es映射模板

每日归档

更多

2024-05-20(36)

2024-05-19(0)

2024-05-18(4)

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)