python selenium webdriver 手册文档 - 代码天地

python selenium webdriver 手册文档

其他 2018-11-05 00:30:05 阅读次数: 0

python selenium webdriver 手册文档

1.安装与配置

pip install selenium

基本使用selenium都是为了动态加载网页内容用于爬虫，所以一般也会用到phantomjs

mac下如果要配置phantomjs环境的话

echo $PATH

ln -s

至于chromeDriver，配置方法类似，下载地址：

https://npm.taobao.org/mirrors/chromedriver/

2.代码样例

复制代码

#!/usr/bin/env Python

# coding=utf-8

from selenium import webdriver

from selenium.webdriver.common.keys import Keys

import time

keyword = '家有'.decode('utf-8')

chrome_options = webdriver.ChromeOptions()

# chrome_options.binary_location = "C:\\Program Files (x86)\\Google\\Application\\chrome.exe"

# chrome_options.add_argument('--user-agent=iphone')

# chrome_options.add_argument('--proxy-server=http://61.155.164.110:3128')

#driver = webdriver.Ie()

#driver = webdriver.Firefox()

driver = webdriver.Chrome(chrome_options=chrome_options)

driver.get('http://www.baidu.com')

driver.find_element_by_id('kw').clear()

time.sleep(1)

driver.find_element_by_id('kw').send_keys(keyword)

time.sleep(3)

#driver.find_element_by_id('su').send_keys(Keys.ENTER)

driver.find_element_by_id('su').click()

print driver.title

# driver.quit()

复制代码

3.api速查

3.1定位元素

3.1.1 通过id查找：

element = driver.find_element_by_id("coolestWidgetEvah")

or

from selenium.webdriver.common.by import By

element = driver.find_element(by=By.ID, value="coolestWidgetEvah")

3.1.2 通过class查找

cheeses = driver.find_elements_by_class_name("cheese")

or

from selenium.webdriver.common.by import By

cheeses = driver.find_elements(By.CLASS_NAME, "cheese")

3.1.3 通过标签名称查找

target_div = driver.find_element_by_tag_name("div")

or

from selenium.webdriver.common.by import By

target_div = driver.find_element(By.TAG_NAME, "div")

3.1.4 通过name属性查找

btn = driver.find_element_by_name("input_btn")

or

from selenium.webdriver.common.by import By

btn = driver.find_element(By.NAME, "input_btn")

3.1.5 通过链接的内容查找

next_page = driver.find_element_by_link_text("下一页")

or

from selenium.webdriver.common.by import By

next_page = driver.find_element(By.LINK_TEXT, "下一页")

3.1.6 通过链接的部分内容查找

next_page = driver.find_element_by_partial_link_text("去下一页")

or

from selenium.webdriver.common.by import By

next_page = driver.find_element(By.PARTIAL_LINK_TEXT, "下一页")

3.1.7 通过css查找

cheese = driver.find_element_by_css_selector("#food span.dairy.aged")

or

from selenium.webdriver.common.by import By

cheese = driver.find_element(By.CSS_SELECTOR, "#food span.dairy.aged")

3.1.8 通过xpath查找

inputs = driver.find_elements_by_xpath("//input")

or

from selenium.webdriver.common.by import By

inputs = driver.find_elements(By.XPATH, "//input")

3.1.9 通过js查找

labels = driver.find_elements_by_tag_name("label")

inputs = driver.execute_script(

"var labels = arguments[0], inputs = []; for (var i=0; i < labels.length; i++){" +

"inputs.push(document.getElementByIdx_x_x(labels[i].getAttribute('for'))); } return inputs;", labels)

3.2 获取元素的文本信息

element = driver.find_element_by_id("element_id")

element.text

3.3 修改userAgent

profile = webdriver.FirefoxProfile()

profile.set_preference("general.useragent.override", "some UA string")

driver = webdriver.Firefox(profile)

3.4 cookies

复制代码

# Go to the correct domain

driver.get("http://www.example.com")

# Now set the cookie. Here's one for the entire domain

# the cookie name here is 'key' and its value is 'value'

driver.add_cookie({'name':'key', 'value':'value', 'path':'/'})

# additional keys that can be passed in are:

# 'domain' -> String,

# 'secure' -> Boolean,

# 'expiry' -> Milliseconds since the Epoch it should expire.

# And now output all the available cookies for the current URL

for cookie in driver.get_cookies():

print "%s -> %s" % (cookie['name'], cookie['value'])

# You can delete cookies in 2 ways

# By name

driver.delete_cookie("CookieName")

# Or all of them

driver.delete_all_cookies()

最后放一个自己的代码样例好了，完成的功能为找到搜索框输入搜索关键词然后点击搜索按钮，然后打开每个搜索结果并且输出网页源代码

# coding=utf-8

import time

from selenium import webdriver

from selenium.common.exceptions import TimeoutException

from selenium.webdriver.support.ui import WebDriverWait # available since 2.4.0

from selenium.webdriver.support import expected_conditions as EC # available since 2.26.0

# Create a new instance of the Firefox driver

driver = webdriver.Chrome()

# go to the home page

driver.get("http://www.baidu.com")

#获得当前窗口句柄

nowhandle = driver.current_window_handle

print driver.title

# find the element that's name attribute is qymc (the search box)

inputElement = driver.find_element_by_name("qymc")

print inputElement

# type in the search

inputElement.send_keys(u"加油网")

driver.find_element_by_name("imageField").click();

# submit the form (compare with google we can found that the search is not a standard form and can not be submitted, we do click instead)

# inputElement.submit()

try:

# overlap will happen if we do not move the page to the bottom

# the last link will be under another unrelevant link if we do not scroll to the bottom

driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")

#find all link and click them

for item in driver.find_elements_by_xpath('//*[@id="pagetest2"]/div/table/tbody/tr/td/a'):

item.click()

time.sleep(10)

#获取所有窗口句柄

allhandles=driver.window_handles

#在所有窗口中查找新开的窗口

for handle in allhandles:

if handle!=nowhandle:

#这两步是在弹出窗口中进行的操作，证明我们确实进入了

driver.switch_to_window(handle)

print driver.page_source

#返回到主窗口页面

driver.switch_to_window(nowhandle)

finally:

driver.quit()

猜你喜欢

转载自blog.csdn.net/fkew2009/article/details/83501911

python selenium webdriver 手册文档

selenium webdriver (python)大全

selenium + webdriver（python）（四）

Python使用Selenium的webdriver

Selenium webdriver api 调用属性方法 (文档手册)

selenium-webdriver(python) (十四) -- webdriver原理

python爬虫：selenium + webdriver + python

[译]Selenium Python文档：七、WebDriver API接口

Python use Selenium to control the webdriver

Selenium WebDriver 基于Python（一）

Python Selenium Webdriver 元素定位

webdriver,python,selenium下载地址

Python+selenium+webdriver爬虫

Python selenium webdriver 基本使用

Selenium WebDriver学习手册---2（使用Selenium API）

python3 selenium+webdriver+chrome

Python爬虫：对selenium的webdriver进行简单封装

selenium webdriver python 警告框的处理

selenium-webdriver(python) (十三) -- cookie处理

selenium-webdriver(python) (十六) --unittest 框架

selenium-webdriver(python) (十五) -- 鼠标事件

Python selenium的webdriver之鼠标悬停

Python学习笔记12：selenium webdriver

linux 无界面运行python + webdriver + selenium

python webdriver selenium wait 却找不到元素

selenium2 webdriver 常用的python 函数

webdriver元素操作-键盘（python+selenium）

Python selenium webdriver设置加载页面超时

python+selenium中webdriver相关资源

Python Selenium Webdriver常用方法总结

今日推荐

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

【转】spring中对控制反转和依赖注入的理解

tms webcore 安装和使用

java程序员进阶相关书籍

SpringMVC接受请求参数、

如何保存训练好的机器学习模型

MyEclipse、Eclipse设置项目JDK的三个地方

商超行业微信小程序开发定制一般多少钱（行业技术人员解读）

Markdown编辑器语言——30分钟入门到到精通

Linux系统下MongoDB的简单安装与基本操作

Power Strings

每日归档

更多

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)