python-selenium

from selenium import webdriver
from scrapy.selector import Selector
from time import sleep

chrome_opt = webdriver.ChromeOptions()
prefs = {"profile.managed_default_content_settings.images": 2}
chrome_opt.add_experimental_option("prefs", prefs)

browser = webdriver.Chrome(executable_path="D:/Program Files/selenium_brower/chromedriver.exe",chrome_options=chrome_opt)
browser.get("https://www.taobao.com")

 Setting profile.managed_default_content_settings.images to 2 tells Chrome not to load the images on a page.

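On Windows, write the driver path with forward slashes (/). Backslashes in an ordinary Python string are read as escape sequences and can corrupt the path.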
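
A small sketch of why the slash direction matters. It compares the forward-slash path used above with a raw-string version of the same path, and then shows a hypothetical path (D:\tools\new\test.exe, not from the original setup) that breaks when written with plain backslashes. Only the standard library is used.

```python
from pathlib import PureWindowsPath

# Forward slashes, as used in the snippet above.
forward = "D:/Program Files/selenium_brower/chromedriver.exe"
# A raw string is another safe way to write the same path.
raw = r"D:\Program Files\selenium_brower\chromedriver.exe"

# Windows treats both separators the same, so the two paths compare equal.
same_path = PureWindowsPath(forward) == PureWindowsPath(raw)

# Hypothetical path written with plain backslashes: "\t" becomes a tab
# and "\n" becomes a newline, so the string is no longer a usable path.
broken = "D:\tools\new\test.exe"
has_control_chars = "\t" in broken and "\n" in broken

print(same_path, has_control_chars)
```

Either the forward-slash form or the raw-string form should be safe; the plain-backslash form only works by luck when no character after a backslash happens to form an escape.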

browser.find_element_by_css_selector(".Login-content input[name='username']").send_keys("13460688542")

browser.find_element_by_css_selector(".Login-content input[name='password']").send_keys("3989441qwe")

browser.find_element_by_css_selector(".Button.SignFlow-submitButton.Button--primary.Button--blue").click()

 Elements can be located with CSS selectors, and send_keys types a value into the selected element.

js = "window.scrollTo(0,document.body.scrollHeight);var lenOfpage=document.body.scrollHeight;return lenOfpage "

browser.execute_script(js)

 This scrolls the page down to the bottom by running JavaScript.
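
On pages that load more content as you scroll, a single scroll may not reach the real bottom. The sketch below repeats the same JavaScript until the page height stops growing. The helper name scroll_to_bottom and the pause and max_rounds parameters are assumptions made for illustration; it only relies on the browser object having execute_script.

```python
import time


def scroll_to_bottom(browser, pause=1.0, max_rounds=20):
    """Scroll down repeatedly until the page height stops growing.

    Returns the last page height that was observed.
    """
    js = (
        "window.scrollTo(0, document.body.scrollHeight);"
        "return document.body.scrollHeight;"
    )
    last_height = browser.execute_script(js)
    for _ in range(max_rounds):
        # Give lazily loaded content some time to arrive.
        time.sleep(pause)
        new_height = browser.execute_script(js)
        if new_height == last_height:
            break  # nothing new was loaded, so this is the bottom
        last_height = new_height
    return last_height
```

With the browser created earlier this would be called as scroll_to_bottom(browser). The max_rounds cap is there so that an endlessly growing page cannot keep the loop running forever.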

Selenium support for PhantomJS has been deprecated, 
please use headless versions of Chrome or Firefox instead

Between PhantomJS and Chrome, Selenium officially recommends Chrome (in headless mode).
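
A minimal sketch of starting Chrome in headless mode, as the warning suggests. The helper names build_chrome_settings and make_browser are hypothetical, and the code assumes that chromedriver can be found on PATH and that the installed Selenium accepts the options keyword. Whether the image preference is honored in headless mode may depend on the Chrome version.

```python
def build_chrome_settings(headless=True, load_images=False):
    """Return (arguments, prefs) to apply to ChromeOptions."""
    arguments = ["--headless"] if headless else []
    prefs = {}
    if not load_images:
        # 2 means "block", the same value used earlier in this post.
        prefs["profile.managed_default_content_settings.images"] = 2
    return arguments, prefs


def make_browser(headless=True, load_images=False):
    """Create a Chrome driver (assumes chromedriver is on PATH)."""
    # Imported here so the settings helper works without Selenium.
    from selenium import webdriver

    arguments, prefs = build_chrome_settings(headless, load_images)
    options = webdriver.ChromeOptions()
    for arg in arguments:
        options.add_argument(arg)
    if prefs:
        options.add_experimental_option("prefs", prefs)
    return webdriver.Chrome(options=options)
```

Calling make_browser() should give a browser object that can be used like the one above, for example browser.get("https://www.taobao.com"), without opening a visible window.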

t_selector = Selector(text=browser.page_source)
t_selector.css(".tm-promo-price .tm-price::text").extract()

 Parsing the page source with Scrapy's CSS selectors is very convenient.

http://selenium-python.readthedocs.io/installation.html

The official documentation explains this in the most detail; other sources are secondary.

Reposted from 394498036.iteye.com/blog/2406882