Scraping phonetic spellings from the Oxford dictionary with Python requests

Other 2019-01-14 19:42:17

import requests
import re


def phonetic_spelling(word):
    """Return the phonetic spelling of word, or "" if the page has none."""
    # Multi-word entries use underscores in the URL path.
    word = word.replace(" ", "_")

    # The entry URL follows a regular pattern: base URL + word.
    # A timeout keeps the call from hanging if the site does not respond.
    response = requests.get("https://en.oxforddictionaries.com/definition/" + word, timeout=10)
    html = response.text

    # Inspecting the page shows that the line holding the phonetic spelling
    # has a regular HTML format, which this regular expression describes.
    pattern = r'<span\s+class="phoneticspelling">/([^/]*)/</span>'
    match = re.search(pattern, html, re.I)

    # group(1) can be an empty string, so check that it is non-empty too.
    if match and match.group(1):
        spelling = match.group(1)
        print("\nphoneticSpelling: ", word, "--->", spelling)
        return spelling

    print('\nword "' + word + '" has no phonetic spelling in the dictionary')
    return ""


# Test
print(phonetic_spelling("Chinese"))

print(phonetic_spelling("English"))

print(phonetic_spelling("translation"))

print(phonetic_spelling("language"))

print(phonetic_spelling("crawler"))
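
The function above mixes the network request with the parsing, so it can only be checked while the site is reachable. The following is a minimal sketch, not the author's method, that separates the two steps so the URL rule and the regular expression can be tried offline. The helper names build_url and extract_phonetic, the use of urllib.parse.quote, and the sample HTML string (including the phonetic value in it) are assumptions made for illustration; the real page markup may differ.

```python
import re
from urllib.parse import quote

# Same pattern as in the post: the text between the two slashes
# inside <span class="phoneticspelling">...</span>.
PATTERN = re.compile(r'<span\s+class="phoneticspelling">/([^/]*)/</span>', re.I)


def build_url(word, base="https://en.oxforddictionaries.com/definition/"):
    """Build the entry URL: spaces become underscores, the rest is percent-encoded."""
    return base + quote(word.replace(" ", "_"))


def extract_phonetic(html):
    """Return the phonetic spelling found in html, or "" if there is no match."""
    match = PATTERN.search(html)
    return match.group(1) if match else ""


# Hypothetical HTML fragment, written by hand to mimic the structure
# that the regular expression expects. It is not copied from the site.
sample_html = '<div><span class="phoneticspelling">/ˈkrɔːlə/</span></div>'

print(build_url("ice cream"))
print(extract_phonetic(sample_html))
print(repr(extract_phonetic("<p>no phonetic spelling here</p>")))
```

With this split, the downloaded text from requests.get(...).text can be passed straight to extract_phonetic, and the parsing can be verified against saved pages without any network access.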

Reposted from blog.csdn.net/MAILLIBIN/article/details/83152531
