正则练习

line = "hoooooooohhrper123"
regs = ".*?(h.*?h).*"
match_obj = re.match(regs,line)
print(match_obj.group(1))
>> hooooooooh

line = "hoooooooohhrper123"
regs = ".*(h.*h).*"
>> hh

line = "hoooooooohhhrper123"
regs = ".*(h.+h).*"
>> hhh

line = "hoooooooohhaahrper123"
regs = ".*(h.+h).*"
>> haah

line = "hoooooooohhhaahrper123"
regs = ".*?(h.{1}h).*"
>> hhh

line = "hoooooooohhhaahrper123"
regs = ".*?(h.{2,5}h).*"
>> hhhaah

[]表示任意其中一个,就能满足匹配
\s 表示空格 (A\sB)
[\u4E00-\u9FA5] 匹配中文

字符串编码:GB2312中文编码,Unicode统一编码,出现可变长utf-8编码,
把英文变成一个字节,汉字3个字节,特别生僻的变成4-6个字节。

猜你喜欢

转载自blog.csdn.net/sinat_33588424/article/details/80882457
今日推荐