
How to Read a Local File and Parse Web Page Elements in Python

2023-09-12 14:25:05

The approach is shown below:

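from bs4 import BeautifulSoup

path = './web/new_index.html'

# Read the local HTML file and hand its contents to the lxml parser
with open(path, 'r') as f:
    Soup = BeautifulSoup(f.read(), 'lxml')

# Select every article title link by its full CSS path
titles = Soup.select('ul > li > div.article-info > h3 > a')
for title in titles:
    print(title.text)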

Output:
Sardinia's top 10 beaches
How to get tanned
How to be an Aussie beach bum
Summer's cheat sheet

# Here,
titles = Soup.select('ul > li > div.article-info > h3 > a')
# is equivalent, for this page, to
titles = Soup.select('h3 a')
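
The two selectors agree on this page, but they are not interchangeable in general: `>` matches direct children only, while a space matches any descendant. A minimal sketch of the difference, assuming a made-up snippet (not the article's page) and the built-in `html.parser` backend instead of `lxml` so that nothing extra needs installing:

```python
from bs4 import BeautifulSoup

# Hypothetical markup: one link sits on the full path used by the long
# selector, the other sits in an <h3> outside any list.
html = """
<ul>
  <li>
    <div class="article-info">
      <h3><a href="#">Inside the list</a></h3>
    </div>
  </li>
</ul>
<div class="sidebar">
  <h3><a href="#">Outside the list</a></h3>
</div>
"""

soup = BeautifulSoup(html, "html.parser")

# Child combinators: every step must be a direct parent/child pair.
strict = [a.get_text() for a in soup.select("ul > li > div.article-info > h3 > a")]

# Descendant combinator: any <a> anywhere inside any <h3>.
loose = [a.get_text() for a in soup.select("h3 a")]

print(strict)
print(loose)
```

The short form is convenient when every `h3` link on the page is a title; the long form is the safer choice if the page may contain other headings with links.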

print(title.text)
# is equivalent to
print(title.get_text())
print(title.string)  # .string is None if the tag has more than one child
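
These three give the same result here because each link contains only plain text. A small sketch of where they diverge, assuming two made-up links (one plain, one with a nested tag) and the `html.parser` backend:

```python
from bs4 import BeautifulSoup

# Hypothetical markup: the second link contains a nested <b> tag.
html = (
    '<h3><a href="#">Plain title</a></h3>'
    '<h3><a href="#">Title with <b>bold</b> part</a></h3>'
)
soup = BeautifulSoup(html, "html.parser")
plain, nested = soup.select("h3 a")

# With a single text child, all three accessors agree.
print(plain.text, plain.get_text(), plain.string, sep=" | ")

# With several children, .text and .get_text() join all the text,
# while .string has no single string to return and yields None.
print(nested.get_text())
print(nested.string)
```

If the titles might contain inline markup, `get_text()` is probably the more robust choice.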

The following code also works:

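import bs4

path = './web/new_index.html'

# Read the local HTML file and parse it with lxml
with open(path, 'r') as f:
    Soup = bs4.BeautifulSoup(f.read(), 'lxml')

# Any <a> inside an <h3> is treated as an article title
titles = Soup.select('h3 a')
for title in titles:
    print(title.string)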

Original HTML:

Home
Site
Other

Article

Sardinia's top 10 beaches
fun Wow
white sands and turquoise waters
4.5

How to get tanned
butt NSFW
hot bikini girls on beach
5.0

How to be an Aussie beach bum
sea
To make the most of your visit
3.5

Summer's cheat sheet
bay boat beach
choosing a beach in Cape Cod
3.0

© Mugglecoding
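
The listing above preserves only the page's text, not its tags. To try the code without the original file, here is a minimal end-to-end sketch. The markup is an assumption inferred from the selector `ul > li > div.article-info > h3 > a`; only the two titles come from the article. It also swaps in the built-in `html.parser` for `lxml` and passes an explicit `encoding`, neither of which the original code does.

```python
import tempfile
from pathlib import Path

from bs4 import BeautifulSoup

# Assumed markup: tags and class names are inferred from the selector,
# not copied from the original page.
SAMPLE_HTML = """<!DOCTYPE html>
<html>
<body>
  <ul>
    <li><div class="article-info"><h3><a href="#">Sardinia's top 10 beaches</a></h3></div></li>
    <li><div class="article-info"><h3><a href="#">How to get tanned</a></h3></div></li>
  </ul>
</body>
</html>
"""

with tempfile.TemporaryDirectory() as tmp:
    # Write the sample page to a throwaway local file.
    path = Path(tmp) / "new_index.html"
    path.write_text(SAMPLE_HTML, encoding="utf-8")

    # An explicit encoding avoids relying on the platform default.
    with open(path, "r", encoding="utf-8") as f:
        soup = BeautifulSoup(f.read(), "html.parser")

# The parsed tree lives in memory, so it can be queried after the file is gone.
titles = [a.get_text() for a in soup.select("ul > li > div.article-info > h3 > a")]
for title in titles:
    print(title)
```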

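That is everything we have to share on reading a local file and parsing web page elements in Python. We hope it serves as a useful reference, and we hope you will continue to support 毛票票.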
