首页用BeautifulSoup解析指定标签下的href

用BeautifulSoup解析指定标签下的href

时间: 2024-04-05 14:22:44 浏览: 157

可以使用BeautifulSoup的`find_all`方法来查找指定标签下的所有子标签，然后再通过子标签的属性来获取href。下面是一个示例代码： ```python from bs4 import BeautifulSoup html = ''' <html> <body> <div class="links"> <a href="https://www.google.com">Google</a> <a href="https://www.baidu.com">Baidu</a> <a href="https://www.yahoo.com">Yahoo</a> </div> </body> </html> ''' soup = BeautifulSoup(html, 'html.parser') # 查找div标签下的所有a标签 links = soup.find('div', {'class': 'links'}).find_all('a') # 输出所有a标签的href属性值 for link in links: print(link['href']) ``` 输出结果： ``` https://www.google.com https://www.baidu.com https://www.yahoo.com ``` 在上面的代码中，我们首先使用`find`方法查找`class`属性为`links`的`div`标签，然后使用`find_all`方法查找该标签下的所有`a`标签。最后，我们使用`link['href']`来获取每个`a`标签的`href`属性值。

阅读全文