首页利用beautifulsoup如何获取标签a下的href

利用beautifulsoup如何获取标签a下的href

时间: 2024-06-10 15:03:46 浏览: 159

可以使用以下代码获取标签a下的href： ```python from bs4 import BeautifulSoup html = """ <html> <head> <title>Test</title> </head> <body> <div> <a href="http://www.example.com">Example</a> </div> </body> </html> """ soup = BeautifulSoup(html, 'html.parser') link = soup.find('a') href = link.get('href') print(href) ``` 输出结果为：`http://www.example.com`

利用beautifulsoup如何获取标签div下的a标签

利用BeautifulSoup可以通过以下代码获取标签div下的a标签： ```python from bs4 import BeautifulSoup html = '<div><a href="https://www.example.com">Link text</a></div>' soup = BeautifulSoup(html, 'html.parser') div_tag = soup.find('div') a_tag = div_tag.find('a') print(a_tag) ``` 其中，`find()` 方法可以用来查找指定的标签，而 `div_tag.find('a')` 语句则是在 div 标签下查找 a 标签。如果页面中有多个 div 标签，可以使用 `find_all('div')` 方法获取所有的 div 标签，之后再遍历所有的 div 标签，并查找其下的 a 标签。

利用beautifulsoup如何获取多个标签a下的多个href

？你可以使用beautifulsoup的find_all方法来获取多个标签a，然后使用for循环遍历每个a标签，再使用.get('href')方法获取每个a标签下的href属性值。以下是一个示例代码： ``` from bs4 import BeautifulSoup import requests # 构建网页请求 url = 'http://example.com' response = requests.get(url) # 解析网页内容 soup = BeautifulSoup(response.text, 'html.parser') # 获取所有的a标签 a_tags = soup.find_all('a') for a in a_tags: href = a.get('href') if href is not None: print(href) ``` 这段代码将会输出所有a标签下的href属性值。

阅读全文