首页用python爬取网页微博的评论数据

用python爬取网页微博的评论数据

时间: 2023-10-06 21:13:37 浏览: 73

要爬取网页微博的评论数据，可以使用Python中的requests和BeautifulSoup库进行爬取和解析。以下是一个简单的示例代码： ```python import requests from bs4 import BeautifulSoup url = 'https://weibo.com/1234567890/ABCDEfghi?filter=hot' headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3'} response = requests.get(url, headers=headers) soup = BeautifulSoup(response.text, 'html.parser') comments = soup.find_all('div', class_='WB_text') for comment in comments: print(comment.text.strip()) ``` 其中，url是要爬取的微博页面的链接，headers是请求头，可以模拟浏览器进行访问。使用requests库的get方法获取页面内容后，使用BeautifulSoup库进行解析，通过find_all方法找到所有评论的标签，然后打印出来。需要注意的是，微博页面的HTML结构可能会改变，需要根据实际情况进行修改代码。

阅读全文