首页使用python爬取评论数据的代码

使用python爬取评论数据的代码

时间: 2024-09-18 11:14:27 浏览: 82

在Python中爬取评论数据通常需要结合使用`requests`库来发送HTTP请求获取网页内容，然后利用如`BeautifulSoup`或`lxml`解析HTML提取评论部分。这里是一个简单的示例，假设我们想从一个包含评论的HTML列表中提取信息： ```python import requests from bs4 import BeautifulSoup # 目标URL url = 'https://example.com/reviews' # 替换为你想要爬取的网站URL # 发送GET请求 response = requests.get(url) # 检查请求是否成功 if response.status_code == 200: # 解析HTML内容 soup = BeautifulSoup(response.text, 'html.parser') # 查找评论容器，这将取决于目标网站的结构 comments_container = soup.find('div', {'class': 'comments'}) # 提取每个评论 for comment in comments_container.find_all('p'): # 假设评论在`<p>`标签内 text = comment.get_text() print(text) else: print(f"请求失败，状态码：{response.status_code}") # 如果需要保存评论，可以使用`with open('comments.txt', 'w') as f:`代替print，将文本写入文件

阅读全文