
Python code for scraping Weibo post text

Time: 2024-05-05 18:20:13 Views: 95

Below is a basic Python example for scraping post text from Weibo:

1. First, install the `requests` and `beautifulsoup4` libraries:

```
pip install requests
pip install beautifulsoup4
```

2. Then import them:

```python
import requests
from bs4 import BeautifulSoup
```

3. Next, use `requests` to fetch the HTML of the Weibo page:

```python
url = 'https://weibo.com/u/1234567890'
# A timeout keeps the call from hanging indefinitely on a stalled connection
response = requests.get(url, timeout=10)
html = response.content
```

4. Then parse the HTML with `beautifulsoup4`:

```python
soup = BeautifulSoup(html, 'html.parser')
```

5. Next, use the `soup` object to find the post text:

```python
texts = []
# Collect the text of every <div> whose class includes WB_text
for tag in soup.find_all('div', {'class': 'WB_text'}):
    texts.append(tag.text)
```

6. Finally, print or save the text:

```python
for text in texts:
    print(text)
```

Complete example:

```python
import requests
from bs4 import BeautifulSoup

# Fetch the page HTML (the timeout avoids hanging on a stalled connection)
url = 'https://weibo.com/u/1234567890'
response = requests.get(url, timeout=10)
html = response.content

# Parse the HTML and collect the text of every <div class="WB_text">
soup = BeautifulSoup(html, 'html.parser')
texts = []
for tag in soup.find_all('div', {'class': 'WB_text'}):
    texts.append(tag.text)

# Print each post's text
for text in texts:
    print(text)
```
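The example above assumes the response already contains `WB_text` elements. In practice the page may require login cookies or render its content with JavaScript, in which case the loop simply finds nothing. A minimal sketch of a slightly more defensive layout follows: it separates fetching from parsing so the parsing step can be checked offline. The function names `extract_texts` and `fetch_html`, the `User-Agent` value, and the sample HTML are all illustrative assumptions, not part of the original answer.

```python
import requests
from bs4 import BeautifulSoup


def extract_texts(html):
    """Return the stripped text of every <div> whose class includes WB_text."""
    soup = BeautifulSoup(html, 'html.parser')
    texts = []
    for tag in soup.find_all('div', class_='WB_text'):
        text = tag.get_text(strip=True)
        if text:  # skip divs that contain no text
            texts.append(text)
    return texts


def fetch_html(url, timeout=10):
    """Download a page; the User-Agent value here is an assumption."""
    headers = {'User-Agent': 'Mozilla/5.0'}
    response = requests.get(url, headers=headers, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx responses
    return response.text


# Hypothetical offline sample, used so parsing can be checked without network access
sample_html = '''
<div class="WB_text">  First post  </div>
<div class="WB_text W_f14">Second post</div>
<div class="other">Not a post</div>
<div class="WB_text"></div>
'''
sample_texts = extract_texts(sample_html)
print(sample_texts)
```

To run it against a live page, call `extract_texts(fetch_html(url))`. An empty result then suggests that the server returned a login or script-rendered page rather than the post markup.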
