用python完成抓取微博文本的代码
时间: 2024-05-05 18:20:13 浏览: 95
以下是一个基本的Python代码示例,用于从微博抓取文本:
1. 首先,我们需要安装 `requests` 和 `beautifulsoup4` 库:
```
pip install requests
pip install beautifulsoup4
```
2. 然后,我们需要导入这些库:
```python
import requests
from bs4 import BeautifulSoup
```
3. 接下来,我们可以使用 `requests` 库来获取微博页面的 HTML 内容:
```python
url = 'https://weibo.com/u/1234567890'
response = requests.get(url)
html = response.content
```
4. 然后,我们可以使用 `beautifulsoup4` 库来解析 HTML 内容:
```python
soup = BeautifulSoup(html, 'html.parser')
```
5. 接下来,我们可以使用 `soup` 对象来查找微博文本:
```python
texts = []
for tag in soup.find_all('div', {'class': 'WB_text'}):
texts.append(tag.text)
```
6. 最后,我们可以打印或保存文本:
```python
for text in texts:
print(text)
```
完整代码示例:
```python
import requests
from bs4 import BeautifulSoup
url = 'https://weibo.com/u/1234567890'
response = requests.get(url)
html = response.content
soup = BeautifulSoup(html, 'html.parser')
texts = []
for tag in soup.find_all('div', {'class': 'WB_text'}):
texts.append(tag.text)
for text in texts:
print(text)
```
阅读全文
相关推荐
















