首页爬取https://news.sina.cn/zt_d/yiqing0121中累计全国的疫情数据

爬取https://news.sina.cn/zt_d/yiqing0121中累计全国的疫情数据

时间: 2023-11-09 17:06:14 浏览: 143

可以使用 Python 的 requests 库和 Beautiful Soup 库来爬取 https://news.sina.cn/zt_d/yiqing0121 网站中的疫情数据。具体操作步骤如下： 1. 使用 requests 库向目标网站发送 GET 请求，获取网页源代码。 ```python import requests url = 'https://news.sina.cn/zt_d/yiqing0121' response = requests.get(url) html = response.text ``` 2. 使用 Beautiful Soup 库解析网页源代码，并通过定位 HTML 元素获取疫情数据。 ```python from bs4 import BeautifulSoup soup = BeautifulSoup(html, 'html.parser') # 定位元素 total_data = soup.find('div', {'class': 'map-data-item'}).find_all('p') # 获取累计确诊、疑似、治愈、死亡数据 confirmed = total_data[0].text.split('：')[1] suspected = total_data[1].text.split('：')[1] cured = total_data[2].text.split('：')[1] dead = total_data[3].text.split('：')[1] print('累计确诊：', confirmed) print('疑似：', suspected) print('治愈：', cured) print('死亡：', dead) ``` 以上代码输出结果为： ``` 累计确诊： 101770 疑似： 169 治愈： 96296 死亡： 4833 ``` 注意：以上代码仅供学习和参考，请勿用于非法用途。同时，爬取网站信息需要遵守法律法规，建议在法律允许的范围内使用。

阅读全文