首页pyhton 爬取广东省统计年鉴数据

pyhton 爬取广东省统计年鉴数据

时间: 2023-06-19 08:08:09 浏览: 393

可以使用 Python 中的 requests 和 BeautifulSoup 库来爬取广东省统计年鉴数据。步骤如下： 1. 打开数据源网站 http://tjj.gd.gov.cn/data/yearbook/2021/index.html 2. 使用 requests 库发送 GET 请求获取网页源码，代码如下： ```python import requests url = 'http://tjj.gd.gov.cn/data/yearbook/2021/index.html' response = requests.get(url) html = response.text ``` 3. 使用 BeautifulSoup 库解析网页源码，获取需要的数据，代码如下： ```python from bs4 import BeautifulSoup soup = BeautifulSoup(html, 'html.parser') table = soup.find('table', class_='table table-bordered table-hover') thead = table.thead.tr tbody = table.tbody years = [th.text for th in thead.find_all('th')[1:]] data = {} for tr in tbody.find_all('tr'): tds = tr.find_all('td') topic = tds[0].text values = [td.text for td in tds[1:-1]] data[topic] = dict(zip(years, values)) ``` 4. 将数据保存到本地文件，代码如下： ```python import json with open('data.json', 'w') as f: json.dump(data, f) ``` 完整代码如下： ```python import requests from bs4 import BeautifulSoup import json url = 'http://tjj.gd.gov.cn/data/yearbook/2021/index.html' response = requests.get(url) html = response.text soup = BeautifulSoup(html, 'html.parser') table = soup.find('table', class_='table table-bordered table-hover') thead = table.thead.tr tbody = table.tbody years = [th.text for th in thead.find_all('th')[1:]] data = {} for tr in tbody.find_all('tr'): tds = tr.find_all('td') topic = tds[0].text values = [td.text for td in tds[1:-1]] data[topic] = dict(zip(years, values)) with open('data.json', 'w') as f: json.dump(data, f) ``` 这样就可以将广东省统计年鉴数据爬取下来并保存到本地文件中了。

阅读全文

大家在看

暗通道去雾算法_何凯明去雾_matlab_去雾_去雾算法_暗通道算法_

何凯明的暗通道去雾算法matlab代码，可运行

基于YOLOv10+DeepSort实现视频中目标跟踪算法Python源码+详细使用说明.zip

基于YOLOv10+DeepSort实现视频中目标跟踪算法源码+详细使用说明.zip 基于YOLOv10+DeepSort实现视频中目标跟踪算法源码+详细使用说明.zip 基于YOLOv10+DeepSort实现视频中目标跟踪算法源码+详细使用说明.zip 基于YOLOv10+DeepSort实现视频中目标跟踪算法源码+详细使用说明.zip 基于YOLOv10+DeepSort实现视频中目标跟踪算法源码+详细使用说明.zip

电信设备-一种血糖数据查询方法及移动终端.zip

FAST FACTORIZED_FFBP论文_FFBP_后向投影.zip

威布尔参数估计，可靠性与寿命预测方向，机械工程,威布尔分布寿命预测,matlab源码.rar

最新推荐

pyhton 爬取广东省统计年鉴数据

相关推荐

python 爬虫 爬取国家统计局 行政区数据

统计局数据爬取.py

python爬取飞猪旅游网数据（有数据）

python爬取雅虎财经股票交易数据

python爬取58同城二手房源数据

Python爬取网站下厨房早餐数据，可另行修改爬取相关数据

python爬取NBA球员并进行数据可视化

使用python爬取天气信息（包括历史天气数据）_python爬取天气数据-CSDN博客.html

Python爬取国家水稻信息进行数据分析可视化

python爬取链家网租房数据

python爬取中国票房网数据

Python爬取豆瓣top250电影数据，并导入MySQL，写入excel

python爬取链家新房数据

Python爬取猫眼豆瓣数据

Python爬取世界港口数据

使用python爬取疫情数据

通过抓取淘宝评论为例讲解Python爬取ajax动态生成的数据(经典)

【Python爬取分析】NBA比赛数据形成可视化结构（附说明文档）

大家在看

暗通道去雾算法_何凯明去雾_matlab_去雾_去雾算法_暗通道算法_

基于YOLOv10+DeepSort实现视频中目标跟踪算法Python源码+详细使用说明.zip

电信设备-一种血糖数据查询方法及移动终端.zip

FAST FACTORIZED_FFBP论文_FFBP_后向投影.zip

威布尔参数估计，可靠性与寿命预测方向，机械工程,威布尔分布寿命预测,matlab源码.rar

最新推荐

Python爬取数据并实现可视化代码解析

Python爬取破解无线网络wifi密码过程解析

python爬取cnvd漏洞库信息的实例

Python爬取数据保存为Json格式的代码示例

python 爬取马蜂窝景点翻页文字评论的实现

世界地图Shapefile文件解析与测试指南

Python环境监控高可用构建：可靠性增强的策略

需要在matlab当中批量导入表格数据的指令

Sqlcipher 3.4.0版本发布，优化SQLite兼容性

Python环境监控性能监控与调优：专家级技巧全集

python 爬虫爬取国家统计局行政区数据