Scraping provinces, cities, and districts from the National Bureau of Statistics with Python

Time: 2023-11-05 16:52:09 Views: 137

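Scraping national province, city, and district data with Python

A Python script for scraping national province, city, and district data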

Python's requests and BeautifulSoup libraries can be used to scrape province, city, and district information from the National Bureau of Statistics. Example code follows:

```python
from urllib.parse import urljoin

import requests
from bs4 import BeautifulSoup

INDEX_URL = 'http://www.stats.gov.cn/tjsj/tjbz/tjyqhdmhcxhfdm/2019/index.html'


def get_soup(url):
    """Fetch a page and parse it; the pages are GBK-encoded."""
    response = requests.get(url, timeout=10)
    response.encoding = 'gbk'
    return BeautifulSoup(response.text, 'html.parser')


def iter_rows(soup, page_url, selector):
    """Yield (code, name, child_url) for each table row matching selector."""
    for row in soup.select(selector):
        cells = row.select('td')
        link = row.find('a')
        # Resolve the relative href against the current page;
        # rows without a link have no child page.
        child_url = urljoin(page_url, link['href']) if link else None
        yield cells[0].text.strip(), cells[1].text.strip(), child_url


index_soup = get_soup(INDEX_URL)
for province in index_soup.select('.provincetr a'):
    province_code = province['href'][:2]  # href is expected to look like '11.html'
    print(province_code, province.text.strip())
    province_url = urljoin(INDEX_URL, province['href'])
    city_soup = get_soup(province_url)
    for city_code, city_name, city_url in iter_rows(city_soup, province_url, '.citytr'):
        print('\t', city_code, city_name)
        if city_url is None:
            continue
        county_soup = get_soup(city_url)
        # Cities without a county level list towns directly.
        selector = '.countytr' if county_soup.select('.countytr') else '.towntr'
        for county_code, county_name, _ in iter_rows(county_soup, city_url, selector):
            print('\t\t', county_code, county_name)
```

The code above prints the name and code of each province, city, and district. Note that the page encoding must be set to `gbk`.
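To illustrate why the encoding has to be set explicitly, here is a minimal offline sketch using only the standard library. It assumes the server does not declare a charset, in which case requests falls back to ISO-8859-1 for text responses. The helper `decode_page` and the sample string are hypothetical and are not part of the original script.

```python
def decode_page(raw: bytes, encoding: str = "gbk") -> str:
    """Decode raw page bytes with the given encoding (hypothetical helper)."""
    return raw.decode(encoding)


# Simulate the bytes a GBK-encoded page would send for a province name.
raw = "北京市".encode("gbk")

# Decoding with the wrong charset succeeds silently but yields garbled text.
garbled = decode_page(raw, "iso-8859-1")

# Decoding with the charset the page actually uses recovers the name.
correct = decode_page(raw, "gbk")

print(garbled == correct)   # False
print(correct == "北京市")   # True
```

Setting `response.encoding = 'gbk'` before reading `response.text` has the same effect as the second call: requests decodes the body with the encoding you specify instead of its guess.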
