首页用Python爬取http://www.stats.gov.cn/sj/tjgb/rkpcgb/qgrkpcgb/202302/t20230206_1902003.html网站的数据并写入Excel表格中的python代码

用Python爬取http://www.stats.gov.cn/sj/tjgb/rkpcgb/qgrkpcgb/202302/t20230206_1902003.html网站的数据并写入Excel表格中的python代码

时间: 2023-08-14 16:03:38 浏览: 231

以下是使用Python爬取网站数据并写入Excel表格的示例代码： ```python import requests from bs4 import BeautifulSoup import xlwt # 发送HTTP请求获取网页内容 url = 'http://www.stats.gov.cn/sj/tjgb/rkpcgb/qgrkpcgb/202302/t20230206_1902003.html' res = requests.get(url) res.encoding = 'utf-8' html = res.text # 使用BeautifulSoup解析HTML soup = BeautifulSoup(html, 'html.parser') table = soup.find('table', {'class': 'MsoNormalTable'}) # 创建Excel工作簿并添加工作表 book = xlwt.Workbook(encoding='utf-8') sheet = book.add_sheet('Sheet1') # 获取表格内容并写入Excel表格 for row_index, row in enumerate(table.find_all('tr')): for col_index, cell in enumerate(row.find_all(['td', 'th'])): sheet.write(row_index, col_index, cell.get_text().strip()) # 保存Excel表格 book.save('data.xls') ``` 以上代码使用 requests 库发送HTTP请求获取网页内容，使用 BeautifulSoup 库解析HTML，然后使用 xlwt 库创建Excel工作簿并添加工作表，最后将表格内容写入Excel表格并保存。注意，在写入表格时，我们使用了 enumerate() 函数来获取行和列的索引。

阅读全文