首页http://www.moe.gov.cn/jyb_sjzl/moe_560/2021/gedi/202212/t20221230_1037369.html 用python语言爬取该网页数据并存入csv文件中

http://www.moe.gov.cn/jyb_sjzl/moe_560/2021/gedi/202212/t20221230_1037369.html 用python语言爬取该网页数据并存入csv文件中

时间: 2023-07-19 21:50:41 浏览: 121

好的，我可以通过 Python 的 requests 和 BeautifulSoup 库来爬取数据，再使用 csv 库将数据存入 CSV 文件中。以下是代码示例： ```python import requests from bs4 import BeautifulSoup import csv # 发送 GET 请求获取网页 HTML 内容 url = 'http://www.moe.gov.cn/jyb_sjzl/moe_560/2021/gedi/202212/t20221230_1037369.html' response = requests.get(url) html = response.text # 使用 BeautifulSoup 解析 HTML，获取表格数据 soup = BeautifulSoup(html, 'html.parser') table = soup.find('table') rows = table.find_all('tr') # 将数据写入 CSV 文件 with open('data.csv', 'w', newline='') as csvfile: writer = csv.writer(csvfile) for row in rows: cols = row.find_all('td') if cols: data = [col.text.strip() for col in cols] writer.writerow(data) ``` 执行以上代码后，数据将被保存在名为 `data.csv` 的 CSV 文件中。

阅读全文