首页Python读取html文件存为CSV文件

Python读取html文件存为CSV文件

时间: 2024-05-05 09:17:26 浏览: 169

可以使用Python的BeautifulSoup库来读取html文件，并使用csv库将数据存储为csv文件。以下是一个示例代码： ```python from bs4 import BeautifulSoup import csv # 读取html文件 with open('example.html', 'r') as f: soup = BeautifulSoup(f, 'html.parser') # 获取表格数据 table = soup.find('table') rows = table.find_all('tr') # 将表格数据存储为csv文件 with open('example.csv', 'w', newline='') as f: writer = csv.writer(f) for row in rows: cols = row.find_all('td') cols = [col.text.strip() for col in cols] writer.writerow(cols) ``` 在这个示例中，我们首先使用BeautifulSoup库读取名为example.html的文件，并使用html.parser解析器解析它。然后，我们通过查找表格标签和行标签来获取表格数据。最后，我们使用csv库将数据存储为名为example.csv的csv文件。请注意，在本示例中，我们假设表格中只包含数据，没有标题行。如果表格中有标题行，你需要先将标题行提取出来，并将它们写入csv文件的第一行，然后再写入数据行。

阅读全文