
Scraping web page data into a table with Python

Time: 2023-11-11 20:50:48 Views: 88

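Sure, you can use the BeautifulSoup and pandas libraries in Python to do this. Here is a simple example:

```python
import requests
from bs4 import BeautifulSoup
import pandas as pd

# Send the request
url = "https://www.example.com"
response = requests.get(url, timeout=10)
response.raise_for_status()  # stop early on HTTP errors

# Parse the page content
soup = BeautifulSoup(response.text, "html.parser")

# Find the first table on the page
table = soup.find("table")
if table is None:
    raise ValueError("No <table> element found on the page")

# Read the table rows into a list, skipping rows without <td> cells (such as header rows)
data = []
for row in table.find_all("tr"):
    cols = [col.text.strip() for col in row.find_all("td")]
    if cols:
        data.append(cols)

# The column names are placeholders; the list must match the table's column count
df = pd.DataFrame(data, columns=["Column 1", "Column 2", "Column 3"])

# Save the data to an Excel file (writing .xlsx requires the openpyxl package)
df.to_excel("data.xlsx", index=False)
```

This code extracts the table data from the page and stores it in a pandas DataFrame, then saves the DataFrame to an Excel file named "data.xlsx". The example URL and the three column names are placeholders, so you will need to adapt them to the page you are scraping.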
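If installing third-party packages is not an option, the same idea can be sketched with only the standard library. This is an alternative to the BeautifulSoup and pandas approach above, not a replacement for it. It swaps in `html.parser` for parsing and `csv` for output, so the result is CSV text rather than an Excel file. The names `TableParser`, `html_table_to_rows`, and `rows_to_csv` are hypothetical helpers made up for this illustration. The sketch assumes the first table is well-formed (explicit closing tags, no nested tables) and it parses an inline sample string instead of fetching a live page.

```python
import csv
import io
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the cell text of each <tr> up to the first closing </table>."""

    def __init__(self):
        super().__init__()
        self.rows = []       # finished rows, each a list of cell strings
        self._row = None     # cells of the row currently being read
        self._cell = None    # text fragments of the cell currently being read
        self._done = False   # True once the first table has been closed

    def handle_starttag(self, tag, attrs):
        if self._done:
            return
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a cell
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if self._done:
            return
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None
        elif tag == "table":
            self._done = True


def html_table_to_rows(html):
    """Return the rows of the first table in `html` as lists of strings."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    return parser.rows


def rows_to_csv(rows):
    """Serialize rows to CSV text; write this string to a file to save it."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


# Made-up sample markup standing in for a downloaded page
sample = """
<table>
  <tr><th>Name</th><th>Price</th></tr>
  <tr><td>Apple</td><td> 1.20 </td></tr>
  <tr><td>Pear</td><td>0.80</td></tr>
</table>
"""

rows = html_table_to_rows(sample)
print(rows_to_csv(rows))
```

Unlike the example above, this version keeps the `<th>` header row as the first row, so the CSV carries the table's own column names instead of hard-coded ones. Spreadsheet programs such as Excel can open the resulting CSV directly.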
