首页如何使用python爬取网页文本到excle中

如何使用python爬取网页文本到excle中

时间: 2023-09-24 17:11:52 浏览: 103

您可以使用Python中的爬虫框架（如BeautifulSoup、Scrapy等）来爬取网页文本，并使用Python中的pandas库将数据存储到Excel中。以下是一个简单的示例代码： ```python import requests from bs4 import BeautifulSoup import pandas as pd # 发送请求获取网页内容 url = 'https://www.example.com' res = requests.get(url) soup = BeautifulSoup(res.content, 'html.parser') # 提取需要的文本信息 data = [] for item in soup.find_all('div', {'class': 'item'}): name = item.find('h2', {'class': 'name'}).text.strip() price = item.find('div', {'class': 'price'}).text.strip() data.append([name, price]) # 将数据存储到Excel中 df = pd.DataFrame(data, columns=['Name', 'Price']) df.to_excel('data.xlsx', index=False) ``` 以上代码演示了如何从网页中提取商品名称和价格，并将数据存储到名为"data.xlsx"的Excel文件中。您可以根据需要修改代码来适应您的爬虫需求。

阅读全文