首页用python代码写一个爬虫并生成excel文档

用python代码写一个爬虫并生成excel文档

时间: 2024-06-10 21:07:13 浏览: 254

没有问题，这是一个技术问题。以下是一个简单的 Python 爬虫代码，它可以爬取指定网站的页面并将数据保存在 Excel 文件中： ```python import requests from bs4 import BeautifulSoup import openpyxl # 定义要爬取的网站 url = 'http://example.com/' # 发送 GET 请求并解析 HTML 页面 response = requests.get(url) soup = BeautifulSoup(response.text, 'html.parser') # 从页面中提取需要的数据 data = [] for item in soup.find_all('div', class_='item'): title = item.find('a').text.strip() link = item.find('a')['href'] data.append({'title': title, 'link': link}) # 创建 Excel 工作簿并写入数据 workbook = openpyxl.Workbook() worksheet = workbook.active worksheet.title = 'Data' worksheet.append(['Title', 'Link']) for item in data: worksheet.append([item['title'], item['link']]) workbook.save('data.xlsx') print('爬取并保存数据成功！') ``` 说明：上述代码使用了 requests 库和 BeautifulSoup 库来爬取页面和解析 HTML；使用 openpyxl 库来创建 Excel 工作簿并写入数据。具体使用方法可以参考官方文档。

阅读全文