
Python crawler code that collects popular tourist attraction data and saves it to a CSV file

Date: 2024-10-21 15:15:07 Views: 29

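In Python, we can use the `requests` library to fetch the page, the `BeautifulSoup` library to parse the HTML, and the built-in `csv` module to store the results in a CSV file. Below is a simple example. It assumes we want to scrape information about popular tourist attractions, such as name, address, and rating, from a site that lists them in an HTML table:

```python
import csv

import requests
from bs4 import BeautifulSoup

# Target URL
url = "https://example.com/hotels"  # replace with the real attraction list page

# Send the GET request and stop early on an HTTP error status
response = requests.get(url, timeout=10)
response.raise_for_status()
content = response.content

# Parse the HTML with BeautifulSoup
soup = BeautifulSoup(content, 'html.parser')

# Assumed layout: each attraction is one table row whose td cells hold
# the fields in the order ['Name', 'Address', 'Rating']
data_rows = []
table = soup.find('table')  # find the table that holds the attraction data
if table:
    rows = table.find_all('tr')
    for row in rows[1:]:  # skip the header row
        cols = row.find_all('td')
        cols_data = [col.text.strip() for col in cols]  # extract the cell text
        data_rows.append(cols_data)

# Save the data to a CSV file
with open('hotels.csv', 'w', newline='', encoding='utf-8') as csvfile:
    fieldnames = ['Name', 'Address', 'Rating']
    writer = csv.writer(csvfile)
    writer.writerow(fieldnames)  # write the header
    writer.writerows(data_rows)
```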
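The CSV-writing step can be tried on its own, without any network access. The following is a minimal sketch using only the standard library. The helper name `rows_to_csv` and the sample rows are hypothetical and purely illustrative (they are not scraped data), and the three-column layout is the same assumption the example above makes. It also skips rows whose cell count does not match the header, which can happen when a `tr` contains only `th` cells or uses merged cells.

```python
import csv
import io

# Assumed column layout, matching the example above
FIELDNAMES = ["Name", "Address", "Rating"]


def rows_to_csv(rows, fieldnames=FIELDNAMES):
    """Return CSV text for rows, skipping rows whose cell count differs from the header."""
    buffer = io.StringIO(newline="")  # newline="" lets the csv module control line endings
    writer = csv.writer(buffer)
    writer.writerow(fieldnames)
    for row in rows:
        if len(row) != len(fieldnames):
            continue  # e.g. an empty list produced by a row without td cells
        writer.writerow(row)
    return buffer.getvalue()


# Made-up sample rows for illustration only
sample_rows = [
    ["West Lake", "Hangzhou, Zhejiang", "4.8"],
    [],  # malformed row, skipped
    ["The Bund", "Shanghai", "4.7"],
]
csv_text = rows_to_csv(sample_rows)
print(csv_text)
```

Because the address in the first sample row contains a comma, the `csv` module quotes that field automatically, which is one reason to prefer it over joining strings by hand.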
