爬取去哪儿网的5A旅游景点数据代码
时间: 2023-07-28 12:06:33 浏览: 66
以下是使用 Python 和 requests 库爬取去哪儿网 5A级旅游景点数据的示例代码:
```python
import requests
from lxml import etree
# 请求头
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3'}
# 发送请求并获取响应
url = 'https://piao.qunar.com/ticket/list.htm?keyword=5A%E7%BA%A7%E6%99%AF%E7%82%B9®ion=&from=mpl_search_suggest&page={}'
for i in range(1, 6):
response = requests.get(url.format(i), headers=headers)
html = etree.HTML(response.text)
# 解析数据
items = html.xpath('//div[@class="result_list"]//div[@class="sight_item"]')
for item in items:
name = item.xpath('.//h3/a/text()')[0]
address = item.xpath('.//p[@class="address"]/span/text()')[0]
level = item.xpath('.//span[contains(@class, "sight_item_price")]/preceding-sibling::span[@class="level"]/text()')[0]
price = item.xpath('.//span[contains(@class, "sight_item_price")]/em/text()')[0]
print(name, address, level, price)
```
该代码会爬取去哪儿网上的前5页5A级旅游景点数据,并输出名称、地址、级别和价格信息。你可以根据需要进行修改和扩展。