首页python爬虫去哪儿网代码

python爬虫去哪儿网代码

时间: 2023-07-27 16:06:48 浏览: 198

以下是使用 Python requests 和 BeautifulSoup 库爬取去哪儿网酒店信息的示例代码： ```python import requests from bs4 import BeautifulSoup # 构造请求头 headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3'} # 构造请求参数 params = { 'cityName': '北京', 'page': '1' } # 发送 GET 请求 r = requests.get('http://hotel.qunar.com/city/beijing_city/', headers=headers, params=params) # 解析 HTML 页面 soup = BeautifulSoup(r.text, 'html.parser') # 获取酒店列表 hotel_list = soup.find_all('div', class_='item_hotel_info') # 输出酒店名称和价格 for hotel in hotel_list: name = hotel.find('span', class_='item_hotel_title').text price = hotel.find('span', class_='item_price').text print('酒店名称：', name, '\t价格：', price) ``` 以上代码会爬取去哪儿网北京市的酒店信息，并输出酒店名称和价格。注意，这只是一个示例代码，实际爬取需要根据网站的变化进行调整。同时，为了遵守网站的规则，需要在请求头中设置 User-Agent，并且不要频繁地进行爬取。

阅读全文