首页淘宝爬虫python代码

淘宝爬虫python代码

时间: 2023-10-28 07:58:42 浏览: 200

以下是一个简单的使用Python的淘宝爬虫代码示例： ```python import requests from bs4 import BeautifulSoup def get_taobao_info(keyword): url = f'https://s.taobao.com/search?q={keyword}' headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/89.0.4389.82 Safari/537.36' } response = requests.get(url, headers=headers) soup = BeautifulSoup(response.text, 'html.parser') items = soup.find_all('div', class_='J_MouserOnverReq') for item in items: title = item.find('a', class_='J_ClickStat').text.strip() price = item.find('strong').text shop = item.find('div', class_='shop').text.strip() print(f'Title: {title}\nPrice: {price}\nShop: {shop}\n') keyword = input("请输入要搜索的关键字：") get_taobao_info(keyword) ``` 这段代码使用requests库发送HTTP请求，将淘宝搜索页面的HTML内容获取下来，然后使用BeautifulSoup库解析HTML内容，提取出商品的标题、价格和店铺信息，并打印输出。请注意，使用爬虫爬取淘宝数据需要遵守相关网站的爬虫规则，包括不频繁请求、不过度抓取等。此代码仅供学习参考，请勿用于非法用途。

阅读全文