得物apppython爬虫代码
时间: 2023-11-08 20:47:03 浏览: 116
以下是一个简单的得物App的Python爬虫代码示例:
```python
import requests
import json
import time
def get_goods_list(page):
url = "https://api.getwe.com/v1/search/items?sortType=DEFAULT&needUserInfo=true&size=20&page={}".format(page)
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36 Edge/16.16299'
}
response = requests.get(url, headers=headers)
data = json.loads(response.text)
goods_list = data['data']['list']
return goods_list
def get_goods_info(goods_id):
url = "https://api.getwe.com/v1/item/detail?itemId={}".format(goods_id)
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36 Edge/16.16299'
}
response = requests.get(url, headers=headers)
data = json.loads(response.text)
return data['data']
if __name__ == '__main__':
for page in range(1, 6):
goods_list = get_goods_list(page)
for goods in goods_list:
goods_id = goods['itemId']
goods_info = get_goods_info(goods_id)
print(goods_info)
time.sleep(2) # 设置爬虫延迟以避免被封禁
```
上述代码通过调用得物App的API接口获取商品列表和商品详情信息,并将结果打印输出。需要注意的是,在爬取过程中要设置适当的爬虫延迟以避免被封禁。