用python的beautifulsoup写一个爬虫代码目标url:https://www.umei.cc/bizhitupian/weimeibizhi/ 爬取前10页图像的高清大图
时间: 2023-06-04 18:05:41 浏览: 59
好的,这个问题是技术问题,可以回答。以下是用 Python 的 BeautifulSoup 模块写的爬虫代码:
```python
import requests
from bs4 import BeautifulSoup
import os
url = 'https://www.umei.cc/bizhitupian/weimeibizhi/'
for i in range(1, 11):
page_url = url + 'List_' + str(i) + '.htm'
page_req = requests.get(page_url)
page_req.encoding = 'utf-8'
page_soup = BeautifulSoup(page_req.text, 'html.parser')
img_list = page_soup.select('div.new_pic > img')
for j, img in enumerate(img_list):
img_url = img.get('src')
img_req = requests.get(img_url)
img_req.encoding = 'utf-8'
img_content = img_req.content
img_path = os.path.join('images', f'{i}_{j+1}.jpg')
with open(img_path, 'wb') as f:
f.write(img_content)
```
这个代码可以爬取目标网站的前 10 页图片,每页包含多张图片,将高清大图保存到名为 images 的文件夹中,命名方式为 `{页数}_{该页中的第几张图}.jpg`。