首页python爬取网站图片

python爬取网站图片

时间: 2023-07-06 13:36:52 浏览: 104

要爬取网站的图片，可以使用 Python 的 requests 和 BeautifulSoup 库来实现。以下是一个简单的示例代码： ```python import requests from bs4 import BeautifulSoup import os # 目标网站的 URL url = "https://www.example.com" # 发送 GET 请求 response = requests.get(url) # 解析 HTML soup = BeautifulSoup(response.text, 'html.parser') # 创建保存图片的目录 if not os.path.exists('images'): os.makedirs('images') # 遍历所有的图片标签 for img in soup.find_all('img'): # 获取图片的 URL img_url = img.get('src') if not img_url: continue # 发送 GET 请求 img_response = requests.get(img_url) # 保存图片 with open('images/' + os.path.basename(img_url), 'wb') as f: f.write(img_response.content) ``` 上述代码中，我们首先发送 GET 请求获取网站的 HTML 内容，然后使用 BeautifulSoup 解析 HTML，找到所有的图片标签，获取图片的 URL，并发送 GET 请求获取图片内容，最后将图片保存到本地目录中。注意要判断图片是否存在 URL，以及创建保存图片的目录。

阅读全文