首页Python bs4 爬取图片

Python bs4 爬取图片

时间: 2023-11-05 09:13:42 浏览: 159

使用 Python 的 BeautifulSoup 库来爬取图片很简单，以下是一个示例代码： ```python import requests from bs4 import BeautifulSoup import os # 要爬取图片的网页链接 url = "https://www.example.com" # 发送 GET 请求并获取 HTML 内容 response = requests.get(url) html = response.content # 解析 HTML 内容 soup = BeautifulSoup(html, 'html.parser') # 创建目录用于保存图片 if not os.path.exists("images"): os.mkdir("images") # 遍历所有的 img 标签 for img in soup.find_all("img"): # 获取图片链接 img_url = img.get("src") if img_url.startswith("http"): # 发送 GET 请求并保存图片 response = requests.get(img_url) with open("images/"+img_url.split("/")[-1], "wb") as f: f.write(response.content) ``` 以上代码会爬取指定网页中所有的图片，并将它们保存在当前工作目录下的一个名为 "images" 的文件夹中。注意，此代码将仅下载以 "http" 开头的图片链接，如果要下载其他类型的图片链接，请进行相应修改。

阅读全文