首页Python抓取网页图片代码

Python抓取网页图片代码

时间: 2024-06-14 16:02:29 浏览: 130

在Python中，我们可以使用一些库来抓取网页上的图片。其中最常用的是`requests`库用于发送HTTP请求获取网页内容，`BeautifulSoup`库解析HTML结构，然后利用`os`和`urllib`来下载图片。以下是一个简单的例子： ```python import requests from bs4 import BeautifulSoup import os def download_images(url): # 发送GET请求 response = requests.get(url) # 使用BeautifulSoup解析HTML soup = BeautifulSoup(response.text, 'html.parser') # 查找所有的img标签（图片链接） img_tags = soup.find_all('img') # 确定图片保存路径（这里假设是当前目录下的images子目录） img_dir = "images" if not os.path.exists(img_dir): os.makedirs(img_dir) # 下载图片并保存 for img in img_tags: img_url = img.get('src') or img.get('data-src') # 处理常见的src和data-src属性 if img_url: img_response = requests.get(img_url, stream=True) with open(os.path.join(img_dir, os.path.basename(img_url)), 'wb') as f: for chunk in img_response.iter_content(1024): f.write(chunk) # 使用函数 download_images("http://example.com") # 替换为你想要抓取的网页URL

阅读全文