首页利用Python编写根据输入内容爬取图片保存在相应的文件下且不使用“beautifulsoup”库

利用Python编写根据输入内容爬取图片保存在相应的文件下且不使用“beautifulsoup”库

时间: 2024-09-24 22:27:39 浏览: 40

在Python中，我们可以利用诸如`requests`, `lxml`, 或者更简单些的`urllib`库来发送HTTP请求获取网页内容，然后使用`re` (正则表达式) 或者内置的字符串方法解析HTML，找到图片链接。接下来，可以使用`os`和`io`模块来创建目录、下载图片并保存到指定路径。以下是一个基础示例，展示如何爬取图片并保存，这里假设我们只关心img标签内的src属性： ```python import os import requests from io import BytesIO def download_image(url, save_path): response = requests.get(url) if response.status_code == 200: img_data = response.content filename = url.split('/')[-1] # 获取图片名称 full_path = os.path.join(save_path, filename) with open(full_path, 'wb') as f: f.write(img_data) print(f"Image saved at {full_path}") else: print(f"Failed to download image from {url}, status code: {response.status_code}") # 需要提供要爬取的网站URL和目标保存路径 website_url = "https://example.com" # 替换为你想爬取的网站 save_directory = "images" # 创建的保存图片的文件夹 if not os.path.exists(save_directory): os.makedirs(save_directory) # 发送请求，查找img标签的src属性并下载 for img_tag in BeautifulSoup(requests.get(website_url).text, 'html.parser').find_all('img'): src_url = img_tag['src'] download_image(src_url, save_directory)

阅读全文