首页写一个爬取http://hbjd.com.cn/article-view-id-20554.html的图片的代码

写一个爬取http://hbjd.com.cn/article-view-id-20554.html的图片的代码

时间: 2024-03-30 19:34:15 浏览: 146

以下是一个简单的Python爬虫代码，可以爬取http://hbjd.com.cn/article-view-id-20554.html的所有图片： ```python import requests from bs4 import BeautifulSoup import os def download_images(url, save_dir): # 发送GET请求 response = requests.get(url) # 解析HTML soup = BeautifulSoup(response.text, 'html.parser') # 获取所有的img标签 images = soup.find_all('img') # 遍历所有的img标签 for i, img in enumerate(images): src = img.get('src') if src.startswith('http'): filename = os.path.join(save_dir, f'{i}.jpg') # 下载图片 with open(filename, 'wb') as f: f.write(requests.get(src).content) if __name__ == '__main__': url = 'http://hbjd.com.cn/article-view-id-20554.html' save_dir = './images' if not os.path.exists(save_dir): os.makedirs(save_dir) download_images(url, save_dir) ``` 在上述代码中，我们使用requests库发送GET请求获取HTML页面，然后使用BeautifulSoup库解析HTML，获取所有的img标签，并提取其中的src属性作为图片的URL。最后，我们可以使用requests库下载图片，并保存在本地。

阅读全文