首页python爬取https://finance.sina.com.cn/blog中的图片

python爬取https://finance.sina.com.cn/blog中的图片

时间: 2023-06-23 22:58:25 浏览: 151

要爬取https://finance.sina.com.cn/blog中的图片，可以使用Python的requests和BeautifulSoup库。下面是一个简单的示例代码： ```python import requests from bs4 import BeautifulSoup url = 'https://finance.sina.com.cn/blog' response = requests.get(url) soup = BeautifulSoup(response.text, 'html.parser') img_tags = soup.find_all('img') # 找到所有的img标签 for img in img_tags: img_url = img['src'] # 获取图片链接 if img_url.startswith('https://n.sinaimg.cn/finance/'): # 只下载finance.sina.com.cn的图片 img_name = img_url.split('/')[-1] # 获取图片名称 img_data = requests.get(img_url).content # 获取图片二进制数据 with open(img_name, 'wb') as f: f.write(img_data) # 将图片二进制数据写入文件 ``` 上述代码会获取https://finance.sina.com.cn/blog中所有属于finance.sina.com.cn的图片链接，然后通过requests库获取图片的二进制数据，最后将图片保存到本地文件中。需要注意的是，有些网站可能会对爬虫进行限制，需要设置一些请求头或者使用代理来解决。

阅读全文