import requests from lxml import etree resp = requests.get(url,headers={'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/114.0.0.0 Safari/537.36 Edg/114.0.1823.37'}) print(resp.text) xp = etree.HTML(resp.text) img_urls = xp.xpath('//boby/div/section/div/div/div/section/figure/a/img/@src') img_names =xp.xpath('//boby/div/section/div/div/div/section/figure/a/img/@alt') for u,n in zip(img_urls,img_names): print(f'Image name;{n} URL;{u}') img_resp = requests.get(u,headers={'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/114.0.0.0 Safari/537.36 Edg/114.0.1823.37'}) with open(f'./venv//img_f/{n}.jpg','wb') as f: f.write(img_resp.content)
Time: 2023-10-09 20:17:25 Views: 78
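From the code you posted, you are scraping image URLs and saving the images to local files. The main problem is a typo in both XPath expressions: 'boby' should be 'body'. Since the page contains no `boby` element, both `xpath()` calls return empty lists and the loop body never runs. Your snippet also never defines `url`, and the rest of the XPath may need adjusting to match the page's actual structure.
You can try the following corrected version: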
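To make the effect of the typo concrete, here is a minimal sketch. The HTML snippet is a made-up stand-in for the real page (its structure and the `example.com` image URL are assumptions), so it only illustrates the behaviour of `xpath()` with a misspelled tag name.

```python
from lxml import etree

# Hypothetical HTML standing in for the real page.
html = '''
<html><body>
  <div><section><figure>
    <a href="#"><img src="https://example.com/a.jpg" alt="first"></a>
  </figure></section></div>
</body></html>
'''
tree = etree.HTML(html)

# Misspelled tag: no <boby> element exists, so the result is an empty list.
typo_hits = tree.xpath('//boby//figure/a/img/@src')
# Corrected tag: the same path now reaches the image.
fixed_hits = tree.xpath('//body//figure/a/img/@src')

print(typo_hits)   # []
print(fixed_hits)  # ['https://example.com/a.jpg']

# zip() over empty lists yields nothing, so a download loop would be skipped.
for u, n in zip(typo_hits, []):
    print('never reached')
```

If your own script prints nothing inside the loop, checking `len(img_urls)` right after the `xpath()` call is a quick way to tell whether the expression matched anything.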
```python
import os

import requests
from lxml import etree

url = 'https://example.com'  # placeholder; replace with the page you want to scrape
# Send the same browser-like User-Agent with every request
headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/114.0.0.0 Safari/537.36 Edg/114.0.1823.37'}
out_dir = './venv/img_f'
os.makedirs(out_dir, exist_ok=True)  # open() fails if the folder does not exist

resp = requests.get(url, headers=headers)
# print(resp.text)  # uncomment to inspect the raw HTML
xp = etree.HTML(resp.text)
# 'boby' corrected to 'body'; '//' matches descendants at any depth
img_urls = xp.xpath('//body//div//section//div//div//div//section//figure//a//img/@src')
img_names = xp.xpath('//body//div//section//div//div//div//section//figure//a//img/@alt')
for u, n in zip(img_urls, img_names):
    print(f'Image name: {n} URL: {u}')
    img_resp = requests.get(u, headers=headers)
    # Save the raw image bytes under the image's alt text
    with open(f'{out_dir}/{n}.jpg', 'wb') as f:
        f.write(img_resp.content)
```
This code collects the image URLs from the specified page and downloads each image into a local folder. I hope it helps.
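Two things can still go wrong when saving the files: an `alt` text may contain characters that are not allowed in file names, and a `src` value may be a relative URL. The sketch below shows one possible way to handle both using only the standard library. `safe_filename` and `target_path` are hypothetical helper names, and the page URL and `src` value are made-up examples, not values from your site.

```python
import re
from pathlib import Path
from urllib.parse import urljoin


def safe_filename(alt_text, fallback='image'):
    """Turn alt text into a file-name-safe stem (hypothetical helper)."""
    # Collapse path separators, reserved characters and whitespace into '_'.
    stem = re.sub(r'[\\/:*?"<>|\s]+', '_', alt_text).strip('_')
    return stem or fallback


def target_path(out_dir, alt_text, index):
    """Build an output path; the index keeps equal alt texts from clashing."""
    return Path(out_dir) / f'{index:03d}_{safe_filename(alt_text)}.jpg'


page_url = 'https://example.com/gallery/'  # assumed page URL
src = '/static/img/cat.jpg'                # assumed relative src value

# urljoin resolves a relative src against the page it came from.
print(urljoin(page_url, src))  # https://example.com/static/img/cat.jpg
print(safe_filename('a/b: cat?'))  # a_b_cat
print(target_path('./venv/img_f', 'a/b: cat?', 1))
```

In the download loop you could then request `urljoin(url, u)` instead of `u` and open `target_path(out_dir, n, i)` instead of building the path from `n` directly, with `i` coming from `enumerate`.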