首页Python爬虫财务报表

Python爬虫财务报表

时间: 2023-12-25 07:29:12 浏览: 27

以下是一个使用Python爬虫获取财务报表的示例： ```python import requests from lxml import etree # 设置请求头 headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3' } # 财务报表的URL url = 'http://example.com/financial_report' # 发送GET请求获取页面内容 response = requests.get(url, headers=headers) # 使用lxml解析页面内容 html = etree.HTML(response.text) # 提取财务报表的链接 report_links = html.xpath('//a[@class="report-link"]/@href') # 下载财务报表 for link in report_links: # 构造文件保存路径 save_path = "..\\requests\\财务报表\\" + link.split('/')[-1] # 发送GET请求下载文件 file_response = requests.get(link, headers=headers) # 保存文件 with open(save_path, 'wb') as f: f.write(file_response.content) print("财务报表下载完成！") ``` 这个示例使用了Python的requests库和lxml库。首先，我们发送GET请求获取财务报表页面的内容。然后，使用lxml库解析页面内容，提取财务报表的链接。最后，通过发送GET请求下载财务报表，并将其保存到指定的文件路径中。