Downloading listed-company annual reports with Python

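The steps below download a listed company's annual report with Python:

1. Install Selenium, which simulates browser actions, and make sure ChromeDriver, which controls the Chrome browser, is available. The command below installs Selenium only; ChromeDriver is not a pip package. Selenium 4.6 and later can fetch a matching driver automatically through Selenium Manager; with older versions, download ChromeDriver separately and put it on your `PATH`.

```shell
pip install selenium
```

2. Import the required packages:

```python
from selenium import webdriver
from selenium.webdriver.common.keys import Keys
from selenium.webdriver.common.by import By
from time import sleep
```

3. Start Chrome and open the target site:

```python
# Start the Chrome browser
driver = webdriver.Chrome()
# Wait up to 10 seconds for elements to appear before a lookup fails
driver.implicitly_wait(10)
# Open the target site
driver.get("http://www.cninfo.com.cn/new/index")
```

4. Type the company name into the search box and search:

```python
# Find the search box and type the company name
search_box = driver.find_element(By.ID, "suggestinput")
search_box.send_keys("公司名称")  # placeholder: replace with the actual company name
# Simulate pressing Enter
search_box.send_keys(Keys.RETURN)
```

5. Under "公司概况" (Company Overview), click "公司公告" (Company Announcements):

```python
# Find "公司概况" (Company Overview) and click it
company_info = driver.find_element(By.XPATH, "//a[contains(text(),'公司概况')]")
company_info.click()
# Find "公司公告" (Company Announcements) and click it
company_announcement = driver.find_element(By.XPATH, "//a[contains(text(),'公司公告')]")
company_announcement.click()
```

6. On the announcements page, find "年报" (Annual Report) and click it:

```python
# Find "年报" (Annual Report) and click it
annual_report = driver.find_element(By.XPATH, "//a[contains(text(),'年报')]")
annual_report.click()
```

7. On the annual report page, find the reports for the target year and download them:

```python
# Find the annual reports for the target year and download them
year = "2021"  # target year
pdf_links = driver.find_elements(
    By.XPATH, f"//a[contains(text(),'{year}') and contains(text(),'PDF')]"
)
for link in pdf_links[:2]:  # only download the first two PDF files
    href = link.get_attribute("href")
    # Pass the URL as an argument so quotes in it cannot break the script
    driver.execute_script("window.open(arguments[0]);", href)
    sleep(2)  # wait 2 seconds
```

The complete code:

```python
from selenium import webdriver
from selenium.webdriver.common.keys import Keys
from selenium.webdriver.common.by import By
from time import sleep

# Start the Chrome browser
driver = webdriver.Chrome()
# Wait up to 10 seconds for elements to appear before a lookup fails
driver.implicitly_wait(10)
# Open the target site
driver.get("http://www.cninfo.com.cn/new/index")

# Find the search box and type the company name
search_box = driver.find_element(By.ID, "suggestinput")
search_box.send_keys("公司名称")  # placeholder: replace with the actual company name
# Simulate pressing Enter
search_box.send_keys(Keys.RETURN)

# Find "公司概况" (Company Overview) and click it
company_info = driver.find_element(By.XPATH, "//a[contains(text(),'公司概况')]")
company_info.click()
# Find "公司公告" (Company Announcements) and click it
company_announcement = driver.find_element(By.XPATH, "//a[contains(text(),'公司公告')]")
company_announcement.click()

# Find "年报" (Annual Report) and click it
annual_report = driver.find_element(By.XPATH, "//a[contains(text(),'年报')]")
annual_report.click()

# Find the annual reports for the target year and download them
year = "2021"  # target year
pdf_links = driver.find_elements(
    By.XPATH, f"//a[contains(text(),'{year}') and contains(text(),'PDF')]"
)
for link in pdf_links[:2]:  # only download the first two PDF files
    href = link.get_attribute("href")
    # Pass the URL as an argument so quotes in it cannot break the script
    driver.execute_script("window.open(arguments[0]);", href)
    sleep(2)  # wait 2 seconds

# Close the browser
driver.quit()
```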
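Opening each link with `window.open` only displays the PDF in a new browser window; whether a file actually lands on disk depends on the browser's settings. As a rough sketch of an alternative, the helpers below take the link titles and URLs collected in step 7 and save the files with the standard library. The names `select_report_links`, `safe_filename`, `download_reports`, and the `reports` output directory are hypothetical and not part of the original answer. The sketch also assumes the PDF URLs can be fetched without the browser's cookies, which may not hold for every site.

```python
import re
import urllib.request
from pathlib import Path


def select_report_links(links, year, limit=2):
    """Return up to `limit` (title, href) pairs whose title contains `year` and 'PDF'.

    Mirrors the XPath filter used in step 7; `links` is a list of (title, href) tuples.
    """
    matches = [
        (title, href)
        for title, href in links
        if href and year in title and "PDF" in title
    ]
    return matches[:limit]


def safe_filename(title, suffix=".pdf"):
    """Replace whitespace and characters that are invalid in file names, then add `suffix`."""
    cleaned = re.sub(r'[\\/:*?"<>|\s]+', "_", title).strip("_")
    return (cleaned or "report") + suffix


def download_reports(links, year, out_dir="reports", limit=2):
    """Download the selected links into `out_dir` and return the saved paths."""
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)
    saved = []
    for title, href in select_report_links(links, year, limit):
        path = target / safe_filename(title)
        urllib.request.urlretrieve(href, path)  # plain HTTP(S) fetch, no browser session
        saved.append(path)
    return saved
```

With the Selenium session from the steps above, the input could be built as `links = [(a.text, a.get_attribute("href")) for a in pdf_links]` and passed to `download_reports(links, year)` before calling `driver.quit()`.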
