Implement Python code that scrapes hot data, featured data, and Shanghai/Shenzhen-Hong Kong Stock Connect (沪深港通) data from Eastmoney (东方财富网), using requests, Selenium, and Scrapy respectively.
Below are minimal examples for each combination. One caveat up front: the CSS class names used throughout (hotNews, listSpecial, hs_con) reflect the page structure at the time of the original answer and may no longer match, so verify the selectors in your browser's developer tools first.

1. Scraping Eastmoney hot data with requests:
```python
import requests
from bs4 import BeautifulSoup

url = 'http://finance.eastmoney.com/'
res = requests.get(url, timeout=10)
res.raise_for_status()  # fail fast on HTTP errors
soup = BeautifulSoup(res.text, 'html.parser')

# 'hotNews' is the container class assumed in the original answer
hot_news = soup.find('div', {'class': 'hotNews'})
for title in hot_news.find_all('a'):
    print(title.text.strip())
```
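In practice, Eastmoney (like many sites) may serve a stripped-down page or an error to clients using the default python-requests User-Agent. A minimal hardened variant of the fetch above (the header value is just an example of a browser-like string):

```python
import requests
from bs4 import BeautifulSoup

# Example browser-like User-Agent string; any recent one will do
headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64)'}
res = requests.get('http://finance.eastmoney.com/', headers=headers, timeout=10)
res.encoding = res.apparent_encoding  # guard against mis-detected encoding on Chinese pages
soup = BeautifulSoup(res.text, 'html.parser')
```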
2. Scraping Eastmoney hot data with Selenium:
```python
from selenium import webdriver
from selenium.webdriver.common.by import By

url = 'http://finance.eastmoney.com/'
driver = webdriver.Chrome()
driver.get(url)

# find_element_by_class_name was removed in Selenium 4; use the By locator API
hot_news = driver.find_element(By.CLASS_NAME, 'hotNews')
for title in hot_news.find_elements(By.TAG_NAME, 'a'):
    print(title.text.strip())
driver.quit()
```
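Eastmoney renders parts of its pages with JavaScript, so an element may not yet exist the moment `driver.get()` returns. A sketch using an explicit wait instead of an immediate lookup (the class name is still the assumption from above):

```python
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

driver = webdriver.Chrome()
driver.get('http://finance.eastmoney.com/')

# Wait up to 10 seconds for the hot-news container to appear
hot_news = WebDriverWait(driver, 10).until(
    EC.presence_of_element_located((By.CLASS_NAME, 'hotNews'))
)
for title in hot_news.find_elements(By.TAG_NAME, 'a'):
    print(title.text.strip())
driver.quit()
```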
3. Scraping Eastmoney hot data with Scrapy:
```python
import scrapy

class FinanceSpider(scrapy.Spider):
    name = 'finance'
    start_urls = ['http://finance.eastmoney.com/']

    def parse(self, response):
        hot_news = response.css('.hotNews')
        for title in hot_news.css('a::text').getall():
            yield {'title': title.strip()}
```
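To try this spider without creating a full Scrapy project, save it as e.g. finance_spider.py and run `scrapy runspider finance_spider.py -o titles.json`; the `-o` flag writes the yielded items to a JSON feed file.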
4. Scraping Eastmoney featured data with requests:
```python
import requests
from bs4 import BeautifulSoup

url = 'http://finance.eastmoney.com/special/00095/'
res = requests.get(url, timeout=10)
res.raise_for_status()
soup = BeautifulSoup(res.text, 'html.parser')

# 'listSpecial' is the container class assumed in the original answer
special_news = soup.find('div', {'class': 'listSpecial'})
for title in special_news.find_all('a'):
    print(title.text.strip())
```
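To keep the featured titles instead of just printing them, a small follow-on sketch that writes them to a CSV file with the standard library (continuing from the snippet above; the file name is arbitrary):

```python
import csv

# special_news is the container element found in the previous snippet
titles = [a.text.strip() for a in special_news.find_all('a')]

with open('special_titles.csv', 'w', newline='', encoding='utf-8') as f:
    writer = csv.writer(f)
    writer.writerow(['title'])            # header row
    writer.writerows([t] for t in titles)
```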
5. Scraping Eastmoney featured data with Selenium:
```python
from selenium import webdriver
from selenium.webdriver.common.by import By

url = 'http://finance.eastmoney.com/special/00095/'
driver = webdriver.Chrome()
driver.get(url)

special_news = driver.find_element(By.CLASS_NAME, 'listSpecial')
for title in special_news.find_elements(By.TAG_NAME, 'a'):
    print(title.text.strip())
driver.quit()
```
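For unattended runs, for example on a server with no display, Chrome can be started headless; a minimal sketch of the driver setup:

```python
from selenium import webdriver
from selenium.webdriver.chrome.options import Options

options = Options()
options.add_argument('--headless')    # run Chrome without opening a window
options.add_argument('--no-sandbox')  # commonly needed inside containers
driver = webdriver.Chrome(options=options)
```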
6. Scraping Eastmoney featured data with Scrapy:
```python
import scrapy

class FinanceSpecialSpider(scrapy.Spider):
    name = 'finance_special'
    start_urls = ['http://finance.eastmoney.com/special/00095/']

    def parse(self, response):
        special_news = response.css('.listSpecial')
        for title in special_news.css('a::text').getall():
            yield {'title': title.strip()}
```
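Scrapy also lets you attach per-spider settings such as a download delay and a browser-like User-Agent through `custom_settings`; a sketch of a polite variant of the spider above (the class name and values are illustrative):

```python
import scrapy

class FinanceSpecialPoliteSpider(scrapy.Spider):
    name = 'finance_special_polite'
    start_urls = ['http://finance.eastmoney.com/special/00095/']

    # Per-spider settings: throttle requests and present a browser-like UA
    custom_settings = {
        'DOWNLOAD_DELAY': 1.0,
        'USER_AGENT': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64)',
        'ROBOTSTXT_OBEY': True,
    }

    def parse(self, response):
        for title in response.css('.listSpecial a::text').getall():
            yield {'title': title.strip()}
```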
7. Scraping Eastmoney Stock Connect (沪深港通) data with requests:
```python
import requests
from bs4 import BeautifulSoup

url = 'http://finance.eastmoney.com/hsgt/index.html'
res = requests.get(url, timeout=10)
res.raise_for_status()
soup = BeautifulSoup(res.text, 'html.parser')

# 'hs_con' is the container class assumed in the original answer
hs_con = soup.find('div', {'class': 'hs_con'})
for title in hs_con.find_all('a'):
    print(title.text.strip())
```
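One caution: the Stock Connect figures on this page are typically filled in by JavaScript, so the static HTML that requests receives may not contain them at all. In that case, open the browser's network panel, find the JSON endpoint the page actually calls, and fetch that directly. A sketch with a placeholder URL (both the endpoint and the response layout below are hypothetical, not Eastmoney's real API):

```python
import requests

# Hypothetical endpoint: replace with the real URL observed in the network panel
api_url = 'https://example.invalid/api/hsgt/flows'
res = requests.get(api_url, timeout=10)
res.raise_for_status()
data = res.json()

# The payload structure is hypothetical too; inspect the real response first
for row in data.get('items', []):
    print(row)
```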
8. Scraping Eastmoney Stock Connect (沪深港通) data with Selenium:
```python
from selenium import webdriver
from selenium.webdriver.common.by import By

url = 'http://finance.eastmoney.com/hsgt/index.html'
driver = webdriver.Chrome()
driver.get(url)

hs_con = driver.find_element(By.CLASS_NAME, 'hs_con')
for title in hs_con.find_elements(By.TAG_NAME, 'a'):
    print(title.text.strip())
driver.quit()
```
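If the figures sit in a table that the page renders client-side, Selenium can read the cells once they appear; a sketch (the `table tr` selector is a guess at the markup):

```python
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

driver = webdriver.Chrome()
driver.get('http://finance.eastmoney.com/hsgt/index.html')

# Assumed markup: wait for table rows to render, then read their cells
rows = WebDriverWait(driver, 10).until(
    EC.presence_of_all_elements_located((By.CSS_SELECTOR, 'table tr'))
)
for row in rows:
    cells = [td.text.strip() for td in row.find_elements(By.TAG_NAME, 'td')]
    if cells:
        print(cells)
driver.quit()
```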
9. Scraping Eastmoney Stock Connect (沪深港通) data with Scrapy:
```python
import scrapy

class FinanceHSGTSpider(scrapy.Spider):
    name = 'finance_hsgt'
    start_urls = ['http://finance.eastmoney.com/hsgt/index.html']

    def parse(self, response):
        hs_con = response.css('.hs_con')
        for title in hs_con.css('a::text').getall():
            yield {'title': title.strip()}
```
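Finally, any of the three Scrapy spiders can be driven from a plain Python script instead of the scrapy CLI; a minimal sketch using CrawlerProcess (the feed file name is arbitrary):

```python
from scrapy.crawler import CrawlerProcess

# FinanceHSGTSpider is the spider class defined above
process = CrawlerProcess(settings={
    'FEEDS': {'hsgt_titles.json': {'format': 'json'}},
})
process.crawl(FinanceHSGTSpider)
process.start()  # blocks until the crawl finishes
```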