首页用BeautifulSoup或Scrapy库获取东方财富股吧内容的代码

用BeautifulSoup或Scrapy库获取东方财富股吧内容的代码

时间: 2024-04-05 12:07:53 浏览: 164

以下是一个使用Scrapy库的示例代码： ``` import scrapy class StockSpider(scrapy.Spider): name = 'stock' allowed_domains = ['guba.eastmoney.com'] start_urls = ['http://guba.eastmoney.com/list,601318_1.html'] def parse(self, response): for post in response.css('.articleh'): yield { 'author': post.css('.l3 a::text').extract_first(), 'title': post.css('.l3 span a::text').extract_first(), 'time': post.css('.l6::text').extract_first(), 'content': post.css('.l3 span a::attr(href)').extract_first(), } next_page = response.css('.pager .next::attr(href)').extract_first() if next_page is not None: yield response.follow(next_page, self.parse) ``` 该代码将爬取东方财富网的股吧内容，并将作者、标题、时间和内容链接等信息保存。注意，该代码仅供学习参考，请不要非法爬取网站内容。

阅读全文