The code above did not scrape the information correctly.

I'm very sorry, I made a mistake when modifying the code. Please use the revised code below to scrape the comment information and store it in comments.xlsx:

```python
import time

import openpyxl
from bs4 import BeautifulSoup
from selenium import webdriver
from selenium.common.exceptions import NoSuchElementException
from selenium.webdriver.common.by import By


def is_element_present(edge, how, what):
    """Return True if the element located by (how, what) exists on the page."""
    try:
        edge.find_element(by=how, value=what)
    except NoSuchElementException:
        return False
    return True


website = "https://www.mafengwo.cn"
url = "https://www.mafengwo.cn/jd/10783/gonglve.html"

edge = webdriver.Edge()
edge.get(url)

pages = []
comments_data = []

# Collect the detail-page link of every attraction, one list page at a time.
while True:
    soup = BeautifulSoup(edge.page_source, "html.parser")
    for a in soup.select("ul.scenic-list>li>a"):
        pages.append(website + a.get("href"))
        print(a.get("title"))

    # Stop after the last list page, i.e. when there is no "next page" link.
    # (Checking before parsing would skip the links on the last page.)
    if not is_element_present(edge, By.CSS_SELECTOR, "a.pg-next"):
        break
    next_page = edge.find_element(By.CSS_SELECTOR, "a.pg-next")
    edge.execute_script("arguments[0].click()", next_page)
    time.sleep(5)  # give the next list page time to load

# Visit each detail page and read the comment counts.
for x in pages:
    edge.get(x)
    print(x)
    soup = BeautifulSoup(edge.page_source, "html.parser")

    # Total number of comments
    total_comments = soup.select("p.comment-total span")[0].get_text()
    print("Total comments:", total_comments)

    # Number of positive reviews
    good_comments = soup.select("div.score-legend > span:nth-of-type(1) > strong")[0].get_text()
    print("Positive reviews:", good_comments)

    # Number of neutral reviews
    medium_comments = soup.select("div.score-legend > span:nth-of-type(2) > strong")[0].get_text()
    print("Neutral reviews:", medium_comments)

    # Number of negative reviews
    bad_comments = soup.select("div.score-legend > span:nth-of-type(3) > strong")[0].get_text()
    print("Negative reviews:", bad_comments)

    comments_data.append([total_comments, good_comments, medium_comments, bad_comments])
    time.sleep(10)  # pause between detail pages

edge.quit()

# Write the scraped data to comments.xlsx.
wb = openpyxl.Workbook()
ws = wb.active
ws.append(["Total comments", "Positive reviews", "Neutral reviews", "Negative reviews"])
for data in comments_data:
    ws.append(data)
wb.save("comments.xlsx")
wb.close()
```

This revised code should scrape the comment information and store it in comments.xlsx, provided the pages still match the CSS selectors used above. If you have any other questions, feel free to ask.
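One possible reason for the "did not scrape correctly" symptom is that a selector matches nothing on some detail page, in which case indexing the result with `[0]` raises `IndexError` and the run stops. The sketch below shows one way to guard against that. The helper name `first_text` and the sample markup are hypothetical, made up only for illustration; the real mafengwo.cn page structure has not been verified here.

```python
from bs4 import BeautifulSoup


def first_text(soup, selector, default=""):
    """Return the stripped text of the first match for `selector`, or `default`."""
    node = soup.select_one(selector)
    return node.get_text(strip=True) if node is not None else default


# Hypothetical markup, invented only to exercise the helper.
# It deliberately has no third <span>, to show the fallback path.
SAMPLE_HTML = """
<p class="comment-total">Total <span>128</span></p>
<div class="score-legend">
  <span>Good <strong>100</strong></span>
  <span>Medium <strong>20</strong></span>
</div>
"""

soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
total = first_text(soup, "p.comment-total span")
good = first_text(soup, "div.score-legend > span:nth-of-type(1) > strong")
bad = first_text(soup, "div.score-legend > span:nth-of-type(3) > strong", default="N/A")
print(total, good, bad)
```

With a helper like this, a page that lacks one of the elements yields a placeholder value in the spreadsheet row instead of aborting the whole crawl, which also makes it easier to spot which selector is failing.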
