Scraping Weibo comment counts, like counts, post times, repost counts, post content, and author information with Python

To scrape the comment count, like count, post time, repost count, post content, and author information from Weibo, you can use the third-party Python libraries requests and BeautifulSoup: requests fetches the page source, and BeautifulSoup parses the HTML. First, obtain the URL of the Weibo page, then send an HTTP request with requests to get the page source. For example:

```python
import requests

url = 'https://weibo.com/123456789'
response = requests.get(url)
html = response.text
```

Next, use BeautifulSoup to parse the HTML and locate the tags that hold the comment count, like count, post time, repost count, post content, and author information. The class names and `node-type` attributes below are tied to the page markup and may need adjusting if the markup differs. For example:

```python
from bs4 import BeautifulSoup

soup = BeautifulSoup(html, 'html.parser')
weibo = soup.find_all('div', {'class': 'WB_feed_detail clearfix'})

for w in weibo:
    # Comment count
    comments = w.find('span', {'class': 'line S_line1', 'node-type': 'comment_btn_text'})
    if comments is not None:
        comments_num = comments.text.strip()
    else:
        comments_num = '0'

    # Like count (also guard against a missing <em> child)
    likes = w.find('span', {'class': 'line S_line1', 'node-type': 'like_status'})
    if likes is not None and likes.em is not None:
        likes_num = likes.em.text
    else:
        likes_num = '0'

    # Post time
    time_tag = w.find('a', {'class': 'S_txt2', 'node-type': 'feed_list_item_date'})
    if time_tag is not None:
        post_time = time_tag.text.strip()
    else:
        post_time = 'N/A'

    # Repost count
    reposts = w.find('a', {'class': 'S_txt2', 'action-type': 'feed_list_forward'})
    if reposts is not None:
        reposts_num = reposts.text.strip()
    else:
        reposts_num = '0'

    # Post content
    content = w.find('div', {'class': 'WB_text W_f14'})
    if content is not None:
        post_content = content.text.strip()
    else:
        post_content = 'N/A'

    # Author information
    user_name = 'N/A'
    user_id = 'N/A'
    user_info = w.find('div', {'class': 'WB_info'})
    if user_info is not None:
        name_tag = user_info.find('a', {'class': 'W_f14 W_fb S_txt1'})
        if name_tag is not None:
            user_name = name_tag.text.strip()
        # Note: this takes the link text of the first <a> carrying the
        # S_txt1 class, which may be the same link as the author name.
        id_tag = user_info.find('a', {'class': 'S_txt1'})
        if id_tag is not None:
            user_id = id_tag.text.strip()

    # Print the results
    print('Comments:', comments_num)
    print('Likes:', likes_num)
    print('Post time:', post_time)
    print('Reposts:', reposts_num)
    print('Content:', post_content)
    print('Author name:', user_name)
    print('Author ID:', user_id)
    print('------------------------')
```
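The count fields extracted above (`comments_num`, `likes_num`, `reposts_num`) are plain strings taken from the button text, so they usually need converting before any numeric analysis. The following is a minimal sketch of such a conversion, not part of the original answer. The helper name `parse_count` is hypothetical, and it assumes that the text may contain a label such as `评论` before the number, that large values may be abbreviated with `万` (ten thousand) or `亿` (hundred million), and that a label with no number means zero.

```python
import re

# Multipliers for the abbreviated units assumed to appear in count text.
UNITS = {'万': 10_000, '亿': 100_000_000}


def parse_count(text: str) -> int:
    """Convert count text such as '评论 12' or '1.2万' to an integer.

    Hypothetical helper: returns 0 when the text contains no number,
    for example when only the label '转发' is present.
    """
    match = re.search(r'(\d+(?:\.\d+)?)\s*(万|亿)?', text)
    if match is None:
        return 0
    number = float(match.group(1))
    multiplier = UNITS.get(match.group(2), 1)
    # round() guards against floating-point error before truncating to int.
    return int(round(number * multiplier))
```

Inside the loop, this could be applied as `parse_count(comments_num)` just before printing, so that the printed or stored values are integers rather than raw button text.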
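With this, you can use Python to scrape Weibo comment counts, like counts, post times, repost counts, post content, and author information. Note, however, that scraping other people's data may infringe on their privacy and rights, so scrape only in compliance with the applicable laws and regulations.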
