首页Traceback (most recent call last): File "C:\Users\test\Desktop\DIDI测试.py", line 13, in <module> extract_html_info(file_path) ^^^^^^^^^^^^^^^^^ NameError: name 'extract_html_info' is not defined

Traceback (most recent call last): File "C:\Users\test\Desktop\DIDI测试.py", line 13, in <module> extract_html_info(file_path) ^^^^^^^^^^^^^^^^^ NameError: name 'extract_html_info' is not defined

时间: 2023-08-04 22:06:25 浏览: 144

测试用的html文件

测试用的html文件，没用的东西，自己测试时用的，

抱歉，我之前给出的代码有一处错误。在调用`extract_html_info`函数之前，需要先定义该函数。请将以下代码块放在调用`extract_html_info`函数之前： ```python # 提取html文件内所需要数据 def extract_html_info(file_path): with open(file_path, 'r', encoding='utf-8') as file: # 读取HTML源代码 html = file.read() soup = BeautifulSoup(html, 'html.parser') # 提取所有的<p>标签 p_tags = soup.find_all('p') for p_tag in p_tags: # 提取ID talk_id = p_tag.find_previous(string=lambda text: isinstance(text, str) and '[talkid:' in text) talk_id = talk_id.strip('[talkid:]') # 提取时间 timestamp = p_tag.find_previous('body').find_previous('head').find('meta', {'http-equiv': 'Content=-Type'})['content'] # 提取发送号码 send_number = p_tag.find('span', {'class': 'hint-success'}).text # 提取接收号码 receive_number = p_tag.find_all('span', {'class': 'hint-success'})[1].text # 提取信息内容 message = p_tag.previous_sibling.strip() # 判断是否是音频 if '音频' in message: file_url = p_tag.find('a')['href'] print(f"ID: {talk_id}, 时间: {timestamp}, 发送号码: {send_number}, 接收号码: {receive_number}, 音频文件地址: {file_url}") else: print(f"ID: {talk_id}, 时间: {timestamp}, 发送号码: {send_number}, 接收号码: {receive_number}, 信息内容: {message}") ``` 这样，你就定义了`extract_html_info`函数，并且可以在主程序中调用它了。如果还有其他问题，请随时提问。

Traceback (most recent call last): File "C:\Users\test\Desktop\DIDI测试.py", line 13, in <module> extract_html_info(file_path) ^^^^^^^^^^^^^^^^^ NameError: name 'extract_html_info' is not defined

相关推荐

html信息提取

Test.html 测试文件

Traceback (most recent call last): File "C:\Users\yaoju\Desktop\pycuda.py", line 1, in <module> import pycuda.autoinit File "C:\Users\yaoju\Desktop\pycuda.py", line 1, in <module> import pycuda.autoinit ModuleNotFoundError: No module named 'pycuda.autoinit'; 'pycuda' is not a package

Traceback (most recent call last): File "C:/Users/郑紫晗/Desktop/测试.py", line 2, in <module> from docx import Document ModuleNotFoundError: No module named 'docx'

Traceback (most recent call last): File C:\Users\小杨\Desktop\Yang\Yang\lstm.py, line 78, in <module>

Traceback (most recent call last): File "C:/Users/郑紫晗/Desktop/测试.py", line 1, in <module> from pdfminer.high_level import extract_text ModuleNotFoundError: No module named 'pdfminer'

Traceback (most recent call last): File "C:/Users/asus/Desktop/无.py", line 1, in <module> import requests ModuleNotFoundError: No module named 'requests'

Traceback (most recent call last): File "C:/Users/lenovo/Desktop/1.py", line 1, in <module> import pygame ModuleNotFoundError: No module named 'pygame'

Traceback (most recent call last): File "C:/Users/Administrator/Desktop/1.py", line 1, in <module> import requests ModuleNotFoundError: No module named 'requests'

Traceback (most recent call last): File "C:\Users\Administrator\Desktop\MUSIC.py", line 3, in <module> from sklearn.ensemble import RandomForestRegressor ModuleNotFoundError: No module named 'sklearn'

Traceback (most recent call last): File "C:\Users\翁正杰\Desktop\shixi5.py", line 56, in <module> plt.figure()

Traceback (most recent call last): File "C:\Users\乐爷\Desktop\MQTTandUI.py", line 1, in <module> import paho.mqtt.client as mqtt ModuleNotFoundError: No module named 'paho'

Traceback (most recent call last): File "C:\Users\小杨\Desktop\models.py", line 21, in <module> from keras.layers.merge import _Merge ModuleNotFoundError: No module named 'keras.layers.merge'

Traceback (most recent call last): File "C:/Users/huawei/Desktop/想.py", line 32, in <module> if h.value>900: AttributeError: 'tuple' object has no attribute 'value' >>>

Traceback (most recent call last): File "C:\Users\13420\Desktop\511.py", line 15, in <module> inst2.write('REST')怎么解决

航空公司客户满意度数据转换与预测分析Power BI案例研究

最新推荐

平尾装配工作平台运输支撑系统设计与应用

管理建模和仿真的文件

MATLAB遗传算法探索：寻找随机性与确定性的平衡艺术

如何在S7-200 SMART PLC中使用MB_Client指令实现Modbus TCP通信？请详细解释从连接建立到数据交换的完整步骤。

MAX-MIN Ant System：用MATLAB解决旅行商问题

"互动学习：行动中的多样性与论文攻读经历"

【实战指南】MATLAB自适应遗传算法调整：优化流程全掌握

在Spring AOP中，如何实现一个环绕通知并在方法执行前后插入自定义逻辑？

Flutter状态管理新秀：sealed_flutter_bloc包整合seal_unions

关系数据表示学习