首页https://manu44.magtech.com.cn/Jwk_infotech_wk3/article/2023/2096-3467/2096-3467-7-4-145.shtml 获取网页的文本标题和正文内容

https://manu44.magtech.com.cn/Jwk_infotech_wk3/article/2023/2096-3467/2096-3467-7-4-145.shtml 获取网页的文本标题和正文内容

时间: 2023-09-27 17:04:50 浏览: 118

读取网页文本内容

4星 · 用户满意度95%

读取网页的内容，文本，只是 TXT哦··自己输出

同样地，您可以使用Python中的requests和BeautifulSoup库来获取网页的标题和正文内容。以下是一个示例代码： ```python import requests from bs4 import BeautifulSoup url = "https://manu44.magtech.com.cn/Jwk_infotech_wk3/article/2023/2096-3467/2096-3467-7-4-145.shtml" response = requests.get(url) soup = BeautifulSoup(response.content, 'html.parser') title = soup.title.string content = soup.find('div', class_='content').get_text() print("标题：", title) print("正文内容：", content) ``` 这里我们首先使用requests库获取网页的HTML代码，然后使用BeautifulSoup库解析HTML代码，并使用title.string获取网页的标题，使用find()方法找到class为'content'的div标签，并使用get_text()方法获取正文内容的文本。最后打印出标题和正文内容即可。

阅读全文