
1. Parsing a page with BeautifulSoup: <html> <title>baidu</title></html>

Time: 2023-11-16 20:59:44 Views: 154

BeautifulSoup makes it easy to extract data from HTML or XML documents. In Python, it is provided by the bs4 package. Here is an example that parses a page with BeautifulSoup:

```python
from bs4 import BeautifulSoup

# The HTML document to parse, held in a string
html_doc = """
<html>
<head>
<title>baidu</title>
</head>
<body>
The Dormouse's story
Once upon a time there were three little sisters; and their names were
<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>;
and they lived at the bottom of a well.
...
</body>
</html>
"""

# Parse the string with Python's built-in HTML parser
soup = BeautifulSoup(html_doc, 'html.parser')

# soup.title is the <title> tag; .string is the text inside it
print(soup.title.string)
```

In this example, we first import BeautifulSoup and define a string containing an HTML document. We then pass that string to BeautifulSoup, which parses it into a BeautifulSoup object. Finally, we access the title attribute of the soup object to get the title tag, and its string attribute to get the tag's text. The output is:

```
baidu
```
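The same soup object can also be used to pull out the links in the document. The sketch below is one possible way to do that, reusing a shortened copy of the HTML above. The helper name `extract_links` is made up for illustration and is not part of BeautifulSoup; `find_all`, `find`, `get_text` and `get` are the library calls it relies on.

```python
from bs4 import BeautifulSoup

# Shortened copy of the example document, keeping the title and the links
html_doc = """
<html>
<head><title>baidu</title></head>
<body>
<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>
</body>
</html>
"""

soup = BeautifulSoup(html_doc, "html.parser")


def extract_links(soup):
    """Hypothetical helper: return (text, href) pairs for each <a class="sister">."""
    # class_ has a trailing underscore because "class" is a Python keyword
    return [(a.get_text(), a.get("href")) for a in soup.find_all("a", class_="sister")]


links = extract_links(soup)
for text, href in links:
    print(text, href)

# find() returns None when no tag matches, so check before reading its text
h1 = soup.find("h1")
heading = h1.get_text() if h1 is not None else None
print(heading)
```

Checking for None matters because accessing an attribute such as `.string` on a missing tag raises an AttributeError, which is a common stumbling block when a page does not contain the expected element.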
