首页python爬虫爬取网页数据

python爬虫爬取网页数据

时间: 2023-06-11 17:06:12 浏览: 89

Python 爬虫可以使用第三方库如 requests、BeautifulSoup、Scrapy 等来实现网页数据的爬取。以下是一个使用 requests 和 BeautifulSoup 实现简单网页数据爬取的例子： ```python import requests from bs4 import BeautifulSoup url = 'https://www.example.com' response = requests.get(url) soup = BeautifulSoup(response.text, 'html.parser') print(soup.title.text) ``` 以上代码通过 requests 库向指定 URL 发送 GET 请求，并返回响应对象。然后使用 BeautifulSoup 库对响应对象的 HTML 文本进行解析，提取网页标题并输出。当然，这只是爬虫的入门级别，对于复杂的网页结构和大量数据的爬取，需要使用更为高级的爬虫框架和技术。