
Scraping Text Data with Python

Date: 2023-10-07 21:12:00 Views: 58

Getting product sales data with Python

```python
import requests
import re


def get_sales(url):
    headers = {
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3'}
    response = requests.get(url, headers=headers)
    html = response.text
    pattern = re.compile(r'"soldQuantity":(\d+)')
    result = pattern.search(html)
    if result:
        return result.group(1)
    else:
        return None


if __name__ == '__main__':
    url = 'https://detai
# (the snippet is truncated here in the source)
```

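Python offers several libraries for scraping text data; the most commonly used are requests and BeautifulSoup. Here is a simple example:

```python
import requests
from bs4 import BeautifulSoup

# Send a GET request to fetch the page content
response = requests.get('https://www.example.com', timeout=10)
response.raise_for_status()  # stop early on HTTP errors (4xx/5xx)

# Parse the HTML with BeautifulSoup
soup = BeautifulSoup(response.text, 'html.parser')

# Extract the text content of the page
text = soup.get_text()

# Print the text
print(text)
```

The code above first sends a GET request with requests to fetch the page, then parses the HTML with BeautifulSoup, and finally extracts the text from the parsed result and prints it. This is only a simple example; adapt the details to your specific needs.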
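One common adjustment is to keep only the visible text and drop the contents of `<script>` and `<style>` tags, which a plain text dump usually includes. The sketch below illustrates the idea with the standard library's `html.parser` instead of BeautifulSoup, so it runs without third-party packages or network access. The names `VisibleTextExtractor`, `extract_visible_text`, and `SAMPLE_HTML` are hypothetical and exist only for this illustration; a real page may need further rules.

```python
from html.parser import HTMLParser


class VisibleTextExtractor(HTMLParser):
    """Collect text nodes, skipping the contents of <script> and <style>."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # greater than 0 while inside a skipped tag
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside skipped tags
        text = data.strip()
        if text and self._skip_depth == 0:
            self._chunks.append(text)

    def text(self):
        return "\n".join(self._chunks)


def extract_visible_text(html):
    """Return the visible text of an HTML string, one text node per line."""
    parser = VisibleTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


# Hypothetical sample page, used in place of a real HTTP response
SAMPLE_HTML = """
<html>
  <head><title>Demo</title><style>p { color: red; }</style></head>
  <body>
    <h1>Hello</h1>
    <script>console.log("hidden");</script>
    <p>First paragraph.</p>
  </body>
</html>
"""

if __name__ == "__main__":
    print(extract_visible_text(SAMPLE_HTML))
```

For the sample page this prints `Demo`, `Hello`, and `First paragraph.` on separate lines. In a scraper, the string passed to `extract_visible_text` would be the text of the HTTP response rather than an inline sample.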
