首页python爬虫小红书代码

python爬虫小红书代码

时间: 2024-03-15 21:40:36 浏览: 213

当然可以！以下是一个简单的Python爬虫代码示例，用于爬取小红书网站的数据： ```python import requests from bs4 import BeautifulSoup # 发送HTTP请求获取网页内容 url = "https://www.xiaohongshu.com/" response = requests.get(url) html_content = response.text # 使用BeautifulSoup解析网页内容 soup = BeautifulSoup(html_content, "html.parser") # 提取需要的数据 data_list = [] items = soup.find_all("div", class_="item") for item in items: title = item.find("h3").text.strip() author = item.find("span", class_="author").text.strip() data_list.append({"title": title, "author": author}) # 打印提取的数据 for data in data_list: print("标题：", data["title"]) print("作者：", data["author"]) print("--") ``` 这段代码使用了`requests`库发送HTTP请求获取小红书网站的内容，并使用`BeautifulSoup`库解析网页内容。然后，通过查找特定的HTML元素，提取了标题和作者信息，并将其存储在一个列表中。最后，遍历列表并打印提取的数据。

阅读全文