首页python爬虫储存爬取的数据到数据库的详细代码

python爬虫储存爬取的数据到数据库的详细代码

时间: 2023-10-08 18:05:07 浏览: 197

以下是一个将爬取的数据存入MySQL数据库中的Python爬虫示例代码： ```python import pymysql import requests from bs4 import BeautifulSoup # 连接MySQL数据库 = pymysql.connect(host='localhost', user='root', password='password', db='testdb', port=3306) # 获取数据库游标 cursor = db.cursor() # 请求页面 url = 'https://www.example.com' res = requests.get(url) soup = BeautifulSoup(res.text, 'html.parser') # 从页面中提取数据 data1 = soup.find('div', {'class': 'box1'}).text data2 = soup.find('div', {'class': 'box2'}).text # 将数据插入数据库 sql = "INSERT INTO test_table(data1, data2) VALUES (%s, %s)" try: cursor.execute(sql, (data1, data2)) db.commit() print("数据插入成功") except: db.rollback() print("数据插入失败") # 关闭数据库连接 db.close() ``` 上面的代码中，我们首先连接到MySQL数据库，并获取一个游标。然后我们请求一个网页，使用BeautifulSoup库提取数据。最后，我们将数据插入到MySQL数据库中。在插入数据时，我们将数据1和数据2作为参数传递给SQL插入语句，然后使用游标执行该语句。如果插入成功，则提交，否则回滚。最后，我们关闭数据库连接。

阅读全文