首页帮我获取https://dbaasp.org/peptide-card?id=DBAASPR_1 页面里，Activity Against Target Species 下的表格里的信息，输出excel用python编写

帮我获取https://dbaasp.org/peptide-card?id=DBAASPR_1 页面里，Activity Against Target Species 下的表格里的信息，输出excel用python编写

时间: 2024-10-20 17:06:24 浏览: 44

为了从指定URL获取页面中的数据并将其转换为Excel文件，我们可以使用Python的一些库，如`requests`用于发送HTTP请求，`BeautifulSoup`解析HTML内容，然后`pandas`处理数据并保存到Excel。以下是简单的步骤：首先，安装必要的库（如果你还没有安装）： ```bash pip install requests beautifulsoup4 pandas openpyxl ``` 接下来是一个示例脚本，它会帮助你完成任务： ```python import requests from bs4 import BeautifulSoup import pandas as pd # 发送GET请求 url = "https://dbaasp.org/peptide-card?id=DBAASPR_1" response = requests.get(url) content = response.text # 使用BeautifulSoup解析HTML soup = BeautifulSoup(content, 'html.parser') # 查找特定区域的数据 table_data = [] table = soup.find('table', {'class': 'table table-striped table-bordered table-hover'}) for row in table.find_all('tr')[1:]: # 我们通常跳过表头 cols = row.find_all('td') cols_data = [col.text.strip() for col in cols] table_data.append(cols_data) # 将数据转化为DataFrame data_df = pd.DataFrame(table_data[1:], columns=table_data[0]) # 假设第一行是列标题 # 保存到Excel output_filename = 'activity_against_target_species.xlsx' data_df.to_excel(output_filename, index=False) print(f"数据已保存至'{output_filename}'")

阅读全文