首页python3爬取校友会版的高校排名，https://www.dxsbb.com/news/5463.html的代码

python3爬取校友会版的高校排名，https://www.dxsbb.com/news/5463.html的代码

时间: 2023-10-19 20:06:38 浏览: 140

以下是使用 Python3 爬取 https://www.dxsbb.com/news/5463.html 的校友会版的高校排名的代码： ```python import requests from bs4 import BeautifulSoup url = "https://www.dxsbb.com/news/5463.html" response = requests.get(url) soup = BeautifulSoup(response.content, "html.parser") table = soup.find("table", {"class": "list-table"}) rows = table.find_all("tr") for row in rows: cols = row.find_all("td") if len(cols) > 0: rank = cols[0].text.strip() name = cols[1].text.strip() score = cols[2].text.strip() print(rank, name, score) ``` 解释： 1. 首先，我们使用 requests 库发送 GET 请求，获取网页的 HTML 内容。 2. 然后，使用 BeautifulSoup 库解析 HTML 内容。 3. 我们找到了包含排名信息的表格，并使用 find_all 方法获取每一行的信息。 4. 对于每一行信息，我们使用 find_all 方法获取每一列的信息。 5. 如果一行中有多于一列的信息，我们从中提取出排名、学校名称和综合得分信息，并打印出来。希望这个例子能帮助你实现你的爬虫。

阅读全文