首页python爬虫使用beautifulsoup库爬取www.shanghairanking.cn网站中大学名字分数排名

python爬虫使用beautifulsoup库爬取www.shanghairanking.cn网站中大学名字分数排名

时间: 2023-12-25 16:06:53 浏览: 22

好的，我可以帮你解答这个问题。首先，你需要安装beautifulsoup库，可以使用以下命令安装： ``` pip install beautifulsoup4 ``` 然后，你需要使用requests库向目标网站发出请求，获取网页内容。可以使用以下代码： ``` import requests url = 'http://www.shanghairanking.cn/rankings/bcur/2021' response = requests.get(url) content = response.content ``` 接下来，你需要使用beautifulsoup解析网页内容，找到你需要的信息。可以使用以下代码： ``` from bs4 import BeautifulSoup soup = BeautifulSoup(content, 'html.parser') table = soup.find('table', {'id': 'UniversityRanking'}) rows = table.tbody.find_all('tr') for row in rows: cols = row.find_all('td') name = cols[1].text.strip() score = cols[2].text.strip() rank = cols[0].text.strip() print(rank, name, score) ``` 这段代码会输出每个大学的排名、名字和分数排名。你可以根据需要进行修改和调整。

阅读全文