首页写一段python代码，爬取酷狗音乐top500榜单前20首，并输出歌名和歌手

写一段python代码，爬取酷狗音乐top500榜单前20首，并输出歌名和歌手

时间: 2024-03-11 13:45:35 浏览: 273

网络爬虫（二） BS4提取之Selector

网络爬虫（二） BS4提取之Selector 2.1 Selector爬取酷狗top500 2.1.1 环境配置第一步：下载谷歌浏览器第二步：安装bs4，requests库打开 cmd 命令行（win + r）输入 pip install bs4 完成bs4模块安装。输入 pip install requests 完成requests模块安装。 2.1.2 可能遇到的安装错误如果执行 pip install bs4 后报错为“pip 不是可执行的命令”，将pip 的路径加入环境变量即可 2.1.3 构造请求网址提供URL如下： https://www.kugou.com/yy/r

好的，以下是爬取酷狗音乐top500榜单前20首歌曲并输出歌名和歌手的Python代码： ``` import requests from bs4 import BeautifulSoup url = 'http://www.kugou.com/yy/rank/home/1-8888.html' headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3'} response = requests.get(url, headers=headers) html = response.text soup = BeautifulSoup(html, 'lxml') song_list = soup.select('.pc_temp_songlist > ul > li') for i in range(0, 20): song_title = song_list[i].select('.pc_temp_songname > a')[0].text.strip() song_author = song_list[i].select('.pc_temp_singer > a')[0].text.strip() print('歌曲名称：{}，歌手：{}'.format(song_title, song_author)) ``` 这段代码首先使用requests库发送GET请求获取酷狗音乐top500的HTML源代码，然后使用BeautifulSoup库解析HTML源代码，并定位到歌曲信息所在的HTML标签。接着使用循环遍历前20首歌曲，提取歌曲名称和歌手信息，并使用字符串格式化输出。

阅读全文