Python爬取豆瓣电影前十名
时间: 2023-08-27 11:10:58 浏览: 97
好的,下面是Python爬取豆瓣电影前十名的代码:
```python
import requests
from bs4 import BeautifulSoup
url = 'https://movie.douban.com/top250'
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3'}
response = requests.get(url, headers=headers)
soup = BeautifulSoup(response.text, 'html.parser')
movies = soup.find_all('div', class_='hd')
titles = [movie.a.span.text for movie in movies]
for i in range(10):
print(f'{i+1}. {titles[i]}')
```
运行结果:
```
1. 肖申克的救赎
2. 霸王别姬
3. 阿甘正传
4. 这个杀手不太冷
5. 美丽人生
6. 泰坦尼克号
7. 千与千寻
8. 辛德勒的名单
9. 盗梦空间
10. 忠犬八公的故事
```
相关问题
Python爬取豆瓣网前十名
好的,以下是Python代码,能够爬取豆瓣电影TOP250的前10名电影信息:
```python
import requests
from bs4 import BeautifulSoup
url = 'https://movie.douban.com/top250'
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3'}
movie_list = []
for i in range(0, 10):
link = url + '?start=' + str(i * 25)
r = requests.get(link, headers=headers)
soup = BeautifulSoup(r.text, 'lxml')
div_list = soup.find_all('div', class_='hd')
for each in div_list:
movie = each.a.span.text.strip()
movie_list.append(movie)
for i in range(0, 10):
print("Top" + str(i + 1) + ": " + movie_list[i])
```
运行该代码后,你会得到豆瓣电影TOP250的前10名电影名称。
python爬取豆瓣电影top10
好的,以下是Python爬取豆瓣电影top10的代码:
```python
import requests
from bs4 import BeautifulSoup
url = 'https://movie.douban.com/top250'
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3'}
response = requests.get(url, headers=headers)
soup = BeautifulSoup(response.content, 'html.parser')
movie_list = soup.find(class_='grid_view').find_all('li')
for movie in movie_list[:10]:
title = movie.find(class_='title').text.strip()
rating = movie.find(class_='rating_num').text.strip()
print(title, rating)
```
代码中使用了requests库和BeautifulSoup库,首先获取豆瓣电影top250的页面内容,然后使用BeautifulSoup解析页面内容,找到class为grid_view的ul标签,再找到其中的每个li标签,获取电影的标题和评分,最后打印出前10个电影的标题和评分。