Scraping Maoyan classic movie data (name, URL, score, ID) and generating a CSV file
Date: 2023-12-22 16:03:40
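Sure, here is how to use Python to scrape classic movie data from Maoyan and write it to a CSV file. Before you start, make sure Python and the required libraries are installed: requests, BeautifulSoup (distributed as the `beautifulsoup4` package), and pandas.
The code is as follows: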
```python
import requests
from bs4 import BeautifulSoup
import pandas as pd
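# Request headers: send a browser-like User-Agent
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3'
}
# Fetch the page source
url = 'https://maoyan.com/films?showType=3'
response = requests.get(url, headers=headers, timeout=10)
response.raise_for_status()  # stop early on an HTTP error
# Parse the page source
soup = BeautifulSoup(response.text, 'html.parser')
movie_list = soup.find_all('div', attrs={'class': 'movie-item film-channel'})
# Extract the required fields
movie_info = []
for movie in movie_list:
    name_tag = movie.find('span', attrs={'class': 'name'})
    link_tag = movie.find('a')
    score_tag = movie.find('span', attrs={'class': 'score'})
    if name_tag is None or link_tag is None:
        continue  # skip items that lack a name or a link
    movie_name = name_tag.text.strip()
    movie_url = 'https://maoyan.com' + link_tag.get('href', '')
    # Some films may not have a score
    movie_score = score_tag.text.strip() if score_tag else ''
    # data-val is assumed to look like '{movieId:1234}'; drop the trailing brace
    data_val = link_tag.get('data-val') or ''
    movie_id = data_val.split(':')[1].strip('} ') if ':' in data_val else ''
    movie_info.append([movie_name, movie_url, movie_score, movie_id])
# Write the CSV file (utf-8-sig so that Excel displays Chinese titles correctly)
df = pd.DataFrame(movie_info, columns=['name', 'url', 'score', 'id'])
df.to_csv('maoyan_movies.csv', index=False, encoding='utf-8-sig')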
```
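The ID step depends on the exact shape of the `data-val` attribute. The sketch below assumes the value looks something like `{movieId:1234}`, which is an assumption about the page markup rather than something confirmed here. The helper name `extract_movie_id` is hypothetical. It takes the first run of digits, which tolerates braces, spaces, or a missing attribute.

```python
import re


def extract_movie_id(data_val):
    """Return the first run of digits in a data-val string, or '' if there is none.

    Assumes the attribute looks roughly like '{movieId:1234}' (not verified
    against the live page).
    """
    match = re.search(r'\d+', data_val or '')
    return match.group(0) if match else ''
```

Inside the loop it could be called as `extract_movie_id(movie.find('a').get('data-val'))`.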
Running the code above creates a file named "maoyan_movies.csv" in the current directory, containing the name, link, score, and ID of each classic movie on Maoyan.
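To check the result, the file can be read back with the standard library. This is a minimal sketch. The helper name `read_movies` is hypothetical, and it assumes only that the file was written with the `utf-8-sig` encoding used above, so the byte-order mark is stripped from the first column name.

```python
import csv


def read_movies(path):
    """Read a CSV written with utf-8-sig and return its rows as dictionaries."""
    with open(path, newline='', encoding='utf-8-sig') as f:
        return list(csv.DictReader(f))
```

For example, `rows = read_movies('maoyan_movies.csv')` followed by `print(len(rows))` shows how many movies were captured. An empty list suggests the page layout differs from what the selectors expect, or that the request was blocked.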