pytho数据分析,x轴分为1900-1929,1930-1959,1960-1959,1960-1989,1990-2019,2020-2050五档,然后y轴上放在movie.csv文件中统计各电影年份在各档中的总数
时间: 2024-01-09 08:05:45 浏览: 23
首先,您需要导入`pandas`库来处理CSV文件,并且使用`matplotlib`库来绘制图表。您可以按照以下步骤进行操作:
1. 导入必要的库:
```python
import pandas as pd
import matplotlib.pyplot as plt
```
2. 读取CSV文件并创建DataFrame对象:
```python
df = pd.read_csv('movie.csv')
```
3. 添加一列`decade`表示年份所属的区间:
```python
bins = [1900, 1930, 1960, 1990, 2020, 2050]
labels = ['1900-1929', '1930-1959', '1960-1989', '1990-2019', '2020-2050']
df['decade'] = pd.cut(df['year'], bins=bins, labels=labels)
```
4. 对`decade`列进行分组并计算各组的数量:
```python
count_by_decade = df.groupby('decade').size()
```
5. 绘制条形图:
```python
count_by_decade.plot(kind='bar')
plt.show()
```
完整代码如下:
```python
import pandas as pd
import matplotlib.pyplot as plt
df = pd.read_csv('movie.csv')
bins = [1900, 1930, 1960, 1990, 2020, 2050]
labels = ['1900-1929', '1930-1959', '1960-1989', '1990-2019', '2020-2050']
df['decade'] = pd.cut(df['year'], bins=bins, labels=labels)
count_by_decade = df.groupby('decade').size()
count_by_decade.plot(kind='bar')
plt.show()
```
这段代码将会生成一个条形图,其中x轴为年份所属的区间,y轴为各区间中电影数量的总数。