根据数据文件titannic.zip,使用HIVE完成以下操作 统计获救与死亡情况 统计舱位分布情况 统计港口登船人数分布情况 统计性别与生存率的关系 统计客舱等级与生存率的关系 统计登船港口与生存率的关系
时间: 2023-07-14 15:14:17 浏览: 62
以下是使用Hive完成上述需求的代码:
1. 统计获救与死亡情况
```
SELECT
CASE WHEN survived = 1 THEN '获救' ELSE '死亡' END AS status,
COUNT(*) AS count
FROM titanic
GROUP BY survived;
```
2. 统计舱位分布情况
```
SELECT
pclass AS cabin_class,
COUNT(*) AS count
FROM titanic
GROUP BY pclass;
```
3. 统计港口登船人数分布情况
```
SELECT
CASE
WHEN embarked = 'C' THEN 'Cherbourg'
WHEN embarked = 'Q' THEN 'Queenstown'
WHEN embarked = 'S' THEN 'Southampton'
ELSE 'Unknown'
END AS port,
COUNT(*) AS count
FROM titanic
GROUP BY embarked;
```
4. 统计性别与生存率的关系
```
SELECT
sex,
AVG(survived) AS survival_rate
FROM titanic
GROUP BY sex;
```
5. 统计客舱等级与生存率的关系
```
SELECT
pclass AS cabin_class,
AVG(survived) AS survival_rate
FROM titanic
GROUP BY pclass;
```
6. 统计登船港口与生存率的关系
```
SELECT
CASE
WHEN embarked = 'C' THEN 'Cherbourg'
WHEN embarked = 'Q' THEN 'Queenstown'
WHEN embarked = 'S' THEN 'Southampton'
ELSE 'Unknown'
END AS port,
AVG(survived) AS survival_rate
FROM titanic
GROUP BY embarked;
```
相关推荐
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![](https://csdnimg.cn/download_wenku/file_type_ask_c1.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![docx](https://img-home.csdnimg.cn/images/20210720083331.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)