Use pandas to solve the following problem: City staff need to compile statistics on the city's population records, which contain name, gender, age, and community. The data contains duplicate values, missing values, and unreasonable values. Handle them as follows.

Duplicate values: name: if a name appears more than once, keep the first occurrence.

Missing values: name: drop the row. gender: fill in Unknown. age: fill in 0. community: fill in Unknown.

Unreasonable values: age: if the age is less than 0, set it to 0.

Then output the average age of each gender in each community.

Input: the path name of the file, read with filename = input().

Output: print the output dataframe as in the sample.

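Here is a pandas implementation that solves the problem:

```python
import pandas as pd

# Read the data from the path given on standard input
filename = input()
df = pd.read_csv(filename)

# Duplicate values: keep the first row for each name
df.drop_duplicates(subset=['name'], keep='first', inplace=True)

# Missing values: drop rows without a name, fill the other columns
df.dropna(subset=['name'], inplace=True)
df.fillna({'gender': 'Unknown', 'age': 0, 'community': 'Unknown'}, inplace=True)

# Unreasonable values: negative ages become 0
df['age'] = df['age'].clip(lower=0)

# Average age of each gender in each community
result = df.groupby(['community', 'gender']).agg({'age': 'mean'})

# Print the result
print(result)
```

The code first reads the file, then handles duplicate values, missing values, and unreasonable values in that order. Finally, it uses `groupby` to compute the average age for each community and gender and prints the result. Missing ages are filled with 0 before averaging, as the task requires, so they count toward the mean.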
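To check the cleaning steps without a file on disk, the same logic can be run on a small in-memory CSV. This is only a sketch: the sample rows, the `SAMPLE_CSV` constant, and the `clean_and_summarize` helper are made up for illustration and are not part of the original task or its sample data.

```python
import io

import pandas as pd

# Hypothetical sample data: one duplicate name, one missing name,
# one missing gender, one missing age/community, one negative age.
SAMPLE_CSV = """name,gender,age,community
Alice,F,30,North
Alice,F,31,North
Bob,M,-5,North
,M,40,South
Carol,,25,South
Dave,M,,
Eve,F,50,North
"""


def clean_and_summarize(source):
    """Apply the cleaning rules, then average age per community and gender."""
    df = pd.read_csv(source)
    # Duplicate values: keep the first row for each name
    df = df.drop_duplicates(subset=["name"], keep="first")
    # Missing values: drop rows without a name, fill the other columns
    df = df.dropna(subset=["name"])
    df = df.fillna({"gender": "Unknown", "age": 0, "community": "Unknown"})
    # Unreasonable values: negative ages become 0
    df["age"] = df["age"].clip(lower=0)
    return df.groupby(["community", "gender"]).agg({"age": "mean"})


result = clean_and_summarize(io.StringIO(SAMPLE_CSV))
print(result)
```

With these made-up rows, the second Alice and the nameless row are dropped, Bob's age becomes 0, and Dave ends up in the `Unknown` community with age 0, so the result has four groups. Passing a file path instead of the `StringIO` object works the same way, since `pd.read_csv` accepts both.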

