首页python输出中文小说txt高频词条形图代码

python输出中文小说txt高频词条形图代码

时间: 2023-08-31 11:05:52 浏览: 53

以下是Python代码，用于输出中文小说txt的高频词条形图： ```python import jieba from collections import Counter import matplotlib.pyplot as plt # 读取小说txt文件 with open('novel.txt', 'r', encoding='utf-8') as f: text = f.read() # 对文本进行分词和去除停用词等预处理 words = [word for word in jieba.cut(text) if len(word) > 1 and word != '\n'] stopwords = ['的', '了', '是', '我', '你', '他', '她', '它', '我们', '你们', '他们', '她们', '它们'] words = [word for word in words if word not in stopwords] # 统计词频并取前20个高频词 word_counts = Counter(words) top20_words = word_counts.most_common(20) # 生成词条形图 plt.rcParams['font.sans-serif'] = ['SimHei'] plt.bar(range(len(top20_words)), [val[1] for val in top20_words]) plt.xticks(range(len(top20_words)), [val[0] for val in top20_words]) plt.title('小说高频词条形图') plt.show() ``` 需要注意的是，代码中的`novel.txt`需要替换成你要处理的小说文本文件名。此外，如果你的Python环境中没有安装jieba和matplotlib库，需要先进行安装。