首页使用python3写一段代码，打开一个txt文件并使用nltk进行分词，再进行词频统计，使用matplotlib绘制图标绘制词频

使用python3写一段代码，打开一个txt文件并使用nltk进行分词，再进行词频统计，使用matplotlib绘制图标绘制词频

时间: 2024-03-26 19:34:04 浏览: 59

好的，下面是一段示例代码，其中使用了nltk和matplotlib库： ```python import nltk from nltk.tokenize import word_tokenize import matplotlib.pyplot as plt from collections import Counter # 打开文本文件并读取内容 with open('example.txt', 'r') as f: text = f.read() # 使用nltk库进行分词 tokens = word_tokenize(text) # 统计词频 freq = Counter(tokens) # 取出前10个高频词和它们的词频 top_words = freq.most_common(10) top_words, top_freqs = zip(*top_words) # 绘制柱状图 plt.bar(top_words, top_freqs) plt.title('Top 10 Most Frequent Words') plt.xlabel('Words') plt.ylabel('Frequency') plt.show() ``` 在这段代码中，我们首先打开了一个名为`example.txt`的文本文件，并使用`nltk`库中的`word_tokenize`函数对文本进行了分词处理。然后，我们使用`collections`模块的`Counter`函数统计了每个词出现的次数，并取出了前10个高频词。最后，我们使用`matplotlib`库将这些高频词的词频绘制成柱状图。

阅读全文