首页import jieba with open('123.txt', 'r', encoding='utf-8') as f: text = f.read() words = jieba.lcut(text) word_counts = {} for word in words: if len(word) < 2: continue if word in word_counts: word_counts[word] += 1 else: word_counts[word] = 1 sorted_words = sorted(word_counts.items(), key=lambda x: x[1], reverse=True) nouns = [] for word, count in sorted_words: pos = jieba.lcut(word)[0].split('/')[1] if pos == 'n': nouns.append(word) if len(nouns) == 10: break print(nouns)

import jieba with open('123.txt', 'r', encoding='utf-8') as f: text = f.read() words = jieba.lcut(text) word_counts = {} for word in words: if len(word) < 2: continue if word in word_counts: word_counts[word] += 1 else: word_counts[word] = 1 sorted_words = sorted(word_counts.items(), key=lambda x: x[1], reverse=True) nouns = [] for word, count in sorted_words: pos = jieba.lcut(word)[0].split('/')[1] if pos == 'n': nouns.append(word) if len(nouns) == 10: break print(nouns)

时间: 2024-03-16 15:47:47 浏览: 114

求大佬指点，写的是三国演义中文文本的词频统计，可是总是报错（已解决）

5星 · 资源好评率100%

1.首先是这样写的： import jieba txt = open(D:/python程序/threekingdoms.txt,rt,encoding='utf-8').read() words=jieba.lcut(txt) counts={} for word in words: if len(word)==1: continue else: counts[word]=counts.get(word,0)+1 items=list(counts.items()) items.sort(key=lambda x:x[1],reverse=True)

这段代码可以计算指定文本文件中出现频率排名前10的名词。你需要将文本文件命名为“123.txt”，并将其放在与代码文件相同的目录下，然后运行这段代码即可。请注意，代码中使用的是jieba分词库，所以需要确保该库已经安装。如果没有安装，可以使用以下命令进行安装： ``` pip install jieba ``` 另外，在运行代码之前，你需要将文本文件中的内容替换为你想要分析的实际文本。

阅读全文