编写Java程序白鹿原词频统计

编写Java程序来统计白鹿原这本书中的词频，通常需要以下几个步骤： 1. **数据准备**：首先，你需要有一个文本文件，包含《白鹿原》的内容。可以将书籍内容读取到字符串变量中。 ```java String bookContent = readFile("白鹿原.txt"); ``` 2. **分词处理**：接着，使用正则表达式或者现成的库（如开源的Jieba Java版等）对文本进行分词。 ```java List<String> words = Arrays.asList(bookContent.split("\\s+")); ``` 3. **创建哈希表**：使用HashMap存储每个单词及其出现的次数，键是单词，值是计数。 ```java Map<String, Integer> wordCount = new HashMap<>(); for (String word : words) { wordCount.put(word, wordCount.getOrDefault(word, 0) + 1); } ``` 4. **统计并打印结果**：遍历哈希表，输出每个单词及其频率。 ```java wordCount.forEach((word, count) -> System.out.println(String.format("%s: %d", word, count))); ``` 5. **异常处理**：记得处理可能出现的错误，比如文件读取失败、空格分割导致的无效单词等。完整示例代码片段： ```java import java.io.BufferedReader; import java.io.FileReader; import java.util.HashMap; import java.util.List; public class WordFrequencyCounter { public static void main(String[] args) { try { String bookFile = "白鹿原.txt"; List<String> words = getWordsFromFile(bookFile); Map<String, Integer> wordCount = countWords(words); printWordFrequency(wordCount); } catch (Exception e) { e.printStackTrace(); } } private static List<String> getWordsFromFile(String fileName) throws Exception { // 使用BufferedReader读取文件内容，并分词 BufferedReader reader = new BufferedReader(new FileReader(fileName)); StringBuilder content = new StringBuilder(); String line; while ((line = reader.readLine()) != null) { content.append(line).append(" "); } reader.close(); return Arrays.asList(content.toString().split("\\s+")); } private static Map<String, Integer> countWords(List<String> words) { // 创建并统计词频 Map<String, Integer> wordCount = new HashMap<>(); for (String word : words) { wordCount.put(word, wordCount.getOrDefault(word, 0) + 1); } return wordCount; } private static void printWordFrequency(Map<String, Integer> wordCount) { wordCount.forEach((word, count) -> System.out.println(String.format("%s: %d", word, count))); } } ```

阅读全文

编写Java程序白鹿原词频统计

相关推荐

Python-[jieba库应用]-统计水浒传中人物出现次数

浅析《白鹿原》中女性形象分析.doc

白鹿原词频统计python

白鹿原词频统计python123

论陈忠实《白鹿原》中白嘉轩形象及其文化内涵.docx

open("白鹿原.txt"'r',encoding='UTF-8')中的r是什么意思

在执行网页某个功能时，报错APP Referer校验失败，程序中那个位置对应

程序可以在ftp云服务器中 写入路径，但无法写入文件或者图片，怎么解决

卡通风格化魔法术技能粒子特效 ：Toon Projectiles 2 1.0

在 MATLAB GUI 中动态更新数据：策略与实践

【JCR一区级】Matlab实现白鹭群优化算法ESOA-CNN-BiLSTM-Attention的故障诊断算法研究.rar

信创实验室建设方案（24页）.pptx

KGBrowserSetup-x86-V1.0.0.100-20190315.exe

obspy-1.2.2-cp38-cp38-win_amd64.whl

数字政府大数据政务云平台顶层设计方案(75页）.pptx

Pillow-9.1.1-cp37-cp37m-win_amd64.whl

mxnet-1.7.0+mkl-cp36-cp36m-win_amd64.whl

HO河马优化算法特征选择并同时优化XGBOOST参数数据分类预测（Matlab完整源码和数据)

算法部署-使用OpenVINO在Intel-CPU上部署StableDiffusion图像生成扩散模型-附项目源码-优质项目实战

最新推荐

卡通风格化魔法术技能粒子特效 ：Toon Projectiles 2 1.0

明日知道社区问答系统设计与实现-SSM框架java源码分享

管理建模和仿真的文件

C#单元测试实战：在Visual Studio中打造强大测试框架

现需完成模拟超市收银操作，输入购买的某件商品的单价和数量，输出应付的款项和相关信息。（需要有清晰的输入提示和具体的输出信息提示。）用c语言写代码

Unity3D粒子特效包：闪电效果体验报告

"互动学习：行动中的多样性与论文攻读经历"

Visual Studio代码重构：简化代码，增强可维护性的秘密

用java写购买机票

Windows64位Python3.7安装Twisted库指南

程序可以在ftp云服务器中写入路径，但无法写入文件或者图片，怎么解决

卡通风格化魔法术技能粒子特效：Toon Projectiles 2 1.0

卡通风格化魔法术技能粒子特效：Toon Projectiles 2 1.0