C语言编程【问题描述】编写程序统计一个英文文本文件中每个单词的出现次数（词频统计），并将统计结果按单词出现频率由高至低输出到指定文件中。

这是一个比较典型的文本处理问题，需要进行字符串的分割和哈希表的统计。具体思路如下： 1. 打开输入文件和输出文件，读取输入文件中的内容。 2. 对输入文件中的内容进行分割，将每个单词存储到哈希表中，并统计每个单词的出现次数。 3. 对哈希表中的单词按照出现频率从高到低进行排序。 4. 将排序后的结果输出到指定文件中。以下是对应的 C 语言代码实现： ```c #include <stdio.h> #include <stdlib.h> #include <string.h> #define MAX_LINE 1024 #define MAX_WORD 128 #define MAX_HASH 10007 typedef struct WordNode { char* word; int count; struct WordNode* next; } WordNode; WordNode* hashTable[MAX_HASH] = { NULL }; // 哈希函数 unsigned int hash(char* str) { unsigned int hash = 0; while (*str) { hash = hash * 31 + (*str++); } return hash % MAX_HASH; } // 插入单词到哈希表中 void insert(char* word) { unsigned int key = hash(word); WordNode* node = hashTable[key]; while (node != NULL) { if (strcmp(node->word, word) == 0) { node->count++; return; } node = node->next; } node = (WordNode*)malloc(sizeof(WordNode)); node->word = (char*)malloc(strlen(word) + 1); strcpy(node->word, word); node->count = 1; node->next = hashTable[key]; hashTable[key] = node; } // 释放哈希表中的内存 void freeHashTable() { for (int i = 0; i < MAX_HASH; i++) { WordNode* node = hashTable[i]; while (node != NULL) { WordNode* next = node->next; free(node->word); free(node); node = next; } } } // 比较函数，用于排序 int compare(const void* a, const void* b) { WordNode* node1 = *(WordNode**)a; WordNode* node2 = *(WordNode**)b; return node2->count - node1->count; } int main() { char inputFileName[MAX_WORD], outputFileName[MAX_WORD]; printf("Input file name: "); scanf("%s", inputFileName); printf("Output file name: "); scanf("%s", outputFileName); FILE* inputFile = fopen(inputFileName, "r"); if (inputFile == NULL) { printf("Failed to open input file.\n"); return 1; } char line[MAX_LINE]; while (fgets(line, MAX_LINE, inputFile)) { char* word = strtok(line, " \t\n\r\f.,:;?!()"); while (word != NULL) { insert(word); word = strtok(NULL, " \t\n\r\f.,:;?!()"); } } fclose(inputFile); WordNode* wordList[MAX_HASH]; int count = 0; for (int i = 0; i < MAX_HASH; i++) { WordNode* node = hashTable[i]; while (node != NULL) { wordList[count++] = node; node = node->next; } } qsort(wordList, count, sizeof(WordNode*), compare); FILE* outputFile = fopen(outputFileName, "w"); if (outputFile == NULL) { printf("Failed to open output file.\n"); freeHashTable(); return 1; } for (int i = 0; i < count; i++) { fprintf(outputFile, "%s %d\n", wordList[i]->word, wordList[i]->count); } fclose(outputFile); freeHashTable(); return 0; } ``` 需要注意的是，这里使用了链表法来解决哈希冲突，同时也需要在程序结束之后释放哈希表中的内存。另外，为了方便起见，这里使用了标准库中的快速排序函数 qsort 来对单词按照出现频率进行排序。

阅读全文

C语言编程【问题描述】 编写程序统计一个英文文本文件中每个单词的出现次数（词频统计），并将统计结果按单词出现频率 由高至低输出到指定文件中。

相关推荐

C语言词频统计程序设计与实现

统计文本单词个数与频率

基于C语言实现的词频统计与检索系统开发

统计文本文件中一段英文中某个单词出现的次数及其出现的位置

C语言大作业：词频统计程序

词频统计（数组或链表实现）.rar_c语言 词频_eighthpo_regionr77_tone4nn_词频统计C语言

用数组方法实现从文件中读取单词并统计单词个数

C语言版词频分析器

单词统计小程序

文档中英文词频统计-C++链表的简易使用

C语言程序设计基础ppt

文本文件检索程序代码

PersonalProject-C:wordcount-C语言提交仓库

Linux环境高级编程实验6

C语言项目实战：英语词汇词频统计及图形绘制源码解析

C语言实现词频分析器

文本分析工具：词频统计与排序方法

C语言实现的前缀树字典库

【C语言字符串流程解析】：从输入到输出的优化策略

【数据处理必备】：Python readline在文本分析中的应用

最新推荐

C语言实现英文文本词频统计

C语言中使用lex统计文本文件字符数

C语言统计一篇英文短文中单词的个数实例代码

C语言统计一串字符中空格键、Tab键、回车键、字母、数字及其他字符的个数(Ctrl+Z终止输入)

c语言作业——学生成绩统计.docx

火炬连体网络在MNIST的2D嵌入实现示例

管理建模和仿真的文件

L2正则化的终极指南：从入门到精通，揭秘机器学习中的性能优化技巧

如何构建一个符合GB/T19716和ISO/IEC13335标准的信息安全事件管理框架，并确保业务连续性规划的有效性？

Angular插件增强Application Insights JavaScript SDK功能

C语言编程【问题描述】编写程序统计一个英文文本文件中每个单词的出现次数（词频统计），并将统计结果按单词出现频率由高至低输出到指定文件中。

词频统计（数组或链表实现）.rar_c语言词频_eighthpo_regionr77_tone4nn_词频统计C语言