研究文学的人通常需要统计一篇文章【用英文写】中某个人名、地名或形容词出现的次数和位置，请帮我写一个统计系统，完成以下功能，其中文章存放在一个名为wen.txt的文件中，要查找的人名，地名和形容词一次性输入在另一个名为cha.txt的文本文件中，要求系统读取wen.txt和cha.txt这两个文件并运行一次，全部统计完成，并把结果存放在文件result.txt中，并且统计所有查找对象出现的概率使用基于c语言的代码

时间: 2024-02-06 07:10:57 浏览: 54

统计文本文件中一段英文中某个单词出现的次数及其出现的位置

4星 · 用户满意度95%

统计文本文件中一段英文中某个单词出现的次数及其出现的位置本知识点介绍了一个使用 C++ 编程语言编写的程序，旨在统计给定英文文本文件中某个英文单词的出现次数及其出现的位置。下面是对该程序的详细解释：程序结构该程序主要由四个部分组成： 1. 头文件包含与宏定义：该部分包含了必要的头文件 `stdio.h` 和 `string.h`，并且定义了一个宏 `MAX_size`，用于限制字符串的最大长度。 2. 函数 `_hke` 和 `_loHKE`：这两个函数用于判断字符是否为英文字符（包括小写和大写）和将大写字符转换为小写字符。 3. 函数 `Index`：该函数是程序的核心部分，负责统计给定单词在文本文件中的出现次数及其出现的位置。 4. 主函数 `main`：该函数负责读取文本文件，获取用户输入的单词，并调用 `Index` 函数来统计单词的出现次数及其出现的位置。统计单词出现次数及其出现的位置函数 `Index` 负责统计给定单词在文本文件中的出现次数及其出现的位置。该函数的参数包括： * `str`：文本文件中的字符串 * `word`：用户输入的单词 * `position`：用于存储单词出现的位置函数 `Index` 的执行过程可以分为以下步骤： 1. 初始化变量 `word_number`，用于统计文本文件中的单词数量。 2. 遍历文本文件中的每个字符，使用函数 `_hke` 判断字符是否为英文字符。 3. 如果字符为英文字符，则将其加入到 `str` 中，并将 `word_number` 加 1。 4. 对于每个单词，使用函数 `_loHKE` 将其转换为小写字符，以便与用户输入的单词进行比较。 5. 如果发现单词与用户输入的单词相同，则将其出现的位置存储到 `position` 数组中，并将 `times` 加 1。 6. 如果单词不相同，则继续遍历文本文件中的下一个字符。主函数主函数 `main` 负责读取文本文件，获取用户输入的单词，并调用 `Index` 函数来统计单词的出现次数及其出现的位置。该函数的执行过程可以分为以下步骤： 1. 打开文本文件并读取其内容到 `str` 中。 2. 获取用户输入的单词。 3. 调用 `Index` 函数来统计单词的出现次数及其出现的位置。 4. 如果单词未出现，则输出提示信息“ There is no such word you entered in the article.”。 5. 否则，输出单词的出现次数及其出现的位置。总结本程序使用 C++ 编程语言，统计给定英文文本文件中某个英文单词的出现次数及其出现的位置。该程序使用了两个函数 `_hke` 和 `_loHKE` 来判断字符是否为英文字符和将大写字符转换为小写字符，并使用函数 `Index` 来统计单词的出现次数及其出现的位置。该程序可以应用于文本分析、信息检索等领域。

好的，我可以为你编写一个基于C语言的统计系统。以下是代码： ``` #include <stdio.h> #include <stdlib.h> #include <string.h> #define MAX_WORD_LEN 50 #define MAX_FILE_LEN 100000 typedef struct { char word[MAX_WORD_LEN]; int count; int *positions; int pos_count; double probability; } Word; typedef struct { Word *words; int count; } WordList; int count_words(char *str, char *word) { int count = 0; char *pos = str; while ((pos = strstr(pos, word)) != NULL) { count++; pos += strlen(word); } return count; } WordList *load_word_list(char *filename) { FILE *fp = fopen(filename, "r"); if (fp == NULL) { printf("Failed to open file: %s\n", filename); exit(1); } char buf[MAX_WORD_LEN]; WordList *word_list = (WordList *)malloc(sizeof(WordList)); word_list->words = (Word *)malloc(sizeof(Word)); word_list->count = 0; while (fgets(buf, MAX_WORD_LEN, fp) != NULL) { buf[strcspn(buf, "\r\n")] = '\0'; // remove newline characters word_list->words = (Word *)realloc(word_list->words, (word_list->count + 1) * sizeof(Word)); strcpy(word_list->words[word_list->count].word, buf); word_list->words[word_list->count].count = 0; word_list->words[word_list->count].positions = NULL; word_list->words[word_list->count].pos_count = 0; word_list->words[word_list->count].probability = 0.0; word_list->count++; } fclose(fp); return word_list; } void count_words_in_file(char *filename, WordList *word_list) { FILE *fp = fopen(filename, "r"); if (fp == NULL) { printf("Failed to open file: %s\n", filename); exit(1); } char *file_contents = (char *)malloc(MAX_FILE_LEN); fread(file_contents, 1, MAX_FILE_LEN, fp); fclose(fp); // count occurrences of each word in the file char *pos = file_contents; while (*pos != '\0') { for (int i = 0; i < word_list->count; i++) { int count = count_words(pos, word_list->words[i].word); if (count > 0) { word_list->words[i].count += count; word_list->words[i].positions = (int *)realloc(word_list->words[i].positions, (word_list->words[i].pos_count + count) * sizeof(int)); char *word_pos = strstr(pos, word_list->words[i].word); for (int j = 0; j < count; j++) { word_list->words[i].positions[word_list->words[i].pos_count] = word_pos - file_contents; word_list->words[i].pos_count++; word_pos += strlen(word_list->words[i].word); } } } pos++; } free(file_contents); } void calculate_probabilities(WordList *word_list, int total_words) { for (int i = 0; i < word_list->count; i++) { word_list->words[i].probability = (double)word_list->words[i].count / total_words; } } void write_results_to_file(char *filename, WordList *word_list, int total_words) { FILE *fp = fopen(filename, "w"); if (fp == NULL) { printf("Failed to open file: %s\n", filename); exit(1); } for (int i = 0; i < word_list->count; i++) { fprintf(fp, "%s:\n", word_list->words[i].word); fprintf(fp, "\tCount: %d\n", word_list->words[i].count); fprintf(fp, "\tPositions: "); for (int j = 0; j < word_list->words[i].pos_count; j++) { fprintf(fp, "%d", word_list->words[i].positions[j]); if (j < word_list->words[i].pos_count - 1) { fprintf(fp, ", "); } } fprintf(fp, "\n"); fprintf(fp, "\tProbability: %.2f%%\n", word_list->words[i].probability * 100); } fprintf(fp, "Total words: %d\n", total_words); fclose(fp); } int main() { WordList *word_list = load_word_list("cha.txt"); count_words_in_file("wen.txt", word_list); int total_words = 0; for (int i = 0; i < word_list->count; i++) { total_words += word_list->words[i].count; } calculate_probabilities(word_list, total_words); write_results_to_file("result.txt", word_list, total_words); printf("Done!\n"); return 0; } ``` 这个程序定义了一个 `Word` 结构体，用于存储一个单词的信息，包括单词本身、出现次数、出现位置等。`WordList` 结构体则用于存储所有要查找的单词，包括它们的出现次数和位置。程序从两个文件中读取要查找的单词和要搜索的文章，然后统计所有单词在文章中的出现次数和位置，并计算每个单词出现的概率，最后将结果写入到一个名为 `result.txt` 的文件中。

阅读全文

相关推荐

c语言 统计字符出现次数.txt

查找英语文章中要出现的单词，统计其个数

自然语言处理：中文分词，打标签，文章匹配相似度，机器学习.zip

2017版高考英语一轮复习语法专题第二部分词法篇_其他词类专题2冠词素材外研版

人教PEP英语六年级下册期末复习冠词PPT学习教案.pptx

四年级英语下册Unit3WeatherPartB第3课时作业人教PEP版202004092154

大学英语四六级应试技巧阅读篇PPT学习教案.pptx

日语修订版第一册单词(含词例全).docx

学前班应掌握字词.doc

四级长篇阅读匹配题方法篇.doc

七年级j完形填空100篇精选.doc

大学英语新四级阅读PPT学习教案.pptx

考研英语阅读理解10种题型解题技巧.doc

成考英语答题技巧及复习大纲学习教案.pptx

初中英语牛津上海七年级下册语法课PPT学习教案.pptx

外研版小升初英语重点知识积累以及常考题型.ppt

2013年九年级英语上册 Module 3 Sporting life综合检测题 外研版

最新推荐

李白高力士脱靴李白贺知章告别课本剧.pptx

高清艺术文字图标资源，PNG和ICO格式免费下载

管理建模和仿真的文件

DMA技术：绕过CPU实现高效数据传输

SGM8701电压比较器如何在低功耗电池供电系统中实现高效率运作？

mui框架HTML5应用界面组件使用示例教程

"互动学习：行动中的多样性与论文攻读经历"

【数据传输高速公路】：总线系统的深度解析

如何结合PID算法调整PWM信号来优化电机速度控制？请提供实现这一过程的步骤和代码示例。

Vue.js开发利器：chrome-vue-devtools插件解析

c语言统计字符出现次数.txt

2013年九年级英语上册 Module 3 Sporting life综合检测题外研版