idea中用mapper reduce提取一个文本文件数据分为5000及以上， 3000~5000， 3000以下

时间: 2024-03-23 16:41:56 浏览: 77

idea好用的插件：Free Mybatis自动对应mapper层的xml文件

可以使用MapReduce来实现对文本文件数据的提取，具体操作步骤如下： 1. 首先，使用Map函数将文本文件中的每一行数据进行拆分和处理，将每个单词或数字统计出现次数，并输出：<key, value>键值对，其中key为数据范围，value为1。 2. 然后，使用Reduce函数对Map函数输出的数据进行归并和统计，计算出每个数据范围中所有单词或数字出现的总次数，并输出：<key, value>键值对，其中key为数据范围，value为出现次数。 3. 最后，将Reduce函数输出的结果保存到文件中，即可得到文本文件数据分为5000及以上，3000~5000和3000以下的统计结果。下面是具体的代码实现： Mapper函数： ``` public class WordCountMapper extends Mapper<Object, Text, Text, IntWritable> { private final static IntWritable one = new IntWritable(1); private Text word = new Text(); public void map(Object key, Text value, Context context) throws IOException, InterruptedException { String[] words = value.toString().split(" "); int count = words.length; if (count >= 5000) { word.set("5000及以上"); } else if (count >= 3000 && count < 5000) { word.set("3000~5000"); } else { word.set("3000以下"); } context.write(word, one); } } ``` Reducer函数： ``` public class WordCountReducer extends Reducer<Text, IntWritable, Text, IntWritable> { private IntWritable result = new IntWritable(); public void reduce(Text key, Iterable<IntWritable> values, Context context) throws IOException, InterruptedException { int sum = 0; for (IntWritable val : values) { sum += val.get(); } result.set(sum); context.write(key, result); } } ``` 在MapReduce作业的Driver类中，设置输入路径、输出路径、Mapper和Reducer类等信息，并提交作业即可。

阅读全文

idea中用mapper reduce提取一个文本文件数据 分为5000及以上， 3000~5000， 3000以下

相关推荐

本人博客里mapperreduce学习基础里文章的数据来源

IDEA中resources包下mybatis主配置文件config与映射配置文件mapper

idea中用mapper reduce提取一个文本文件数据，文件有价格，名字，厂商，等数据 分为价格5000及以上， 3000~5000， 3000以下，

Idea运行MapperReduce本地环境配置(win10).rar

MapperReduce

用idea的 MapperReduce方法分析以下需求：手机高端机（5000及以上），中端机（3000~5000）和低端机（3000以下）

mapper-reduce

idea工具中直接从mapper.Java文件中跳转到mapper.xml文件的插件,挺不错的

如何使用IDEA创建MAPPER模板过程图解

ERmapper信息提取流程参照.pdf

Idea如何去除Mapper警告方法解析

Global Mapper中SHP文件转KML文件方法

摄影测量 EO 文本文件自动生成Footprint（需要Global Mapper)

基于python django的多商家网上商城平台分账系统资料齐全+详细文档.zip

将JSON数据类型一键转换为易语言自定义数据类型,喜欢的给个star吧!欢迎贡献code.zip

数据结构-顺序表的实现代码

Java语言编写的简易自然数计算的程序图形界面.zip

基于Scikit-learn与Python结合实现的气象预报以及气象动态展示系统资料齐全+详细文档.zip

最新推荐

IDEA中resources包下mybatis主配置文件config与映射配置文件mapper

Spring MVC配置双数据源实现一个java项目同时连接两个数据库的方法

Java的MyBatis框架中Mapper映射配置的使用及原理解析

Spring Boot集成MyBatis实现通用Mapper的配置及使用

通过Spring Boot配置动态数据源访问多个数据库的实现代码

Raspberry Pi OpenCL驱动程序安装与QEMU仿真指南

管理建模和仿真的文件

Fluent UDF实战攻略：案例分析与高效代码编写

如何使用DPDK技术在云数据中心中实现高效率的流量监控与网络安全分析？

Apache RocketMQ Go客户端：全面支持与消息处理功能

idea中用mapper reduce提取一个文本文件数据分为5000及以上， 3000~5000， 3000以下

idea中用mapper reduce提取一个文本文件数据，文件有价格，名字，厂商，等数据分为价格5000及以上， 3000~5000， 3000以下，