Writing a MapReduce program to compute the sum, average (integer part), maximum, minimum, and the three largest numbers
Hadoop's MapReduce framework lends itself to this kind of numeric aggregation over a text file.
Suppose there is an input file input.txt containing one integer per line. The MapReduce program below computes the sum, the average (integer part), the maximum, the minimum, and the three largest numbers.
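For illustration, here is one possible input.txt consistent with the sample output at the end of this post (ten integers, one per line):
```
1
10
20
30
40
50
60
97
98
99
```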
Mapper stage:
```
import java.io.IOException;
import java.util.PriorityQueue;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

public class MyMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    private static final String SUM = "Sum";
    private static final String COUNT = "Count";
    private static final String MAX = "Max";
    private static final String MIN = "Min";
    private static final String TOP3 = "Top3";
    private static final int TOP_NUM = 3;

    // Per-mapper partial aggregates; emitting them once in cleanup()
    // instead of once per record keeps shuffle traffic small.
    private int sum = 0;
    private int count = 0;
    private int max = Integer.MIN_VALUE;
    private int min = Integer.MAX_VALUE;
    // A natural-order PriorityQueue is a min-heap: its head is the smallest
    // retained value, so evicting the head keeps the TOP_NUM largest values.
    // (A reverse-order queue here would keep the three smallest instead.)
    private final PriorityQueue<Integer> top3 = new PriorityQueue<>(TOP_NUM);

    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        String line = value.toString().trim();
        if (line.isEmpty()) {
            return; // skip blank lines
        }
        int num = Integer.parseInt(line);
        sum += num;
        count++;
        max = Math.max(max, num);
        min = Math.min(min, num);
        top3.offer(num);
        if (top3.size() > TOP_NUM) {
            top3.poll(); // evict the smallest candidate
        }
    }

    @Override
    protected void cleanup(Context context) throws IOException, InterruptedException {
        if (count == 0) {
            return; // this mapper saw no numbers
        }
        context.write(new Text(SUM), new IntWritable(sum));
        context.write(new Text(COUNT), new IntWritable(count));
        context.write(new Text(MAX), new IntWritable(max));
        context.write(new Text(MIN), new IntWritable(min));
        for (Integer num : top3) {
            context.write(new Text(TOP3), new IntWritable(num));
        }
    }
}
```
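The top-k heap is the easiest part to get backwards: a max-heap polled on overflow keeps the three smallest values, not the three largest. As a sanity check, here is the same min-heap pattern as a self-contained, illustrative plain-Java snippet (class name and sample numbers are made up for the demo):
```
import java.util.PriorityQueue;

public class TopKDemo {
    public static void main(String[] args) {
        int k = 3;
        int[] nums = {1, 10, 20, 30, 40, 50, 60, 97, 98, 99};
        // Min-heap: the head is the smallest retained value, so evicting
        // the head whenever the size exceeds k keeps the k largest values.
        PriorityQueue<Integer> topK = new PriorityQueue<>(k);
        for (int num : nums) {
            topK.offer(num);
            if (topK.size() > k) {
                topK.poll(); // evict the smallest candidate
            }
        }
        // Draining the heap yields ascending order: prints 97, 98, 99.
        while (!topK.isEmpty()) {
            System.out.println(topK.poll());
        }
    }
}
```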
Reducer stage:
```
import java.io.IOException;
import java.util.PriorityQueue;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;

public class MyReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    private static final String SUM = "Sum";
    private static final String COUNT = "Count";
    private static final String MAX = "Max";
    private static final String MIN = "Min";
    private static final String TOP3 = "Top3";
    private static final String AVERAGE = "Average";
    private static final int TOP_NUM = 3;

    // Remembered across reduce() calls so the average can be written in
    // cleanup(). This requires a single reducer (see the driver); otherwise
    // the Sum and Count keys could land in different reduce tasks.
    private long totalSum = 0;
    private long totalCount = 0;

    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context context)
            throws IOException, InterruptedException {
        int sum = 0;
        int max = Integer.MIN_VALUE;
        int min = Integer.MAX_VALUE;
        // Min-heap again: keeps the TOP_NUM largest of the mappers' candidates.
        PriorityQueue<Integer> top3 = new PriorityQueue<>(TOP_NUM);
        for (IntWritable value : values) {
            int num = value.get();
            sum += num;
            max = Math.max(max, num);
            min = Math.min(min, num);
            top3.offer(num);
            if (top3.size() > TOP_NUM) {
                top3.poll(); // evict the smallest candidate
            }
        }
        String k = key.toString();
        if (k.equals(SUM)) {
            totalSum = sum;
            context.write(new Text(SUM), new IntWritable(sum));
        } else if (k.equals(COUNT)) {
            // The mappers emit partial counts, so the global count is their sum.
            totalCount = sum;
            context.write(new Text(COUNT), new IntWritable(sum));
        } else if (k.equals(MAX)) {
            context.write(new Text(MAX), new IntWritable(max));
        } else if (k.equals(MIN)) {
            context.write(new Text(MIN), new IntWritable(min));
        } else if (k.equals(TOP3)) {
            // Drain the min-heap so the three values come out in ascending order.
            while (!top3.isEmpty()) {
                context.write(new Text(TOP3), new IntWritable(top3.poll()));
            }
        }
    }

    @Override
    protected void cleanup(Context context) throws IOException, InterruptedException {
        // Average (integer part), written after all reduce() calls have run.
        if (totalCount > 0) {
            context.write(new Text(AVERAGE), new IntWritable((int) (totalSum / totalCount)));
        }
    }
}
```
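A note on the design: the average cannot be computed inside a single reduce() call, because the Sum and Count keys arrive in separate calls. The reducer therefore remembers both totals in instance fields and writes the average in cleanup(), which runs after all reduce() calls in the task; this is also why the driver below pins the job to a single reducer.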
Driver:
```
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class MyDriver {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "My MapReduce");
        job.setJarByClass(MyDriver.class);
        job.setMapperClass(MyMapper.class);
        job.setReducerClass(MyReducer.class);
        // One reducer, so that the Sum and Count keys meet in the same task
        // and the average can be computed in the reducer's cleanup().
        job.setNumReduceTasks(1);
        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(IntWritable.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```
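Before running, the classes need to be compiled against the Hadoop libraries, packaged into a jar, and the input file uploaded to HDFS. A minimal sketch, using the class and jar names above (the output directory layout is illustrative):
```
javac -classpath "$(hadoop classpath)" -d classes MyMapper.java MyReducer.java MyDriver.java
jar cf myjar.jar -C classes .
hdfs dfs -put input.txt input.txt
```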
Run:
```
hadoop jar myjar.jar MyDriver input.txt output
```
Output (in output/part-r-00000; the average is written last because it comes from the reducer's cleanup()):
```
Count 10
Max 99
Min 1
Sum 505
Top3 97
Top3 98
Top3 99
Average 50
```