java程序提交mapreduce

作业由于本人是AI语言模型，无法提交Java程序，但是可以提供Java程序的基本框架和示例代码供您参考： MapReduce程序基本框架： ``` import org.apache.hadoop.conf.Configuration; import org.apache.hadoop.fs.Path; import org.apache.hadoop.io.*; import org.apache.hadoop.mapreduce.Job; import org.apache.hadoop.mapreduce.Mapper; import org.apache.hadoop.mapreduce.Reducer; import org.apache.hadoop.mapreduce.lib.input.FileInputFormat; import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat; public class WordCount { public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> { private final static IntWritable one = new IntWritable(1); private Text word = new Text(); public void map(Object key, Text value, Context context) throws IOException, InterruptedException { StringTokenizer itr = new StringTokenizer(value.toString()); while (itr.hasMoreTokens()) { word.set(itr.nextToken()); context.write(word, one); } } } public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> { private IntWritable result = new IntWritable(); public void reduce(Text key, Iterable<IntWritable> values, Context context) throws IOException, InterruptedException { int sum = 0; for (IntWritable val : values) { sum += val.get(); } result.set(sum); context.write(key, result); } } public static void main(String[] args) throws Exception { Configuration conf = new Configuration(); Job job = Job.getInstance(conf, "word count"); job.setJarByClass(WordCount.class); job.setMapperClass(TokenizerMapper.class); job.setCombinerClass(IntSumReducer.class); job.setReducerClass(IntSumReducer.class); job.setOutputKeyClass(Text.class); job.setOutputValueClass(IntWritable.class); FileInputFormat.addInputPath(job, new Path(args[0])); FileOutputFormat.setOutputPath(job, new Path(args[1])); System.exit(job.waitForCompletion(true) ? 0 : 1); } } ``` 示例代码： ``` import java.io.IOException; import java.util.StringTokenizer; import org.apache.hadoop.conf.Configuration; import org.apache.hadoop.fs.Path; import org.apache.hadoop.io.*; import org.apache.hadoop.mapreduce.Job; import org.apache.hadoop.mapreduce.Mapper; import org.apache.hadoop.mapreduce.Reducer; import org.apache.hadoop.mapreduce.lib.input.FileInputFormat; import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat; public class WordCount { public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> { private final static IntWritable one = new IntWritable(1); private Text word = new Text(); public void map(Object key, Text value, Context context) throws IOException, InterruptedException { StringTokenizer itr = new StringTokenizer(value.toString()); while (itr.hasMoreTokens()) { word.set(itr.nextToken()); context.write(word, one); } } } public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> { private IntWritable result = new IntWritable(); public void reduce(Text key, Iterable<IntWritable> values, Context context) throws IOException, InterruptedException { int sum = 0; for (IntWritable val : values) { sum += val.get(); } result.set(sum); context.write(key, result); } } public static void main(String[] args) throws Exception { Configuration conf = new Configuration(); Job job = Job.getInstance(conf, "word count"); job.setJarByClass(WordCount.class); job.setMapperClass(TokenizerMapper.class); job.setCombinerClass(IntSumReducer.class); job.setReducerClass(IntSumReducer.class); job.setOutputKeyClass(Text.class); job.setOutputValueClass(IntWritable.class); FileInputFormat.addInputPath(job, new Path(args[0])); FileOutputFormat.setOutputPath(job, new Path(args[1])); System.exit(job.waitForCompletion(true) ? 0 : 1); } } ``` 使用命令行提交MapReduce作业： 1. 将示例代码保存为WordCount.java文件 2. 在命令行中使用以下命令进行编译： ``` javac -classpath `yarn classpath` WordCount.java ``` 3. 打包： ``` jar -cvf WordCount.jar *.class ``` 4. 提交MapReduce作业： ``` yarn jar WordCount.jar WordCount /input /output ``` 其中，/input是输入文件的路径，/output是输出文件的路径。运行完成后，在输出文件夹中查看结果。

阅读全文

java程序提交mapreduce

相关推荐

mapreduce程序

java 矩阵乘法的mapreduce程序实现

mapreduce.jar

java程序提交mapreduce报错Usage : yarn jar wc.jar com.ch02.mr.wordcount.MainClass <input path> <output path>

Java操作Hadoop Mapreduce基本实践源码

深入探究如何使用Java编写MapReduce程序.rar

Java程序实现Hadoop MapReduce自定义计数器

Java实现Hadoop MapReduce处理无向图算法

使用Java编写MapReduce WordCount示例程序

用java编写mapreduce程序

java编写一个mapreduce程序，提交到高可用集群运行

mapreduce基础实战-基于Java的MapReduce文本词频统计实战教程

Java-MapReduce:用Java文件输入_输出实现MepReduce算法的示例

Java实现MapReduce矩阵乘法详解

Java实现的Hadoop MapReduce项目实战

Java MapReduce编程深入解析

使用“hadoop jar”命令提交 MapReduce 任务 的格式

如何使用Java编写MapReduce程序来计算学生数据集中每个学生的平均分和总分？

Java用mapreduce输出数据保存到本地

在Hadoop伪分布式环境下，如何使用Java编写MapReduce程序来完成词频统计任务？请结合《Hadoop MapReduce实验：词频统计详解》提供详细步骤。

大家在看

TwinSAFE EL6900 安全模块基础使用指南（针对TC3.1.4020.0版本）.pdf

南京工业大学Python程序设计语言题库及答案

泊松分布MATLAB代码-RJNS3D_VER_1.1:离散断裂网络建模

Skill.wz_冒险岛079WZ_079skill.wz_冒险岛的_冒险岛Skill.wz_冒险岛服务端_

Multisim里的NPN三极管参数资料大全.docx

最新推荐

使用python实现mapreduce（wordcount）.doc

mapreduce测试网站

Hadoop_MapReduce教程.doc

自动删除hal库spendsv、svc以及systick中断

世界地图Shapefile文件解析与测试指南

Python环境监控高可用构建：可靠性增强的策略

需要在matlab当中批量导入表格数据的指令

Sqlcipher 3.4.0版本发布，优化SQLite兼容性

Python环境监控性能监控与调优：专家级技巧全集

simulinlk怎么插入线

使用“hadoop jar”命令提交 MapReduce 任务的格式