This article describes how to create Map Reduce jobs on a cluster that has been provisioned on the Hadoop on Windows Azure Portal. For more information on running Map Reduce jobs on an on-premise or Azure Hadoop cluster, see
the getting started guide for your cluster deployment type.
To run a job, you first need to create your own jar file, that is, compiled Java code that contains your Map Reduce code. For more information on how to create a jar file, you can reference these
We have also made the hadoop-examples-0.20.203.1-SNAPSHOT.jar file available, which is used below to run the pi sample. Once you are logged into the portal, download the .jar file to your local computer by clicking the Samples tile, then clicking the Pi Estimator tile. The steps in this topic walk you through manually uploading a jar file from your local computer. However, for the sample you can instead just click Deploy to your cluster.
pi, 10, 100
This indicates that, for the examples jar file, you will run the pi sample with 10 map tasks, each of which draws 100 samples; the results of all map tasks are then aggregated into a single estimate. The parameters should look much like the screenshot below.
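To make the two numeric parameters concrete, the following is a minimal plain-Java sketch of the idea behind a pi estimator. It is not the code inside the examples jar: it assumes the first number is the count of map tasks and the second is the number of samples per map, it runs everything in one process without Hadoop, and it substitutes ordinary pseudo-random sampling for whatever point sequence the bundled sample uses. The class name `PiSketch` and its method names are hypothetical and chosen only for illustration.

```java
import java.util.Random;

// Hypothetical illustration only; NOT the implementation shipped in the examples jar.
public class PiSketch {

    // "Map" step (assumed role): draw nSamples random points in the unit square
    // and count how many fall inside the quarter circle of radius 1.
    static long countInside(long nSamples, Random rng) {
        long inside = 0;
        for (long i = 0; i < nSamples; i++) {
            double x = rng.nextDouble();
            double y = rng.nextDouble();
            if (x * x + y * y <= 1.0) {
                inside++;
            }
        }
        return inside;
    }

    // "Reduce" step (assumed role): sum the per-map counts and turn the
    // inside/total ratio into an estimate of pi (quarter-circle area is pi/4).
    static double estimatePi(int nMaps, long nSamples, long seed) {
        Random rng = new Random(seed);
        long totalInside = 0;
        for (int m = 0; m < nMaps; m++) {
            totalInside += countInside(nSamples, rng);
        }
        return 4.0 * totalInside / ((double) nMaps * nSamples);
    }

    public static void main(String[] args) {
        // Mirrors the parameters "pi, 10, 100": 10 maps, 100 samples each.
        System.out.println("Estimated value of Pi is " + estimatePi(10, 100, 42L));
    }
}
```

With only 10 x 100 = 1,000 samples the estimate is coarse; raising either number increases the total sample count and generally tightens the result, at the cost of more work for the cluster.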
If you click on the task itself (e.g. the “pi #1” tile), you can re-execute the job, similar to what you did in Step #3.