This topic describes how to use the Hadoop on Azure portal to provision a new Apache Hadoop cluster. Clusters provisioned on the portal are temporary and expire. These clusters are provisioned
to run jobs that process input data, which may be on the cluster or located elsewhere. For example, the data could reside in an Azure account.
The advantage of using a temporary cluster is that there is no cost to maintain the hardware used for parallel processing of Map/Reduce jobs once the jobs are completed. You use the cluster, then release it or allow it to expire. Apache
Hadoop deployment solutions are also available for deploying Apache Hadoop to a Windows Azure account or to on-premises hardware that you manage yourself.
The Windows Azure Map/Reduce Service (or the Hadoop-based service on Azure) is available by invitation only during the Community Technical Preview (CTP). The purpose of this CTP is for you to test the Hadoop-based
service on Azure, become more familiar with it, and provide feedback. The process is:
Fill out the linked Hadoop-based service on Azure CTP Connect Survey, providing information about yourself, your business domain, and your technical scenario. You will need a LiveID (e.g. Hotmail.com, Live.com) and a completed Connect profile.
Once you receive an invite code, go to HadoopOnAzure.com and click
If it is your first time, you will be asked to allow HadoopOnAzure.com to access your LiveID profile information. Click Yes.
In the “Got a code?” dialog, enter your invite code and click
For your new account, your first task will be to create a Hadoop on Azure cluster. Your options are:
When you are done, click Request cluster (in the right sidebar, under the green bar).
At this point, the Hadoop on Azure service is creating the new cluster. This takes from a few minutes to tens of minutes, depending on the number of nodes and the number of clusters being created at the time.
Once the cluster is set up, your portal page will be similar to the one below. Now you are good to go!