How to do it...

Perform the following steps to set up a Hadoop environment using the Cloudera QuickStart VM:

  1. Visit the Cloudera QuickStart VM download site (you may need to update the link as Cloudera upgrades its VMs; the current version of CDH is 5.3) at http://www.cloudera.com/content/cloudera/en/downloads/quickstart_vms/cdh-5-3-x.html:
A screenshot of the Cloudera QuickStart VM download site
  1. Depending on the virtual machine platform installed on your OS, choose the appropriate link (you may need to update the link as Cloudera upgrades its VMs) to download the VM file:
  2. Next, you can start the QuickStart VM using the virtual machine platform installed on your OS. You should see the desktop of Centos 6.2 in a few minutes:
The screenshot of Cloudera QuickStart VM.
  1. You can then open a terminal and type hadoop, which will display a list of functions that can operate a Hadoop cluster:
The terminal screenshot after typing hadoop
  1. Open a terminal and type R. Access an R session and check whether version 3.1.1 is already installed in the Cloudera QuickStart VM. If you cannot find R installed in the VM, please use the following command to install R:
        $ yum install R R-core R-core-devel R-devel  
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset