How to install Hadoop?

Nicole Sim
3 min read · Jan 17, 2022

I wanted to learn Hadoop, but I ran into some difficulties with the installation. I'm sharing what I learned here to help anyone facing similar installation issues.

Important! Before you start, make sure that (1) your C drive (Downloads folder) has at least 15 GB free for downloading the sandbox, (2) you have at least 20 GB of disk space to import the sandbox image, and (3) you have another 10 GB to run the sandbox. I needed a total of 45 GB to get this working. x.x
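If you'd rather check your free space up front than find out halfway through, here is a minimal Python sketch. It uses the standard library's `shutil.disk_usage`. The function names and the example path are my own placeholders, not part of the sandbox or VirtualBox, and the thresholds are simply the numbers from my own experience above. Note that if two folders sit on the same drive, their requirements add up.

```python
import os
import shutil

GIB = 1024 ** 3  # bytes in one GiB


def free_gib(path):
    """Return the free space, in GiB, on the drive that holds `path`."""
    return shutil.disk_usage(path).free / GIB


def has_room(free, required):
    """True if `free` GiB is enough to cover `required` GiB."""
    return free >= required


def check_requirements(requirements, probe=free_gib):
    """Check a mapping of {path: required GiB}.

    Returns a list of (path, required, free, ok) tuples.
    `probe` is injectable so the logic can be tested without touching disks.
    """
    results = []
    for path, required in requirements.items():
        free = probe(path)
        results.append((path, required, free, has_room(free, required)))
    return results


if __name__ == "__main__":
    # Assumption: everything lives on the drive holding your home folder,
    # so the three requirements (15 + 20 + 10) are summed to 45.
    home = os.path.expanduser("~")
    for path, required, free, ok in check_requirements({home: 45}):
        status = "OK" if ok else "NOT ENOUGH"
        print(f"{path}: need {required} GiB, have {free:.1f} GiB -> {status}")
```

If your downloads folder and your VM folder are on different drives (as mine ended up being), pass one entry per drive instead, for example 15 for the downloads drive and 30 for the VM drive.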

1) Go to Cloudera and download Hortonworks Data Platform Sandbox v2.6.5. Link here. Select “Virtualbox” and fill in your details. The download takes up 15 GB, so make sure you have enough space for it.

2) Install Oracle VM VirtualBox Manager. Link here.

3) Open VirtualBox and open the downloaded .ova image file. Click on Import.

During the import, I got the error message “Failed to import appliance. Result Code: E_INVALIDARG (0x80070057)”. I managed to resolve this error by changing my machine base folder to my D drive, which had at least 20 GB of free disk space.
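I made this change through the VirtualBox interface, but the same thing can probably be done from the command line with `VBoxManage`, which ships with VirtualBox. The sketch below only builds the two commands (set the default machine folder, then import the appliance) and prints them. The folder and the `.ova` file name are hypothetical placeholders, so swap in your own, and it assumes `VBoxManage` is on your PATH.

```python
import subprocess


def import_commands(ova_path, machine_folder):
    """Build the VBoxManage commands to import an appliance into a chosen folder.

    1. `setproperty machinefolder` changes the default folder for new VMs.
    2. `import` imports the .ova appliance.
    """
    return [
        ["VBoxManage", "setproperty", "machinefolder", machine_folder],
        ["VBoxManage", "import", ova_path],
    ]


def run_all(commands, runner=subprocess.run):
    """Run each command in order, stopping at the first failure."""
    for cmd in commands:
        runner(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical paths: replace with your own drive and downloaded file.
    commands = import_commands(r"C:\Users\me\Downloads\sandbox.ova", r"D:\VirtualBox VMs")
    for cmd in commands:
        print(" ".join(cmd))
    # Uncomment to actually run them:
    # run_all(commands)
```

Printing first and running second is deliberate: the import takes a while and needs a lot of disk space, so it is worth double-checking the paths before committing.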

4) Click on Start to start your virtual machine. It will take a while to load.

Note: While the sandbox was loading, I hit another error message that flagged insufficient memory. I had to free up another 10 GB of disk space to run the sandbox.

5) Enter the localhost URL given for VirtualBox into your browser and you’ll see the sandbox page. Ta-da, it’s finally working!

6) Click on Launch Dashboard to get started!

7) To shut down the sandbox, click on Machine > ACPI Shutdown.

8) To restart the sandbox, repeat steps 4 to 6.
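If you prefer the command line to clicking through the menus, starting and shutting down the VM (steps 4 and 7) can be sketched with `VBoxManage` too. I did everything through the GUI, so treat this as an untested alternative. The VM name below is a hypothetical placeholder; `VBoxManage list vms` shows the real name of your imported sandbox.

```python
import subprocess


def start_command(vm_name, headless=False):
    """Build the command to start a VM, with or without a console window."""
    vm_type = "headless" if headless else "gui"
    return ["VBoxManage", "startvm", vm_name, "--type", vm_type]


def shutdown_command(vm_name):
    """Build the command that sends an ACPI power-button press,
    the command-line counterpart of Machine > ACPI Shutdown."""
    return ["VBoxManage", "controlvm", vm_name, "acpipowerbutton"]


if __name__ == "__main__":
    vm = "My Sandbox VM"  # hypothetical name, check `VBoxManage list vms`
    print(" ".join(start_command(vm)))
    print(" ".join(shutdown_command(vm)))
    # To actually run one of them:
    # subprocess.run(start_command(vm), check=True)
```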

Note: After restarting the sandbox, I got 27 alerts. These resolved themselves once I gave the sandbox 20–30 minutes to warm up and start all its services.
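Rather than refreshing the page by hand during the warm-up, you could poll the sandbox URL until it answers. This is only a rough sketch using Python's standard library: it tells you the web page is responding, not that every alert has cleared. The URL in the usage comment is a placeholder for whatever localhost address your sandbox shows, and the 30-minute limit is just my own warm-up experience.

```python
import time
import urllib.error
import urllib.request


def is_up(url, timeout=5):
    """True if the server at `url` answers with a non-5xx HTTP status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status < 500
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status (e.g. 401 or 503).
        return err.code < 500
    except OSError:
        # Connection refused, timed out, etc. (URLError is an OSError).
        return False


def wait_until_up(url, max_wait_s=1800, interval_s=30,
                  probe=is_up, sleep=time.sleep, clock=time.monotonic):
    """Poll `url` until `probe` succeeds or `max_wait_s` seconds have passed.

    Returns True if the service came up in time, False otherwise.
    """
    deadline = clock() + max_wait_s
    while True:
        if probe(url):
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)


# Usage (replace the placeholder with the localhost URL your sandbox gives you):
# ready = wait_until_up("http://localhost:<port>/")
# print("Sandbox is responding" if ready else "Still not up after 30 minutes")
```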

Hope this guide helps! :)

If you found this article helpful, I would really appreciate it if you could follow my account and give this article a clap! Thank you!!
