Installing Hortonworks Hadoop Sandbox

To perform a proof of concept of how our data warehouse data would run on Hadoop, I decided to try out the Hortonworks Hadoop Sandbox for VMware, that can be downloaded from Hortonworks Sandbox.

Downloading and importing the .ova file into VMWare Fusion is straighforward. During startup, you get a glimpse of what’s installed on this sandbox vm.

For me the most interesting piece of the Hadoop stack is Hive, since it is the Hadoop component that transforms SQL like statements into map-reduce jobs for the Hadoop core.

After startup of the vm, the following screen is presented:

Hortonworks Sandbox console screenWhen going to the URL, the following screen is shown

Screen Shot 2016-07-06 at 14.41.56which links to a tutorial with a IoT use case.

Lets log into Ambari.

Ambari Login screenNow the following screen is shown:

Screen Shot 2016-07-06 at 14.58.01Everything seems to be up and running! 🙂

Next I want to perform the following steps:

  1. create a data model
  2. load data into this model
  3. run some queries
  4. compare performance and features with our current database server PostgreSQL 9.5
  5. load streaming data

This will be covered in future posts.

Advertenties

Geef een reactie

Vul je gegevens in of klik op een icoon om in te loggen.

WordPress.com logo

Je reageert onder je WordPress.com account. Log uit /  Bijwerken )

Google+ photo

Je reageert onder je Google+ account. Log uit /  Bijwerken )

Twitter-afbeelding

Je reageert onder je Twitter account. Log uit /  Bijwerken )

Facebook foto

Je reageert onder je Facebook account. Log uit /  Bijwerken )

Verbinden met %s