Big Data Tutorial 1: MapReduce

Tuesday, February 20, 2018 - 3:00pm - 5:00pm EST

This class will provide a brief overview of Hadoop and the components that make up the Hadoop ecosystem. It includes a hands-on demonstration of how to run basic MapReduce jobs on dumbo, the NYU High Performance Computing Hadoop cluster. Several hands-on exercises are included to reinforce the material.

Prerequisites for this class:

1. An HPC user account is mandatory.
2. Basic knowledge of Unix and of Java or Python is required.

