Session Details

Session Details2019-01-07T06:21:08+00:00

Hadoop Essentials

Presented by: Eric Richardson

Big Data and Cloud platforms have their origins in Hadoop. Learn the fundamentals of HDFS, Map Reduce and Yarn the three core components of Apache Hadoop. You will start a sandbox cluster, interact with HDFS, learn how HDFS saves data and why it does it that way. MapReduce is an important processing paradigm, learn why and explore some of the Computer Science theory behind the technology. Write a simple MapReduce job, old school but effective. YARN is brains behind massive data processing jobs. Learn how it makes decisions and watch it run your MapReduce job.

Tags: Big DataLevel: Intermediate