HDP OVERVIEW: APACHE HADOOP ESSENTIALS

This course provides a technical overview of Apache Hadoop. It includes high-level information about concepts, architecture, operation, and uses of the Hortonworks Data Platform (HDP) and the Hadoop ecosystem.

OBJECTIVES

  • The Case for Hadoop
  • The Hadoop Ecosystem
  • HDFS Architecture
  • Ingesting Data
  • Parallel Processing
  • Apache Hive Overview
  • Apache Pig Overview
  • Apache Spark Overview
  • YARN Architecture
  • Hadoop Security

DEMONSTRATIONS

  • Operational Overview with Ambari
  • Loading Data into HDFS
  • Streaming Data into HDFS
  • Processing with MapReduce
  • Data Manipulation with Hive
  • Risk Analysis with Pig
  • Risk Analysis with Spark
  • Securing Ranger with Hive

To register for the course, please fill out the form.