Course Code: OMEGA-700
Course Title: Big Data Essentials
Duration: 3 Days

The main goals of this course are to:

  • Define Big Data
  • Describe Hadoop, its core components, and the Hadoop ecosystem
  • Use the Hadoop Distributed File System (HDFS)
  • Process big data using MapReduce, YARN, Hive, Pig, Flume, and Spark
  • Apply regression, classification, clustering, and deep learning using Mahout

Module 1: Understanding Big Data

Module 2: Understanding Hadoop

Module 3: Hadoop Distributed File System

Module 4: Hadoop MapReduce Programming in Java

Module 5: Apache Hive

Module 6: Apache Pig

Module 7: Apache Flume

Module 8: Apache Spark

Module 9: NoSQL Databases and Apache Cassandra

Module 10: Apache Kafka

Module 11: Machine Learning Basics

Module 12: Apache Mahout