Course Certificate: Hadoop Platform and Application Framework by University of California, San Diego on Coursera.

5 weeks of study which covers following topics in detail : 

  • Hadoop Stack Basics
  • The Apache Framework:
  • Hadoop "Zoo"
  • Hadoop Ecosystem Major Components
  • Overview of the Hadoop Stack
  • YARN, Tez, and Spark
  • Hadoop-Based Applications : PIG, HIVE & HBASE
  • The Hadoop Distributed File System (HDFS) and HDFS2
  1.                            HDFS Architecture
  2.                           HDFS Tuning Parameters
  3.                           HDFS Performance and Robustness
  4.                           HDFS Access, APIs, and Applications

 

  • MapReduce Framework and YARN
  1.                           Introduction to Map/Reduce
  2.                           The Map/Reduce Framework 
  3.                           Programming Assignment · Running Wordcount with  Hadoop streaming, using Python code
  4.                           Programming Assignment · Joining Data
  • Architecture of Spark
  1.                                    Resilient Distributed Datasets
  2.                                   Spark Transformations
  3.                                   Wide Transformations
  4.                                   Programming Assignment · Simple Join in Spark
  5.                                   Directed Acyclic Graph (DAG) Scheduler
  6.                                   Actions in Spark
  7.                                   Memory Caching in Spark
  8.                                  Broadcast Variables
  9.                                  Accumulators
  10.                                  Programming Assignment · Advanced Join in Spark

Certificate Link:                       

             

 

 

                               

                                 

 

                             

 

 

 

 

To view or add a comment, sign in

More articles by Kumar Satish

Explore content categories