Introduction
  • Introduction to Mapreduce
  • Announcement
  • Traditional approach VS Hadoop approach
  • Basic Flow of a Mapreduce program
  • Mapreduce Program flow with Example
  • Types of File Input formats in Mapreduce
Default structure of various classes in Mapreduce
  • Mapper Class structure
  • Reducer Class structure
  • Driver Class structure
  • Partitioner Class structure
  • Shuffling, Sorting & Partitioning in Detail
  • Hadoop Installation
Word Count program in Mapreduce
  • What are Writables in Hadoop
  • Word Count program in Mapreduce
  • Word count program Code run
  • What is Combiner in Hadoop Mapreduce
  • Implementing Combiner in WordCount Mapreduce program
Set of Mapreduce programs
  • Calculate Sum of Even Odd numbers
  • Calculate success rate of Facebook ads
  • Writables - Create our own datatype in Mapreduce
  • Fraud customers of an Ecommerce website - part 1
  • Fraud customers of an Ecommerce website - part 2
  • Assignment 1
Distributed Cache Implementation
  • What is Distributed Cache and it's uses in Mapreduce framework
  • Using Distributed cache calculate average salary
Dealing with Input Split Class
  • What are Input splits in Hadoop
  • Input split Class in Mapreduce
Multiple Inputs & Output class
  • Multiple Inputs class and its Implementation
  • Multiple Output class and its Implementation
  • Quiz 1
Joins in Mapreduce
  • Pseudo code flow of Joins Mapreduce program
  • Join 2 files in a Mapreduce program
  • Performing Outer Join in Mapreduce
  • What is Map Join and Where it is Used
  • Implementing Map Join in a Mapreduce program
Counters in Mapreduce
  • What are Counters in Hadoop
  • Job Counters
  • Create our own Custom Counters in Mapreduce program
  • Assignment 2
Creating Custom Input Formatter
  • File Input format Class's default structure in Mapreduce
  • Custom Input Formatter Need & Problem statement
  • Create custom Input Format class to read XML file | Part 1
  • Create custom Input Format class to read XML file | Part 2
  • Create custom Input Format class to read XML file | Part 3
  • Quiz 2
Different Types of Files in Hadoop
  • Text, Sequence, Avro Files
  • RC, ORC, Parquet Files
  • Performance Test results of Various Files
  • Which File Format to choose
  • Sequence File Implementation in MapReduce
Chaining in Mapreduce
  • Chain Mapper and its Implementation
  • How to Chain Multiple MR Programs
Case study 1 - Bank Loyal Customers Identification
  • Identifying Bank's Loyal Customers
Case study 2 - Predicting Churn customers
  • Predicting Churn customers | Part 1
  • Predicting Churn customers | Part 2
Case study 3 - Flight data Analysis
  • Flight data Analysis | Part 1
  • Flight data Analysis | Part 2
BONUS
  • Bonus lecture