Introduction (Theory)
  • Introduction to Hive
  • Announcement
  • Motivation of Hive
  • Sql vs Hive
  • Trailer- Working of Hive
  • Architecture of Hive
Hive Basic Commands
  • HADOOP AND HIVE INSTALLATION
  • Create databases
  • Table creation and loading data into it |Part 1
  • Table creation and loading data into it |Part 2
  • Internal vs External table- Explained
  • Create Tables
  • Insert statement
  • Multi insert statement (Advance)
  • Alter Table Schema
  • Sorting -- sort by, order by, distribute by, cluster by
Functions in Hive
  • Date and Mathematical functions
  • String functions
  • Split(), Substr(), instr() functions
  • Conditional statements
  • Explode and Lateral view ( Advance functions)
  • Rlike function (Advance)
  • Rank(), Dense_rank(), Row_number() (Advance)
  • Practice Mathematical Functions
Partitioning in Hive
  • What is Partitioning?
  • Static partitioning
  • Dynamic partitioning
  • Alter Partitioned Table and MSCK Repair command (Advance)
Bucketing in Hive
  • What is Bucketing?
  • Create Bucketed Table
  • Tablesampling (Advance)
  • No_drop, Offline command (Advance)
  • Partitioning
Joins in Hive
  • Inner Joins on 2 Tables
  • Outer Joins on 2 Tables
  • Join 3 Tables in Hive
  • Memory Management & Optimization of Joins
  • Map Joins (Advance)
Views in Hive
  • What are views?
  • Creating Views in different ways
  • Advantages of Views
Indexing (Advance)
  • Creation of Indexes (Compact and Bitmap)
  • Multiple Indexes on same table
  • When and When not to use Indexing
UDF's (User defined functions) Advance
  • What is UDF
  • UDF Implementation - Practical
Table Properties (Advance)
  • Skipping Header and Footer records while loading in table
  • Recommendation
  • Immutable Table property
  • Purge property + Difference between Drop and Truncate
  • Null Format property
  • ACID/Transactional features of Hive ( Advance)
  • ORC Table properties
Configurations & Settings in Hive (Advance)
  • Part 1
  • Part 2
  • Merge files in Hive
  • Parallelism Property
Variables in Hive (Advance)
  • Executing Hive queries from Bash shell
  • Run Unix & Hadoop commands from Hive Shell
  • Variables in Hive (hiveconf & hivevar)
  • Difference between hiveconf & hivevar
  • Using variables in bash shell
  • Substituting value of a Variable
  • QUIZ
Different Types of Files in Hadoop
  • Text, Sequence, Avro Files
  • RC, ORC, Parquet Files
  • Performance Test results of Various Files
  • Which File Format to choose
Custom Input Fomatter (Advance)
  • Using Custom Input Format in Hive
Miscellaneous (Advance)
  • Modes of Hive
  • Compression techniques in Hive
  • Hiverc file (Creation and Uses)
  • Archiving files in Hive
TEZ engine in Hive
  • What is Tez engine and its comparison with MR
Load XML data in Hive
  • How to Load XML data in Hive tables-- Part1
  • How to Load XML data in Hive tables-- Part2
Implementing SCD's in Hive (Advance)
  • What is Slowly changing Dimension (SCD)
  • Implement SCD 1 in Hive--part1
  • Implement SCD 1 in Hive--part2
Use Cases asked in Interviews
  • Word count in Hive
  • Multiple Tables on a single file