Introduction to Big Data and Hadoop
  • What is Big Data?
  • Classification of Big Data.
  • Characteristics of Big Data.
  • Challenges Associated with Big Data.
  • Traditional Approach of Storing and Processing Big Data.
  • What is Hadoop?
  • Features of Hadoop.
  • How Does Hadoop Work?
  • What is the Hadoop Ecosystem?
Exploring HDFS
  • What is HDFS?
  • HDFS Daemons: NameNode and DataNode Explained.
  • HDFS Daemons: Secondary NameNode Explained.
  • How Does HDFS Manage Filesystem Metadata?
  • Reading Data from HDFS.
  • Writing Data to HDFS.
  • What is Rack Awareness?
  • NameNode Federation Explained.
Exploring MapReduce
  • What is MapReduce?
  • MapReduce Daemons: JobTracker and TaskTracker Explained.
  • What is YARN?
Understanding MapReduce Programming Framework
  • How Does MapReduce Work?
  • What is a Key/Value Pair?
  • Hadoop Java APIs Explained.
Installing a Single-Node, Pseudo-Distributed Hadoop Cluster on a Local Machine
  • Understanding Hadoop Operating Modes.
  • Creating a Virtual Machine in VMware.
  • Downloading Linux.
  • Installing Linux.
  • Configuring Linux.
  • Downloading and Installing Hadoop.
  • Configuring Hadoop and Starting Up the Hadoop Cluster.
Installing a Multi-Node, Fully Distributed Hadoop Cluster on a Local Machine
  • Setting Up a Multi-Node Apache Hadoop Cluster on a Local Machine, Part 1.
  • Setting Up a Multi-Node Apache Hadoop Cluster on a Local Machine, Part 2.
  • Setting Up a Multi-Node Apache Hadoop Cluster on a Local Machine, Part 3.
  • Setting Up a Multi-Node Apache Hadoop Cluster on a Local Machine, Part 4.
Amazon EC2 Basics
  • Setting Up and Connecting to an Amazon EC2 Instance.
Maintaining a Hadoop Cluster
  • Finding Data for Practice.
  • Starting and Stopping Hadoop Daemons.
  • Adding a Node to the Cluster.
  • Removing a Node from the Cluster.
  • Checking the Hadoop Distributed File System.
  • Exploring Commands in Hadoop.
  • Adding a TaskTracker.
  • Removing a TaskTracker.
  • Importing Data into HDFS.
  • Exporting Data out of HDFS.
  • Copying Data Using DistCp.
  • Balancing a DataNode.
  • Turning Safe Mode On and Off.
  • Hadoop Commands Reference Guide.
Programming with MapReduce
  • Setting Up Development Environment.
  • More to Follow.