Module 1
  • 1.1 Overview of Big Data
  • 1.2 Introduction to Apache Hadoop
  • 1.3 Hadoop Distributed File System
  • 1.4 Hadoop MapReduce
Module 2
  • 2.1 Downloading VirtualBox
  • 2.2 Downloading CentOS
  • 2.3 How to set up a machine in VirtualBox
  • 2.4 How to add the CentOS ISO image
  • 2.5 Start Your Machine
  • 2.6 Start the machines and enable communication between them
  • 2.7 How to install Java in VirtualBox
  • 2.8 Removing an existing Java Version
  • 2.9 Add Group and User
  • 2.10 Generating an SSH Key
  • 2.11 Distributing the Keys
  • 2.12 Setting Java
  • 2.13 Hadoop Path Setting
Module 3
  • 3.1 Starting With Hadoop
  • 3.2 Changing Conf files
  • 3.3 Format Namenode
  • 3.4 Starting the Daemons
  • 3.5 Checking the File System
  • 3.6 Creating a Directory and Putting Data
  • 3.7 Drawback of Storing Data in tmp
  • 3.8 Creating a Parent Directory to Permanently Store the Data
  • 3.9 Start Admin Commands
  • 3.10 Browser Interface
Module 4
  • 4.1 Setting the Pseudo-Distributed Mode
  • 4.2 Viewing the Fully Distributed Mode
  • 4.3 Using a Script file to Monitor the Cluster
  • 4.4 Viewing Under-Replicated Blocks
  • 4.5 Setting the Third Machine
  • 4.6 Bringing up the Third Machine
  • 4.7 Over-Replicated Blocks
  • 4.8 Commissioning and Decommissioning
Module 5
  • 5.1 Metasave Command
  • 5.2 Dynamically Write Data with Different Replication
  • 5.3 Rack Awareness
  • 5.4 Enabling Rack Awareness
  • 5.5 Default Rack
  • 5.6 Working of Secondary Namenode
  • 5.7 Secondary Namenode Working in the Cluster Setup
  • 5.8 Changes in the Cluster: Viewing the fsimage
  • 5.9 Shutting down the namenode
  • 5.10 Manually Talking to namenode
  • 5.11 Safemode in Hadoop
  • 5.12 Using Namespace
  • 5.13 Commissioning and Decommissioning Nodes
  • 5.14 Settings for Commissioning and Decommissioning
  • 5.15 Decommissioning of Nodes
  • 5.16 Commissioning in Hadoop
Module 6
  • 6.1 Balancer
  • 6.2 Backing up the data
  • 6.3 Backing up the Data continued
  • 6.4 Restore the Data
  • 6.5 Deleting Data Permanently
  • 6.6 Distcp Introduction
  • 6.7 Working with Distcp
  • 6.8 Distcp across the Cluster
  • 6.9 Distcp Log Files
Module 7
  • 7.1 Namenode Crashes
  • 7.2 Corrupt and Missing Blocks
  • 7.3 Starting the Second Datanode
  • 7.4 Starting Namenode on old machine
  • 7.5 Working with Updated Metadata
  • 7.6 Multiple Paths to run the cluster
  • 7.7 Introduction to Network File System
  • 7.8 Start NFS
  • 7.9 Start the cluster
  • 7.10 Deleting Primary Path of Namenode
  • 7.11 Getting metadata from secondary to primary location
  • 7.12 Starting cluster Normally