Module 1
  • 1.1 Data Processing in Hadoop
  • 1.2 HDFS in Hadoop
Module 2
  • 2.1 Download Virtual Box
  • 2.2 Downloading CentOS
  • 2.3 How to setup a machine in VirtualBox
  • 2.4 How to add CentOS ISO image
  • 2.5 Start Your Machine
  • 2.6 Start machines and make way to communicate
  • 2.7 How to install Java in VirtualBox
  • 2.8 Download Hadoop
  • 2.9 Add Group and User
  • 2.10 generating ssh key
  • 2.11 distributing the keys
  • 2.12 setting java
  • 2.13 Hadoop path setting
Module 3
  • 3.1 Starting With hadoop
  • 3.2 Changing Conf files
  • 3.3 Format Namenode
  • 3.4 Starting the daemons
  • 3.5 Checking the file system
  • 3.6 Creating a Directory and putting data
  • 3.7 Demerit to store data in tmp
  • 3.8 Start Admin commands
  • 3.9 Browser Interface
Module 4
  • 4.1 Setting the pesudo dist mode
  • 4.2 viewing the fully distributed mode
  • 4.3 Using a Script file to monitor the cluster
  • 4.4 Viewing under replicated blocks
  • 4.5 Setting the third machine
  • 4.6 Bringing up the third machine
  • 4.7 OverReplicated Blocks
  • 4.8 Commisioning and Decommisioning
Module 5
  • 5.1 Metasave Command
  • 5.2 Dynamically Write Data with different Replication
  • 5.3 Rack Awareness
  • 5.4 Enabling Rack Awareness
  • 5.5 default Rack
  • 5.6 Working of Secondary Namenode
  • 5.7 Secondary namenode working in the cluster setup
  • 5.8 Changes in Cluster viewing the fsimage
  • 5.9 Shutting down the namenode
  • 5.10 manually talking to namenode
  • 5.11 Safemode in hadoop
  • 5.12 Using Namespace
  • 5.13 Commissioning and decommissioning Nodes
  • 5.14 setting for commissioning decommissioning
  • 5.15 Decomissioning of Nodes
  • 5.16 Commissioning in Hadoop
Module 6
  • 6.1 Balancer
  • 6.2 Backing up the data
  • 6.3 Backing up the data continued
  • 6.4 Restore the data
  • 6.5 Deleted data Permanently
  • 6.6 Working with Distcp
  • 6.7 Distcp across the cluster
  • 6.8 Disctcp log files
Module 7
  • 7.1 Namenode Crashes
  • 7.2 Corrupt and Missing Blocks
  • 7.3 Starting the second datanode
  • 7.4 Starting Namenode on old machine
  • 7.5 Starting Namenode on old machine II
  • 7.6 Working with Updated Metadata
  • 7.7 Multiple Paths to run the cluster
  • 7.8 NFS Settings
  • 7.9 Start NFS
  • 7.10 Start the cluster
  • 7.11 Deleting Primary Path of Namenode
  • 7.12 Getting metadat from secondary to primary location
  • 7.13 Starting cluster Normally