- Course Introduction
What you'll learn
- Importance of hadoop framework in BigData analytics
- Understanding Hadoop Framework in detail
- Hands on experience on data ingestion techniques : Apache Sqoop and Apache Flume
- Hands on experience on MapReduce Programming and its hidden concepts
- Hands on experience on Apache Hive Programming, Performance tuning, UDF's
- Understand and work with Pig
- Realtime data streaming analysis with Apache Spark and its ecosystems
- Understand and work with Apache Kafka
- Process workflow automation using Oozie
- Understand and work with MongoDb
- Case Studies , practical explanations and Interview Questions
Description
Data Analytics is the practice of using data to drive business strategy and performance. It includes a range of approaches and solutions, from looking backward to evaluate what happened in the past to looking forward to do scenario planning and predictive modelling.Data Analytics spans all of the functional businesses to address a continuum of opportunities in Information Management, Performance Optimisation and Analytic Insights. Organizations now realize the inherent value of transforming these big data into actionable insights. Data science is the highest form of big data analytics that produce the most accurate actionable insights, identifying what will happen next and what to do about it.
Hadoop is an open-source software framework for storing data and running applications on clusters of commodity hardware. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs. Hadoop is not just an effective distributed storage system for large amounts of data, but also, importantly, a distributed computing environment that can execute analyses where the data is.
In this course, detailed explanation about hadoop framework and its ecosystems has been provided. All the concepts are explained in detail with examples and business use cases as case studies.Also, latest technologies in big data area like apache spark, apache kafka, Mongo DB are explained. In addition, Interview questions with respect to each ecosystem and resume preparation tips are included.
Students also bought
Información sobre el Instructor

- 3.75 Calificación
- 1324 Estudiantes
- 1 Cursos
Srikanth Gorripati
Software Developer
• 8+ years of IT industry experience encompassing wide range of skill set.
• 6+ years of experience in working with Big Data Technologies on system which comprises of several applications, highly distributive, massive amount of data using HortonWorks, Cloudera Hadoop distributions
• MapR certified hadoop developer , Oracle certified Java Programmer, PSM1 Certified and Neo4j Certified Graph Developer
• Experience in Agile Development, application design, software development and testing
• Strong expertise on Big Data and Cloud related technologies , tools and frameworks.
Student feedback
Course Rating
Reviews
I am so glad that I took this course, it was really helpful , great introduction to the tool and all the datawarehousing concepts, and I would like to review the content once again after having some more hands on experience on the tool itself. Instructor is highly experienced and highly capable of teaching what he knows.
Very good teacher who knows what hes talking about and explains it in a very clearly manner. This course is good value for the money I've paid. Thanks Nilanka
Serious accent! Should consider recording native English speakers.