HADOOP Online Training Course Content

Duration: 5-6 Weeks

Modulue-1
  •     An Introduction to Big Data
  •     The explosion of Data
  •     Big Data Characteristics
  •     BI Vs Big Data
  •     Big Data Applications
  •     Big Data Framework
  •     Big Data Vendor information
Module-2
  •     Big Data Approaches
  •     Approaches to Process Big Data (Phases)
  •     Traditional Database analytics
  •     Massive Parallel Processing
  •     Cloud Computing
  •     Grid Computing
  •     Map Reduce
  •     What problems exist with ‘traditional’ large-scale computing Systems?
  •     What requirements an alternative approach should have?
  •     How Hadoop addresses those requirements?
Module-3
  •     The Hadoop Basic concepts
  •     What Hadoop is
  •     What features the Hadoop Distributed File System (HDFS) provides
  •     The concepts behind MapReduce
  •     How a Hadoop cluster operates
  •     What other Hadoop Ecosystem projects exist
Module-4
  •     Introduction to Map Reduce
  •     MapReduce Architecture
  •     Fault tolerance in MapReduce
  •     Sample applications
  •     Getting started with Hadoop
  •     Higher-level languages on top of Hadoop: Pig and Hive
Module-5
  •     Integrating Hadoop Into The Workflow
  •     Introduction
  •     Relational Database Management Systems
  •     Storage Systems
  •     Importing Data From RDBMSs With Sqoop
  •     Hands-On Exercise
  •     Importing Real-Time Data With Flume
  •     Accessing HDFS Using FuseDFS and Hoop
  •     Conclusion
Module-6
  •     An Introduction to Pig
  •     What is Pig?
  •     Pig’s Features
  •     Pig Use Cases
  •     Interacting with Pig
  •     Essential Points
  •     Conclusion
Module-7
  •     Basic Data Analysis with Pig
  •     Basic Syntax of Pig Latin
  •     How to load and store data using Pig
  •     Which datatypes Pig uses to represent data
  •     How to sort and filter data in Pig
  •     How to use many of Pig’s built in functions for data processing
  •     Processing Complex Data with Pig
  •     How Pig uses bags,tuples,and maps to represent complex data.
  •     The Techniques Pig provides to for grouping and ungrouping data.
  •     How to use aggregate functions in Pig Latin
  •     How to iterate through records in complex data structures.
  •     Multi Dataset Operations with Pig
  •     How we can use grouping to combine data from multiple sources.
  •     What types of join operators Pig supports and how to use them.
  •     How to concatenate records to produce a single data set.
  •     How to split a single data set into multiple relations
Module-8
  •     Introduction to Hive
  •     What is Hive
  •     How Hive differs from a relational database.
  •     Ways in which organizations use Hive.
  •     How to invoke and interact with Hive.
  •     Relational Data with Hive
  •     How to explore database and tables in Hive.
  •     How HiveQL syntax compared with SQL
  •     Which datatypes Hive supports.
  •     Join operators in Hive
  •     Built in functions in Hive.
  •     Hive Data Management
  •     How Hive encodes and stores data
  •     How to create Hive databases,tables and views.
  •     How to load data into tables
  •     How to alter and remove tables
  •     How to save query results into tables and files.
  •     How to control access to data in Hive
  •     Important String Functions
Module-9
  •     What is HBase?
  •     When To Use HBase?
  •     HBase Data Model
  •     HBase Logical View
  •     HBase Physical Model
  •     Major Components Of HBase
  •     HBase Big Picture
  •     Compaction
  •     HBase Shell
  •     HBase Useful Commands
  •     Hand’s On exercise
  1. Send us which course you want to join.
Have a Query?