SAP HADOOP Online Training Course Content

SAP HADOOP Online Training

SAP HADOOP Online Training

SAP HADOOP Online Training Overview

Introduction to BigData, Hadoop:-

  • Big Data Introduction
  • Hadoop Introduction
  • What is Hadoop? Why Hadoop?
  • Hadoop History?
  • Different types of Components in Hadoop?
  • HDFS, MapReduce, PIG, Hive, SQOOP, HBASE, OOZIE,
  • Flume, Zookeeper and so on…
  • What is the scope of Hadoop?

Deep Drive in HDFS (for Storing the Data):-

  1. Introduction of HDFS
  2. HDFS Design
  3. HDFS role in Hadoop
  4. Features of HDFS
  5. Daemons of Hadoop and its functionality
  • Name Node
  • Secondary Name Node
  • Job Tracker
  • Data Node
  • Task Tracker
  1. Anatomy of File Wright
  2. Anatomy of File Read
  3. Network Topology
  • Nodes
  • Racks
  • Data Center
  1. Parallel Copying using DistCp
  2. Basic Configuration for HDFS
  3. Data Organization
  • Blocks and
  • Replication
  1. Rack Awareness
  2. Heartbeat Signal
  3. How to Store the Data into HDFS
  4. How to Read the Data from HDFS
  5. Accessing HDFS (Introduction of Basic UNIX commands)
  6. CLI commands

Map Reduce using Java (Processing the Data):-

  1. he introduction of Map Reduce.
  2. Map Reduce Architecture
  3. Data flow in Map Reduce
  • Splits
  • Mapper
  • Portioning
  • Sort and shuffle
  • Combiner
  • Reducer
  1. Understand Difference Between Block and Input Split
  2. Role of Record Reader
  3. Basic Configuration of Map Reduce
  4. Map Reduce life cycle
  • Driver Code
  • Mapper
  • and Reducer
  1. How Map Reduce Works
  2. Writing and Executing the Basic Map Reduce Program using Java
  3. Submission & Initialization of Map Reduce Job.
  4. File Input/Output Formats in Map Reduce Jobs
  • Text Input Format
  • Key Value Input Format
  • Sequence File Input Format
  • NLine Input Format
  1. Joins
  • Map-side Joins
  • Reducer-side Joins
  1. Word Count Example
  2. Partition Map Reduce Program
  3. Side Data Distribution
  • Distributed Cache (with Program)
  1.  Counters (with Program)
  • Types of Counters
  • Task Counters
  • Job Counters
  • User Defined Counters
  • Propagation of Counters
  1. Job Scheduling

PIG:

  1. Introduction to Apache PIG
  2. Introduction to PIG Data Flow Engine
  3. Map Reduce vs. PIG in detail
  4. When should PIG use?
  5. Data Types in PIG
  6. Basic PIG programming
  7. Modes of Execution in PIG
  • Local Mode and
  • Map Reduce Mode
  1. Execution Mechanisms
  • Grunt Shell
  • Script
  • Embedded
  1. Operators/Transformations in PIG
  2. PIG UDF’s with Program
  3. Word Count Example in PIG
  4. The difference between the Map Reduce and PIG

SQOOP:-

  1. Introduction to SQOOP
  2. Use of SQOOP
  3. Connect to mySql database
  4. SQOOP commands
  • Import
  • Export
  • Eval
  • Codegen etc…
  1. Joins in SQOOP
  2. Export to MySQL
  3. Export to HBase

HIVE:-

  1. Introduction to HIVE
  2. HIVE Meta Store
  3. HIVE Architecture
  4. Tables in HIVE
  • Managed Tables
  • External Tables
  1. Hive Data Types
  • Primitive Types
  • Complex Types
  1. Partition
  2. Joins in HIVE
  3. HIVE UDF’s and UADF’s with Programs
  4. Word Count Example

HBASE:

  1. introduction to HBASE
  2. Basic Configurations of HBASE
  3. Fundamentals of HBase
  4. What is NoSQL?
  5. HBase Data Model
  6. Table and Row
  7. Column Family and Column Qualifier
  8. Cell and its Versioning
  9.  Categories of NoSQL Data Bases
  • Key-Value Database
  • Document Database
  • Column Family Database
  1. HBASE Architecture
  • HMaster
  • Region Servers
  • Regions
  • MemStore
  • Store
  1. SQL vs. NOSQL
  2. How HBASE is differed from RDBMS
  3. HDFS vs. HBase
  4. Client-side buffering or bulk uploads
  5. HBase Designing Tables
  6. HBase Operations
  • Get
  • Scan
  • Put
  • Delete

MongoDB:-

  • What is MongoDB?
  •  Where to Use?
  • Configuration On Windows
  • Inserting the data into MongoDB?
  • Reading the MongoDB data.

Cluster Setup:-

  • Downloading and installing the Ubuntu12.x
  • Installing Java
  • Installing Hadoop
  • Creating Cluster
  • Increasing Decreasing the Cluster size
  • Monitoring the Cluster Health
  • Starting and Stopping the Nodes

Zookeeper

  • Introduction Zookeeper
  • Data Modal
  • Operations

OOZIE

  • Introduction to OOZIE
  • Use of OOZIE
  • Where to use?

Flume

  1. Introduction to Flume
  2. Uses of Flume
  3. Flume Architecture
  • Flume Master
  • Flume Collectors
  • Flume Agents