Hadoop Developer Course Content Click Here For Enquiry
Understanding Big Data
- Introduction/Installation - Hadoop Custom VM(Single Node)
- Understanding Big Data
- 3V (Volume-Variety-Velocity) Characteristics
- Structured and Unstructured Data
- Application and use cases of Big Data
- Limitations of Traditional Large Scale Systems
- How a distributed way of computing is superior (cost and scale)
HDFS
- HDFS Overview and Architecture
- Data Replication
- Safe Mode
- Name Node
- Checkpoint Node
- Backup Node
- Configuration Files
HDFS Data Flows
- Read
- Write
HDFS Commands
- File System
- Administrative
Advanced HDFS Features
- HDFS Federation
- HDFS High Availability
MapReduce Overview
- Functional Programming Paradigms
- Input and Output Formats
- Hadoop Data Types
MapReduce Overview
- Input Splits
- Shuffling
- Sorting
- Partitioning
- Configuration Files
- Distributed Cache
MR Alogrithm and Data Flow
- WordCount
MapReduce Architecture
- Legacy MR
- Next Generation MapReduce (aka YARN/MRV2)
Developing and Deploying MR
Programs
- Standalone Mode
- Hadoop Streaming
Optimizers
- Combiners
- JVM Reuse
- Compression
MR Best Practice and Debugging
Fundamental MR Algorithms
(Non-Graph)
- Student Database
- Max Temperature
Higher Level Abstractions for
MapReduce – 1
- Pig Introduction
- Pig Latin Language Constructs
- Pig User Defined Functions
- Pig Use Cases
Higher Level Abstractions for
MapReduce - 2
- Hive - Introduction
- Hive QL
- Hive User Defined Functions
- Hive Use Cases
NOSQL Databases
NoSQL Concepts
- Review of RDBMS
- Need for NOSQL
- Brewers CAP Theorem
- ACID vs BASE
Different Types of NoSQL Databases
- Key Value
- Columnar
- Document
- Graph
- Columnar Databases
Hadoop Ecosystem
- HBASE vs Cassandra
- HBASE Architecture
- HBASE Data Modeling
- HBASE Commands
- HBASE Coprocessors - Endpoints
- HBASE Coprocessors - Observers
- SQOOP
- Flume & OOZIE