‹ Back To Training

Hadoop for Developers

Timeline: 4 Days

Topics

Expand All › ‹ Collapse All

  • Big Data Introduction
  • History
  • Comparison to Relational Databases
  • Hadoop Ecosystem
  • Architecture/Concepts
  • Access
  • Namenodes
  • Filesystem Shell
  • Accessing HDFS with Java
  • Reading/Writing/Browsing File System
  • Basic HDFS Admin
  • Overview
  • Architecture
  • Data Model
  • Installation and Shell
  • Access via Java API
  • Scan API
  • Filters
  • Storage Model
  • Table Design
  • Introduction
  • Processing Model
  • Command line tools
  • MapReduce Framework
  • Submitting MapReduce Jobs
  • Writing MapReduce Jobs in Java
  • MapReduce Theory
  • Distributive Cache
  • Speculative Execution
  • YARN Components
  • Counters
  • Details of MapReduce Job Execution
  • Implementing a Streaming Job
  • Counters in Streaming Jobs
  • Contrast with Java Jobs
  • Problem Decomposition into MapReduce Jobs
  • Coding Workflows
  • Using the JobControl Class
  • Installation
  • Writing Oozie Workflows
  • Deploying and Running Oozie Jobs
  • Installation
  • Pig Latin
  • Writing Pig Scripts
  • User Defined functions
  • Data Set Joins
  • Installation
  • Table Creation and Deletion
  • Partitioning
  • Loading Data into Hive
  • Joins
  • Bucketing