Java – TechSoft

Java

Developer--Java

Day 1 Objective

  • Describe Hadoop 2.X and the Hadoop Distribute File System
  • Describe the YARN framework
  • Describe the Purpose of NameNodes and Data Nodes
  • Describe the Purpose of HDFS High Availability (HA)
  • Describe the Purpose of the Quorum Journal Manager
  • List Common HDFS Commands
  • Describe the Purpose of YARN
  • List Open-Source YARN Use Cases
  • List the Components of YARN
  • Describe the Life Cycle of a YARN Application
  • Define Map Aggregation
  • Describe the Purpose of Combiners
  • Describe the Purpose of In-Map Aggregation
  • Describe the Purpose of Counters
  • Describe the Purpose of User-Defined Counters

Day 1 Demonstrations

  • Demonstration: Understanding Block Storage
  • Configuring a Hadoop Development Environment
  • Putting Files in HDFS with Java
  • Demonstration: Understanding Map Reduce
  • Word Count
  • Distributed Grep
  • Inverted Index
  • Using a Combiner
  • Computing an Average

Enquire Now

Day 1 Objective

  • Describe Hadoop 2.X and the Hadoop Distribute File System
  • Describe the YARN framework
  • Describe the Purpose of NameNodes and Data Nodes
  • Describe the Purpose of HDFS High Availability (HA)
  • Describe the Purpose of the Quorum Journal Manager
  • List Common HDFS Commands
  • Describe the Purpose of YARN
  • List Open-Source YARN Use Cases
  • List the Components of YARN
  • Describe the Life Cycle of a YARN Application
  • Define Map Aggregation
  • Describe the Purpose of Combiners
  • Describe the Purpose of In-Map Aggregation
  • Describe the Purpose of Counters
  • Describe the Purpose of User-Defined Counters

Day 1 Demonstrations

  • Demonstration: Understanding Block Storage
  • Configuring a Hadoop Development Environment
  • Putting Files in HDFS with Java
  • Demonstration: Understanding Map Reduce
  • Word Count
  • Distributed Grep
  • Inverted Index
  • Using a Combiner
  • Computing an Average

Day 2 Objective

  • Describe the Purpose of a Partitioner
  • List the Steps for Writing a Custom Partitioner
  • Describe How to Create and Distribute a Partition File
  • Describe the Purpose of Sorting
  • Describe the Purpose of Custom Keys
  • Describe How to Write a Group Comparator
  • List the Built-In Input Formats
  • Describe the Purpose of Input Formats
  • Define a Record Reader
  • Describe How to Handle Records that Span Splits
  • List the Built-In Output Formats
  • Describe How to Write a Custom Output Format
  • Describe the Purpose of the MultipleOutputs Class

Day 2 Demonstrations

  • Writing a Custom Partitioner
  • Using TotalOrderPartitioner
  • Custom Sorting
  • Demonstration: Combining Input Files
  • Processing Multiple Inputs
  • Writing a Custom Input Format
  • Customizing Output
  • Working with a Simple Moving Average

Day 3 Objective

  • Describe the Purpose of a RawComparator
  • Describe the Purpose of Localization
  • List Scenarios for Performing Joins in MapReduce
  • Describe the Purpose of the Bloom Filter
  • Describe the Purpose of MRUnit and the MRUnit API
  • Describe How to Set Up a Test
  • Describe How to Test a Mapper
  • Describe How to Test a Reducer
  • Describe the Purpose of HBase
  • Define the Differences Between a Relational Database and HBase
  • Describe the HBase Architecture
  • Demonstrate the Basics of HBase Programming
  • Describe an HBase MapReduce Applications

Day 3 Demonstrations

  • Using Data Compression
  • Defining a RawComparator
  • Performing a Map-Side Join
  • Using a Bloom Filter
  • Unit Testing a MapReduce Job
  • Importing Data to HBase
  • Creating an HBase Mapreduce Jo

Day 4 Objective

  • Describe the Purpose of the FOREACH GENERATE Operator
  • Describe the Purpose of Pig User Defined Functions (UDFs)
  • Describe the Purpose of Filter Functions
  • Describe the Purpose of Accumulator UDFs
  • Describe the Purpose of Algebraic Functions
  • Describe the Purpose of Apache Hive
  • Describe the Differences Between Apache Hive and SQL
  • Describe Apache Hive Architecture
  • Describe How to Load Data Into Hive
  • Demonstrate How to Perform Queries
  • Describe the Purpose of Hive User Defined Functions (UDFs)
  • Write a Hive UDF
  • Describe the Purpose of HCatalog
  • Describe the Purpose of Apache Oozie
  • Describe How to Define an Oozie Workflow
  • Describe Pig and Hive Actions
  • Describe How to Define an Oozie Coordinator Job

Day 4 Demonstrations

  • Demonstration: Understanding Pig
  • Writing a Pig UDF
  • Writing a Pig Accumulator
  • Writing a Apache Hive UDF
  • Defining an Oozie Workflow
  • Working with TF-IDF and the JobControl Class

Course Pre-Requisite

Students must have experience developing Java applications and using a Java IDE. Labs are completed using the Eclipse IDE and Gradle. No prior Hadoop knowledge is required.

 

Course Calendar

Start Date End Date Duration Locaton Register Now
25th July 2019 28th July 2019 4 Days Pune, Bangalore Register Now
28th Aug 2019 31th Aug 2019 4 Days Pune, Bangalore Register Now
19th Sep 2019 22nd Sep 2019 4 Days Pune, Bangalore Register Now

Course Calendar

Stat Date

End Date Duration Location
25th July 2019 28th July 2019 4 Days Pune, Bangalore
Stat Date

End Date Duration Location
28th Aug 2019 31st Aug 2019 4 Days Pune, Bangalore
Stat Date

End Date Duration Location
19th Sep 2019 22nd Sep 2019 4 Days Pune, Bangalore

Click Here For More Big Data Related Services.