Apache Hadoop 2.0: Developing Applications with the Hortonworks Data Platform using Java

Training a team? Use a QA Skills Licence and makes better use of your budget

Course type Essentials (What does this mean?)

Course details
Course title Apache Hadoop 2.0: Developing Applications with the Hortonworks Data Platform using Java
Delivery method Classroom Classroom
Days/Duration 4
Code HWAHDJAV2

Secure online payment

dates, pricing & booking
course description
blogs

Print course outline | Download as PDF document | Link to page: www.qa.com/HWAHDJAV2

Course dates

Currently scheduled dates for this training course
Location AUG SEP OCT NOV View later dates

London

International House, E1W

19 17 23 show prices/book

Overview

This 4 days hands-on training course takes a deep-dive into developing Java MapReduce applications for Big Data deployed on the Hadoop Distributed File System (HDFS).

Students who attend this course will learn how to harness the power of Apache Hadoop™ and MapReduce to manipulate, analyze and perform computations on their Big Data.

Prerequisites

This course assumes students have experience developing Java applications and using a Java IDE. Labs are completed using the Eclipse IDE and Maven

Target Audience

Experienced Java developers responsible for developing MapReduce applications and performing analysis of Big Data stored on Apache Hadoop

Delegates will learn how to

  • Write a Java MapReduce application using Eclipse and Maven
  • Develop a Combiner to perform map aggregation
  • Customize input and output formats of a MapReduce job
  • Compute mathematical computations on your Big Data files
  • Use best practices to optimize MapReduce jobs
  • Create JUnit tests for a MapReduce job
  • Discover trends in your Big Data
  • Define an Oozie workflow
  • Access Apache HBase™ data from a Java MapReduce job
  • Write custom, user-defined functions for Apache Pig™ and Apache™ Hive

Lab Content

Students will work through the following exercises using Eclipse, Maven and the Hortonworks Data Platform:

  • Configuring a Hadoop ™ Development Environment
  • Word Count
  • Distributed Grep
  • Inverted Index
  • Using a Combiner
  • Computing an Average
  • Writing a Custom Partitioner
  • Using a TotalOrderPartitioner
  • Custom Sorting
  • Writing a Custom InputFormat
  • Customizing Output
  • Simple Moving Average
  • Using Data Compression
  • Defining a RawComparator
  • A Map-Side Join
  • Using a Bloom Filter
  • Unit Testing
  • Defining an Oozie Workflow
  • Term Frequency-Inverse Document Frequency (TF-IDF)
  • Accessing HBase from Java MapReduce
  • Writing a User-Defined Pig Function
  • Writing a User-Defined Hive Function

related blogs

Are you ready for End of Life for Windows Server 2003

Posted by Paul Gregory on 14 July 2014

It has been well documented that Windows Server 2003 will have support withdrawn on the 15th July 2015.

The benefits of the Cloud and Amazon Web Services (AWS)

Posted by on

If you read the tech press, you would think absolutely everybody was moving to the cloud. But is that just hype, or is it really true? And if it’s true, what benefits are they getting from it?

SP13IE10Issue

Posted by John Day on 15 May 2014

SharePoint 2013 and Internet Explorer 10 have a stormy relationship. I think it's time for marriage guidance counselling.

App-V 4.x to 5.0 Package conversion: Fixing the broken Pipeline!

Posted by on

The App-V 5.0 package format is very different from the previous 4.5/4.6 version, and the App-V 5.0 client is not compatible with the earlier package versions. To help protect your sequencing investment, Microsoft included two PowerShell commands on the sequencer to aid in migration: Test-AppVLegacyPackage and ConvertFrom-AppVLegacyPackage. The first tests the old package for known constraints, while the second attempts to convert the package to the new format

Top 20 Photoshop Shortcuts

Posted by Richard O'Brien on 21 November 2013

One of the things we're regularly asked on courses is "is there a quicker way to do xyz?" Very often the answer is a resounding 'yes'. So, I thought with this post I'd cover my favourite (and most commonly used) top 20 shortcuts when working with Adobe Photoshop (either in Creative Suite or Creative Cloud).

SP13PermissionsConcern

Posted by on

Beware of Geeks bringing gifts. The Site Members have more power in SharePoint 2013 than you may want them to.

See all related blogs

top of page
  • Amazon logo
  • Apple logo
  • AppSense logo
  • cisco logo
  • citrix logo
  • compTIA logo
  • ec council logo
  • Hortonworks CTP logo
  • microsoft gold logo
  • novell logo
  • oracle logo
  • Pya -winner -2013 logo
  • redhat logo
  • Salesforce logo
  • symantec logo
  • vmware logo
  • compTIA logo
  • novell logo
  • symantec logo