Apache Hadoop 2.0: Developing and Operating Hortonworks Data Platform On Windows

Training a team? Use a QA Skills Licence and makes better use of your budget

Course type Essentials (What does this mean?)

Course details
Course title Apache Hadoop 2.0: Developing and Operating Hortonworks Data Platform On Windows
Delivery method Classroom Classroom
Days/Duration 4
Code HWAHDOW2

Secure online payment

dates, pricing & booking
course description
blogs

Print course outline | Download as PDF document | Link to page: www.qa.com/HWAHDOW2

Course dates

Currently scheduled dates for this training course
Location JUL AUG SEP OCT View later dates

London

International House, E1W

22 show prices/book

Overview

Students will learn to develop applications and analyze big data stored in Apache Hadoop running on Microsoft Windows.

Students will learn the details of the Hadoop Distributed File System (HDFS ™) architecture and MapReduce framework, as well as learn how to develop applications on Hadoop® using tools like C#, Pig ™, Hive™, HCatalog, Sqoop, Oozie and Microsoft Excel.

Prerequisites

Students should have programming experience, preferably with Visual Studio and SQL, as well as familiarity with the Windows Server operating system. No prior Hadoop knowledge is required.

Target Audience

.NET Developers and Data Analysts responsible for developing applications and performing analysis on big data using the Hortonworks Data Platform for Windows.

Delegates will learn how to

  • Explain the various tools and frameworks in the Hadoop ecosystem
  • Recognize use cases for HDP for Windows and Big Data
  • Explain the architecture of the Hadoop Distributed File System (HDFS™)
  • Transfer data between HDFS and Microsoft SQL Server using Sqoop
  • Explain the architecture of MapReduce
  • Run a MapReduce job on Hadoop
  • Use Hadoop streaming
  • Use the Microsoft .NET API for Hadoop to write a C# MapReduce job
  • Recognize use cases for Pig
  • Write a Pig script to explore and transform data in HDFS
  • Define advanced Pig relations
  • Use Pig to apply structure to unstructured Big Data
  • Join large datasets using Pig
  • Invoke a Pig User-Defined Function
  • Write a Hive query using Hive QL
  • Understand how Hive tables are defined and implemented
  • Use Hive to run SQL-like queries to perform data analysis
  • Explain the uses and purpose of HCatalog
  • Use an HCatalog schema within a Pig script
  • Explain the purpose of the Hive ODBC driver
  • Connect Microsoft Excel to HDFS using Hive ODBC
  • Import Hive query results into Excel
  • Explain the usages of Oozie
  • Write and execute an Oozie workflow

Lab Content

Students will work through the following lab exercises using the Hortonworks Data Platform for Windows:

  • Access HDFS using the HDFS commands
  • Import SQL Server data into HDFS using Sqoop
  • Export HDFS data from HDFS into SQL Server using Sqoop
  • Run a MapReduce Job
  • Monitor a MapReduce Job
  • Develop a .NET MapReduce application in C#
  • Explore data using Pig
  • Split and join datasets using Pig
  • Transform unstructured for use with Hive
  • Analyze Big Data with Hive
  • Understanding MapReduce with Hive
  • Joining datasets with Hive
  • Use HCatalog with Pig
  • Use Hive ODBC with Microsoft Excel
  • Define an Oozie Workflow

related blogs

Are you ready for End of Life for Windows Server 2003

Posted by Paul Gregory on 14 July 2014

It has been well documented that Windows Server 2003 will have support withdrawn on the 15th July 2015.

The benefits of the Cloud and Amazon Web Services (AWS)

Posted by on

If you read the tech press, you would think absolutely everybody was moving to the cloud. But is that just hype, or is it really true? And if it’s true, what benefits are they getting from it?

SP13IE10Issue

Posted by John Day on 15 May 2014

SharePoint 2013 and Internet Explorer 10 have a stormy relationship. I think it's time for marriage guidance counselling.

App-V 4.x to 5.0 Package conversion: Fixing the broken Pipeline!

Posted by on

The App-V 5.0 package format is very different from the previous 4.5/4.6 version, and the App-V 5.0 client is not compatible with the earlier package versions. To help protect your sequencing investment, Microsoft included two PowerShell commands on the sequencer to aid in migration: Test-AppVLegacyPackage and ConvertFrom-AppVLegacyPackage. The first tests the old package for known constraints, while the second attempts to convert the package to the new format

Top 20 Photoshop Shortcuts

Posted by Richard O'Brien on 21 November 2013

One of the things we're regularly asked on courses is "is there a quicker way to do xyz?" Very often the answer is a resounding 'yes'. So, I thought with this post I'd cover my favourite (and most commonly used) top 20 shortcuts when working with Adobe Photoshop (either in Creative Suite or Creative Cloud).

SP13PermissionsConcern

Posted by on

Beware of Geeks bringing gifts. The Site Members have more power in SharePoint 2013 than you may want them to.

See all related blogs

top of page
  • Amazon logo
  • Apple logo
  • AppSense logo
  • cisco logo
  • citrix logo
  • compTIA logo
  • ec council logo
  • Hortonworks CTP logo
  • microsoft gold logo
  • novell logo
  • oracle logo
  • Pya -winner -2013 logo
  • redhat logo
  • Salesforce logo
  • symantec logo
  • vmware logo
  • Apple logo
  • Apple logo
  • Hortonworks CTP logo
  • oracle logo