About this course

Course code TPZL1_DW612
Duration 3 Days

This training course is for those who want a foundation in IBM InfoSphere BigInsights. It gives an overview of IBM's Big Data strategy and more detailed information on Apache Hadoop. It presents the concepts a system administrator needs to work with the Hadoop Distributed File System (HDFS) and the MapReduce concepts a developer needs. It also introduces the scheduling capabilities of Hadoop and shows how to use Oozie to control workflows and Flume to load data into HDFS.

Curriculum relationship:

Programming for InfoSphere Streams V3 with SPL (DW720)
InfoSphere Streams Administration (DW730)
IBM InfoSphere BigInsights Analytics for Business Analysts (DW640)
IBM InfoSphere BigInsights Analytics for Programmer (DW650)

Prerequisites

There are no prerequisites for this course. However, knowledge of Linux would be beneficial.

Who Should Attend?
This intermediate training course is for those who want a foundation in IBM InfoSphere BigInsights. It is designed for system administrators and developers.

Delegates will learn how to

  • Describe functions and features of InfoSphere BigInsights
  • List the capabilities of Hadoop and HDFS
  • Administer HDFS
  • Describe the use of MapReduce
  • Set up a Hadoop cluster
  • Manage job execution
  • Explain the Oozie workflows
  • Describe some scenarios for loading data into HDFS

Outline

Day 1


  • Unit 1 - Introduction to Big Data
  • Unit 2 - Introduction to InfoSphere BigInsights
  • Exercise 1 - Installing BigInsights
  • Unit 3 - Apache Hadoop and HDFS
  • Exercise 2 - Exploring Apache Hadoop
  • Unit 4 - GPFS-FPO
  • Unit 5 - BigInsights Web Console

Day 2


  • Exercise 3 - BigInsights Web Console
  • Unit 6 - Introduction to MapReduce - continued
  • Exercise 4 - MapReduce
  • Unit 7 - Adaptive MapReduce
  • Unit 8 - Setup and Configurations of BigInsights Clusters
  • Exercise 5 - Hadoop Configuration

Day 3


  • Unit 9 - Overview of Oozie
  • Exercise 6 - Scheduling with Oozie
  • Unit 10 - Managing Job Execution
  • Unit 11 - Moving Data into Hadoop
  • Exercise 7 - Using Flume for Data Loading

Training delivered by an IBM Global Training Provider

Delivery method

Classroom

Face-to-face learning in the comfort of our quality nationwide centres, with free refreshments and Wi-Fi.

Find dates and prices

Online booking is not currently available for this course. To find out more, please call us on 0345 074 7998 or email us at info@qa.com to discuss how we can help.

All third party trademark rights acknowledged.