About this Course

This session is designed to help attendees understand the concepts and benefits of big data and Apache Hadoop, and how this technology can help them meet their business goals. Topics include components of the Apache Hadoop technology stack such as HDFS, MapReduce, YARN, Hive, HBase and Spark.

Target Audience

Anyone who is looking at adopting a big data solution, anyone involved in data-driven business change, and everyone who needs an overview of the Apache Hadoop technology ecosystem.

Learn from the UK's leading data trainer

  • Unrivaled curriculum

    We offer a wide selection of courses for all levels of data experience.

  • Leading certifications

    We offer certification tracks that lead to professional capabilities.

  • Tailored for business

    We offer courses for business leaders as well as specialists.

Why people choose QA


There are over 20 QA learning centres and many other sites spread across the UK, providing a convenient choice of learning locations and ensuring that over 90% of the population is within 45 minutes of a training destination.

  • London

    International House

  • Manchester

    Oxford Street

Delegate portal

Booking courses with QA has always been easy, but now we've made it even easier. With myQA you can book, administer and manage all your bookings online, in one place.

Course Information

  • Concepts surrounding Big Data, Analytics and Machine Learning
  • Real-world examples of how data is impacting business
  • Technology challenges at big data scale
  • How Apache Hadoop works and supports big data, analytics and business transformation
  • Common Apache Hadoop tools: HDFS, YARN, Sqoop, Pig, Hive, Spark and HBase
  • Introduction to common Hadoop distributions such as Cloudera, Hortonworks and MapR

There are no specific prerequisites for this course.

What you will learn:

  • What Big Data, Analytics and Machine Learning are, and why they are important
  • How Big Data is disrupting business and society
  • Technology changes that support Big Data: distributed computing, data locality, NoSQL and cloud computing
  • How Hadoop works
  • An overview of some of the tools used with Hadoop