Let’s make it work for you 

Overview

Data science is about using scientific methods, processes, algorithms, and systems to analyse and extract insight from data. We believe organisations that master AI, Cloud, and Data can turn information into a competitive advantage. This hands-on workshop demonstrates how GPU-accelerated tools can transform data science workflows, enabling faster experimentation, greater scalability, and more cost-effective outcomes.

Across the workshop, learners use RAPIDS libraries to accelerate data manipulation, machine learning, and graph analytics. Participants work with cuDF, cuML, cuGraph, and related tools to process large and larger-than-memory datasets. The course culminates in a population-scale project that applies GPU-accelerated analytics to simulate and respond to an epidemic affecting the UK, reinforcing practical, real-world application of skills.

Read more +

Prerequisites

Participants should have:

  • Experience with Python programming and common data science libraries
  • A foundational understanding of data manipulation and analysis concepts
  • Familiarity with basic machine learning principles
  • Awareness of pandas or similar dataframe-based workflows

Target audience

This course is designed for:

  • Data scientists seeking to accelerate existing Python workflows
  • Machine learning engineers working with large or complex datasets
  • Developers and analysts exploring GPU-accelerated analytics
  • Organisations aiming to scale data science capabilities efficiently
Read more +

Learning Objectives

By the end of this workshop, learners will be able to:

  • Use cuDF to accelerate pandas, Polars, and Dask workflows for analysing datasets of varying sizes
  • Ingest and prepare large and larger-than-memory datasets directly on single or multiple GPUs
  • Apply GPU-accelerated supervised and unsupervised machine learning algorithms using cuML
  • Use algorithms such as XGBoost to address a range of data science problems
  • Create and analyse complex network data using NetworkX and cuGraph
  • Deploy machine learning models to an NVIDIA Triton Inference Server for optimised performance
  • Integrate multiple large datasets to perform iterative, real-world analysis tasks
Read more +

Course Outline

Introduction and environment setup

  • Meet the instructor and review workshop objectives
  • Set up access to the training environment
  • Overview of GPU-accelerated data science and the RAPIDS ecosystem
  • Understanding the role of GPUs in scalable analytics

GPU-accelerated data manipulation

Ingest and prepare several datasets, including larger-than-memory data, for use in downstream machine learning tasks:

  • Reading data directly to single and multiple GPUs using pandas, Polars, cuDF, and Dask
  • Comparing CPU-based and GPU-accelerated dataframe operations
  • Cleaning and transforming structured datasets on the GPU
  • Preparing population, road network, and clinic datasets for analysis
  • Managing memory constraints and performance considerations
  • Building repeatable, scalable data preparation pipelines

GPU-accelerated machine learning

Apply essential machine learning techniques to prepared datasets using GPU-accelerated libraries:

  • Introduction to cuML and GPU-based model training
  • Using supervised learning algorithms for predictive modelling
  • Applying unsupervised learning techniques for clustering and pattern discovery
  • Leveraging XGBoost for classification and regression tasks
  • Evaluating model performance and refining hyperparameters
  • Understanding performance trade-offs between CPU and GPU workflows

Graph analytics on the GPU

Perform advanced graph analytics to analyse complex networks:

  • Introduction to graph data structures and network analysis concepts
  • Creating graph data on the GPU using cuGraph
  • Analysing connectivity, centrality, and path-based metrics
  • Comparing NetworkX and cuGraph implementations
  • Scaling graph analytics to large, population-scale datasets

Project: data analysis to support the UK during a simulated epidemic

Apply new GPU-accelerated data manipulation and analysis skills to a population-scale scenario:

  • Integrating multiple massive datasets using RAPIDS libraries
  • Performing real-world analysis to model and respond to a simulated epidemic affecting the UK population
  • Pivoting and iterating on analysis as new simulated daily data becomes available
  • Identifying patterns and insights to inform intervention strategies
  • Communicating findings clearly and effectively

Inference and deployment considerations

  • Preparing trained machine learning models for inference
  • Deploying models to an NVIDIA Triton Inference Server
  • Validating model performance in a live inference context
  • Understanding scalability and operational considerations for production environments

Exams and assessments

Learners complete practical, scenario-based exercises throughout the workshop, culminating in a project that integrates data manipulation, machine learning, and graph analytics techniques.

Assessment is based on applied tasks that evaluate the ability to prepare data, train GPU-accelerated models, perform graph analysis, and interpret results within the simulated epidemic scenario.

Hands-on learning

This workshop is built around applied, GPU-enabled practice:

  • Guided exercises using RAPIDS libraries including cuDF, cuML, and cuGraph
  • Real-world datasets reflecting population, infrastructure, and healthcare contexts
  • Iterative experimentation enabled by accelerated compute performance
  • Project-based learning focused on solving a complex, evolving problem
Read more +

Why choose QA

Yellow
Need to know

Frequently asked questions

How can I create an account on myQA.com?

There are a number of ways to create an account. If you are a self-funder, simply select the "Create account" option on the login page.

If you have been booked onto a course by your company, you will receive a confirmation email. From this email, select "Sign into myQA" and you will be taken to the "Create account" page. Complete all of the details and select "Create account".

If you have the booking number you can also go here and select the "I have a booking number" option. Enter the booking reference and your surname. If the details match, you will be taken to the "Create account" page from where you can enter your details and confirm your account.

Find more answers to frequently asked questions in our FAQs: Bookings & Cancellations page.

How do QA’s virtual classroom courses work?

Our virtual classroom courses allow you to access award-winning classroom training, without leaving your home or office. Our learning professionals are specially trained on how to interact with remote attendees and our remote labs ensure all participants can take part in hands-on exercises wherever they are.

We use the WebEx video conferencing platform by Cisco. Before you book, check that you meet the WebEx system requirements and run a test meeting to ensure the software is compatible with your firewall settings. If it doesn’t work, try adjusting your settings or contact your IT department about permitting the website.

How do QA’s online courses work?

QA online courses, also commonly known as distance learning courses or elearning courses, take the form of interactive software designed for individual learning, but you will also have access to full support from our subject-matter experts for the duration of your course.

Once you have purchased the Online course and have completed your registration, you will receive the necessary details to enable you to immediately access it through our e-learning platform and you can start to learn straight away, from any compatible device. Access to the online learning platform is valid for one year from the booking date.

All courses are built around case studies and presented in an engaging format, which includes storytelling elements, video, audio and humour. Every case study is supported by sample documents and a collection of Knowledge Nuggets that provide more in-depth detail on the wider processes.

When will I receive my joining instructions?

Joining instructions for QA courses are sent two weeks prior to the course start date, or immediately if the booking is confirmed within this timeframe. For course bookings made via QA but delivered by a third-party supplier, joining instructions are sent to attendees prior to the training course, but timescales vary depending on each supplier’s terms. Read more FAQs.

When will I receive my certificate?

Certificates of Achievement are issued at the end the course, either as a hard copy or via email. Read more here.

Let's talk

A member of the team will contact you within 4 working hours after submitting the form.

By submitting this form, you agree to QA processing your data in accordance with our Privacy Policy.