• Performance Plus
  • New Course
  • Authored Course

About this course

Fundamentals of Data Science is a three day overview course which blends discussion and group exercises to explore the field of data science with applied real world examples and projects.

Teaching begins with a conceptual introduction to science, data science, big data and machine learning; followed by a litany of real-world data science and machine learning examples.

The remainder is divided into two parts: python-illustrated and r-illustrated. After introducing both languages, the modules cover various applied topics (data preparation, statistics, Markov chains, neural nets) with examples in either python or R.

Learn from the UK's leading Data Science trainer

  • Unrivaled curriculum

    We offer a wide selection of courses for all levels of data experience.

  • Leading certifications

    We offer tracks to professional capabilities.

  • Tailored for business

    We offer courses for business leaders - as well as specialists.

Why people choose QA


There are over 20 QA learning centres and many other sites spread across the UK, providing a convenient choice of learning locations and ensuring that over 90% of the population is within 45 minutes of a training destination. Learn more

  • London


    International House

  • Manchester


    Oxford Street

Delegate portal

Booking courses with QA has always been easy, but now we've made it even easier. With myQA you can book, administer and manage all your bookings online, in one place. Login / sign-up

Building a Digital Culture
Practical Data Science

An introduction to Statistics, R, Python, Analytics, DS and Machine Learning.

How to Thrive in a Digital World
Fundamentals of Big Data

An Introduction to Data Science and mathematical models underpinning Big Data.

Leading Digital Transformations
Practical Big Data Analysis

Introduction to Python, Data Science and Big Data for practitioners working with them.

Navigating the Social Media Landscape
Fundamentals of Machine Learning

Introduction to R and data analytics. Deeper Introduction to Machine Learning.

Course Information

Intro to Science, Data Science and Big Data

  • Introduction to Data Science
  • What is Science?
  • What is the Scientific Method?
  • What is Data?
  • How is Data Structured?
  • What is Big Data?
  • What is Data Science?
  • What is the Method of Data Science?
  • What is Machine Learning?
  • What skills does a Data Scientist have?

Intro to Data Science Methodology

  • Methods of Data Science
  • Methods of Machine Learning
  • Data Preparation
    • Gathering Data
    • Storing Data
    • Cleaning Data
  • Types of problems
    • Classification
    • Regression
    • Clustering
    • Recommender systems

Data Science Python Primer

Intro to Python

  • Overview
  • History & Philosophy
  • Installation
  • Language Characteristics
  • Anatomy of an Python Program
  • Anatomy of an REPL Session
  • Help
  • Getting Started with Python: Data
  • Getting Started with Python: Calling Functions
  • Getting Started with Python: Packages

Intro to Python for Data Science

  • Python is Slow
  • ndarrays
  • Multiple Dimensions
  • Data Type
  • Slice and Dice
  • Matrices
  • Conversions
  • Operations on ndarray
  • Reduce & Accumulate
  • Summary Statistics
  • Plotting

Intro to Markov Chains with Python

  • Probabilities and Expectations
  • Independence
  • The Independent Future
  • The Dependent Future
  • Markov Chains & Assumptions
  • Transition matrix
  • Monte Carlo
  • One Event
  • State
  • Transition
  • Probability
  • Sample Space
  • Sequencing
  • n Events & Convergence

Data Science R Primer

Intro to R for Data Preparation

  • Overview
  • History & Philosophy
  • CRAN
  • Installation
  • R Studio
  • Language Characteristics
  • Anatomy of an R Program
  • Anatomy of an REPL Session
  • Help
  • Getting Started with R: Data
  • Getting Started with R: Plots
  • Getting Started with R: Calling Functions
  • Getting Started with R: Packages
  • Getting Started: Using the REPL
  • Getting Started: Data Preparation
    • Data selection
    • Data sampling
    • Normalisation
    • Cleansing
    • Missing values

Intro to Statistics with R

  • Introduction to Statistics
  • Overview
  • Observation and Measurement
  • Probabilities and Events
  • Frequencies
  • Populations
  • Samples
  • The Normal Distribution
  • Interpreting the Normal Distribution​
  • Measures of Central Tendency
  • Measures of Spread
  • ​Hypotheses
  • Linear Relationships
  • Correlation
  • Simpson's Paradox

Intro to Learning Algorithms and Deep Learning with R

  • Examples of creating algorithms
  • Overview of Machine Learning algorithms
    • Decision trees
    • Clustering
    • Segmentation
    • Association
    • Classification
    • Sequence analysis
  • Neural nets
    • History
    • Layers
    • Weights
    • Back propagation
    • Deep Learning
  • KNN
  • SVM

Delegates attending this course should have technical skills such as application or SQL programming – expose to Python or R would be advantageous.