Overview
This three day course is aimed at those wishing to learn how to use Python to work with and handle Data. When combined with our Introduction to Data Science course you would be set up well to follow a Python learning journey into Data Engineering, Advanced Data Analytics, Data Science, Machine Learning, and Artificial Intelligence.
During the programme you will be introduced to Python and specific development environments and packages for working with Data, with a focus on NumPy, Pandas, Matplotlib, and Seaborn.
Along the way you will see how to clean and manipulate tabular data, apply simple statistical techniques and data visualisations, and learn about how to control the flow of your program in order to automate processes.
Throughout the course you will engage with activities and discussions with one of our Data Science technical specialists and complete technical lab activities to practice the techniques you have learnt and develop ideas for further practice.
Prerequisites
No prior experience with Python is necessary, though it is assumed that you will be familiar with core data concepts such as simple table structures and data types – all the pre-requisites you need are covered by our Data Essentials course.
Target Audience
This course is intended for Data Analysts, Data Engineers, Data Ops roles, and those training to consume AI Services or become Data Scientists and tune and develop Machine Learning and AI models on our subsequent Data Science learning pathway.
This course covers the key pre-requisites for a large range of further learning opportunities involving Python, Data, and AI.
Delegates will learn how to
- Benefit from the speed and functionality of the NumPy and Pandas python packages
- Create and control Data Visualisations using Matplotlib and Seaborn
- Use Python with the Jupyter development environment
- Retrieve, clean, and prepare data from multiple types of sources
- Gain a firm grounding in Python with Data in order to progress to further study to connect to AI models, Engineer data pipelines, and develop Data Science solutions
Outline
1. Introduction to Programming for Data Handling
- Describe the pros and cons of using programming languages to work with data
- Identify the languages most suitable for data handling
- Explain the challenges of using programming languages versus data analysis tools
2. Introduction to Python and IDEs
- Describe the key attributes of the Python programming language.
- Explain the role of the Jupyter IDE for Python programming.
- Use the Jupyter IDE to write a basic Python program.
- Write a program which uses string, integer, float and boolean data types.
3. Data Structures, Flow Control, Functions, and Basic Types
- Construct collections to solve data problems.
- Utilise selection and iteration syntax to control the flow of a Python program.
- Write reusable functions which can be used to alter data & automate repetitive tasks.
- Use Python's built-in open function to create, read, and edit files.
4. Mathematical and Statistical Programming with NumPy
- Describe the core features of NumPy arrays.
- Create, index, and manipulate NumPy arrays to solve data problems.
- Use masking and querying syntax to retrieve desired values.
- Use vectorised ufuncs.
5. Introduction to Pandas
- Create, manipulate, and alter Series and DataFrames with Pandas.
- Define and change the indices of Series & Dataframes.
- Use Pandas' functions and methods to change column types, compute summary statistics and aggregate data.
- Read, manipulate, and write data from csv, xlsx, json and other structured file formats.
6. Data Cleaning with Pandas
- Identify missing data and apply techniques to deal with it.
- Deduplicate, transform and replace values.
- Use DataFrame string methods to manipulate text data.
- Write regular expressions which munge text data.
7. Data Manipulation with Pandas
- Construct Pivot tables in Pandas.
- Time series manipulation.
- Stream data into Pandas to handle data size problems.
8. Methods for Visualising Data
- Construct and tailor basic data visualisations using Matplotlib & Seaborn for both numeric & non-numeric data.
- Meaningfully visualise aggregate data using Matplotlib and Seaborn.
Related learning
Data Science Learning Pathways can be selected by choosing either Python or R and a Cloud Platform certification:
- QAIDSDP Introduction to Data Science for Data Professionals
- Sourcing and handling data:
- QADHPYTHON Data Handling with Python
- QADHR Data Handling with R
- QAPDHAI Python Data Handling with AI APIs
- Statistics for Data Analysis:
- QASDAPY Statistics for Data Analysis with Python
- QASDAR Statistics for Data Analysis with R
- Programming and Software Development skills:
- QAPYTH3 Python Programming
- QARPROG R Programming
- Machine Learning Development:
- QADSMLP Data Science and Machine Learning with Python
- QADSMLR Data Science and Machine Learning with R
- Mathematics for Developing Algorithms for AI models, Big Data Mining, and working with Neural Networks:
- QAMFDS Mathematics for Data Science
- Forecasting:
- QATSFP Time Series and Forecasting with Python
- QATSFR Time Series and Forecasting with R
Suggested courses leading to Certification:
- MDP100 Designing and Implementing a Data Science Solution on Azure (DP-100)
- AMWSMLP Machine Learning Pipelines on AWS
- GCPMLGC Machine Learning on Google Cloud
Frequently asked questions
How can I create an account on myQA.com?
There are a number of ways to create an account. If you are a self-funder, simply select the "Create account" option on the login page.
If you have been booked onto a course by your company, you will receive a confirmation email. From this email, select "Sign into myQA" and you will be taken to the "Create account" page. Complete all of the details and select "Create account".
If you have the booking number you can also go here and select the "I have a booking number" option. Enter the booking reference and your surname. If the details match, you will be taken to the "Create account" page from where you can enter your details and confirm your account.
Find more answers to frequently asked questions in our FAQs: Bookings & Cancellations page.
How do QA’s virtual classroom courses work?
Our virtual classroom courses allow you to access award-winning classroom training, without leaving your home or office. Our learning professionals are specially trained on how to interact with remote attendees and our remote labs ensure all participants can take part in hands-on exercises wherever they are.
We use the WebEx video conferencing platform by Cisco. Before you book, check that you meet the WebEx system requirements and run a test meeting to ensure the software is compatible with your firewall settings. If it doesn’t work, try adjusting your settings or contact your IT department about permitting the website.
How do QA’s online courses work?
QA online courses, also commonly known as distance learning courses or elearning courses, take the form of interactive software designed for individual learning, but you will also have access to full support from our subject-matter experts for the duration of your course. When you book a QA online learning course you will receive immediate access to it through our e-learning platform and you can start to learn straight away, from any compatible device. Access to the online learning platform is valid for one year from the booking date.
All courses are built around case studies and presented in an engaging format, which includes storytelling elements, video, audio and humour. Every case study is supported by sample documents and a collection of Knowledge Nuggets that provide more in-depth detail on the wider processes.
When will I receive my joining instructions?
Joining instructions for QA courses are sent two weeks prior to the course start date, or immediately if the booking is confirmed within this timeframe. For course bookings made via QA but delivered by a third-party supplier, joining instructions are sent to attendees prior to the training course, but timescales vary depending on each supplier’s terms. Read more FAQs.
When will I receive my certificate?
Certificates of Achievement are issued at the end the course, either as a hard copy or via email. Read more here.