hands-on lab

AI Engineer L6 M4 Practise Stage Lab Workshop 1

Difficulty: Intermediate
Duration: Up to 1 hour and 30 minutes
Students: 5
Get guided in a real environmentPractice with a step-by-step scenario in a real, provisioned environment.
Learn and validateUse validations to check your solutions every step of the way.
See resultsTrack your knowledge and monitor your progress.

Description

Workshop 1 - Data Wrangling and Feature Engineering

Machine Learning projects start with cleaning and transformation of the data and selecting the most relevant features (attributes). This is what you will practice using this lab. You will look at selecting a subset from the available data, filtering columns and rows and dealing with missing values and outliers. You will then move to the types of transformations that may be needed for numerical and categorical data. You will also explore ways of combining data sets through grouping and aggregation.

Following the data preparation, you will explore different methods of choosing the most relevant (informative) features to include in a model.

The lab for the workshop:

This lab is a sandbox allowing the learners to examine and run available Jupyter notebooks, and to create their own based on tasks given to them. The lab holds data sets (csv files), Jupyter notebook and instructions. It allows the user to download the work to their computer.

Learning objectives

Upon completion of this intermediate level lab, you will be able to:

  • Clean and Prepare Raw Datasets
  • Build datasets that improve model accuracy and business insights
  • Use pandas, NumPy, and scikit-learn (or equivalent in R)
  • Diagnose and Fix Data Quality Issues

Intended audience

  • Cloud Architects
  • Data Engineers
  • DevOps Engineers
  • Machine Learning Engineers
  • Software Engineers

Prerequisites

Familiarity with the following will be beneficial but is not required:

  • Basic programming skills with Python/R
  • Familiarity with Data tool and libraries
  • Data Structures
Hands-on Lab UUID

Lab steps

Logging In to the Amazon Web Services Console
Launching Jupyter Lab on SageMaker Notebook
Open Workshop1 Notebook