Using AWS Data Wrangler for ETL Pipelines With AWS Data Services

Difficulty: Beginner

Duration: 3 minutes and 5 seconds

Students: 205

Rating: 5/5

Learning Objectives

This is an introductory-level AWS development lesson. You will learn what the AWS Data Wrangler library is, what it does, and how to set it up for use.

Intended Audience

This lesson is intended for AWS Python developers who are familiar with the Pandas and PyArrow libraries and are building non-distributed pipelines using AWS services. The AWS Data Wrangler library provides an abstraction for connectivity, extract, and load operations on AWS services.

Prerequisites

To get the most out of this lesson, you should meet the AWS Developer Associate certification requirements or have equivalent experience.

This lesson assumes that you have an existing Python development environment, are familiar with it, and have set up the AWS CLI or SDK with the required configuration and keys. Familiarity with Python syntax is also required. We walk through the basic setup for some of these but do not explain the process in detail.

For the fundamentals of these skills and additional details, refer to the following lessons at Cloud Academy:

1) Python for Beginners

2) Data Wrangling With Pandas

3) Introduction to the AWS CLI

4) How to Use the AWS Command-Line Interface