hands-on lab

Implementing an ETL Pipeline with AWS SDK for Pandas

Difficulty: Beginner

Duration: Up to 1 hour

Students: 2

Start lab

Get guided in a real environmentPractice with a step-by-step scenario in a real, provisioned environment.

Learn and validateUse validations to check your solutions every step of the way.

See resultsTrack your knowledge and monitor your progress.

Description

AWS SDK for Pandas is a Python library supplied by Amazon that simplifies data science tasks when using Python to analyze and manipulate data. Built upon the popular Pandas library, it is performant and designed to be used at scale.

Learning how to use AWS SDK for Pandas will benefit anyone who is looking to make use of data science in the public AWS cloud.

In this hands-on lab, you will explore accessing different data stores using the library, and you will implement a Lambda function that uses it to process transaction data in real-time.