Skip to content

hands-on lab

Implementing a RAG Pipeline With AWS Step Functions

Difficulty: Beginner

Duration: Up to 1 hour and 30 minutes

Students: 78

Rating: 5/5

On average, students complete this lab in45m

Get guided in a real environmentPractice with a step-by-step scenario in a real, provisioned environment.

Learn and validateUse validations to check your solutions every step of the way.

See resultsTrack your knowledge and monitor your progress.

Description

Retrieval Augmented Generation (RAG) based knowledge bases enable you to interrogate your documents and information using a Generative AI chat interface. To use them most effectively on an ongoing basis, you need to ingest and transform your data to make it available for use by a Generative AI model. AWS Step Functions can be used to build and run a pipeline capable of processing your data.

Learning how to implement a pipeline in AWS Step Functions will benefit those looking to use their own data with Generative AI models.

In this hands-on lab, you will implement a pipeline to ingest and process PDF documents, and observe the pipeline running.

Learning objectives

Upon completion of this beginner-level lab, you will be able to:

Implement ingestion, transformation AWS Lambda functions
Generate AI model embeddings of plain text data
Create a vector index in Amazon OpenSearch
Implement an AWS Step Functions state machine

Intended audience

Anyone who is interested in using Generative AI in the public AWS cloud
Cloud Architects
Data Engineers
DevOps Engineers
Machine Learning Engineers
Software Engineers

Prerequisites

Familiarity with the following will be beneficial but is not required:

AWS Lambda
Amazon OpenSearch Service
AWS Step Functions
The Python scripting language

The following content can be used to fulfill the prerequisites:

Updates

November 26th, 2025 - Updated instructions and screenshots to reflect the latest user interface

Environment before

Environment after

Covered topics

Amazon Web Services

Analytics for AWS

Development for AWS

Compute for AWS

AWS Step Functions

Amazon Elasticsearch Service

Serverless for AWS

Hands-on Lab UUID

Lab steps

0 of 5 steps completed.Use arrow keys to navigate between steps. Press Enter to go to a step if available.

Logging In to the Amazon Web Services Console
Implementing the Pipeline's AWS Lambda Functions
Creating a Vector Index in Amazon OpenSearch
Configuring an AWS Step Functions Workflow
Testing the RAG Pipeline

Lab Rules

Lab rules apply