hands-on lab

Getting Started with Amazon Redshift

Difficulty: Beginner
Duration: Up to 1 hour and 45 minutes
Students: 6,202
Rating: 4.5/5
Get guided in a real environment: Practice with a step-by-step scenario in a real, provisioned environment.
Learn and validate: Use validations to check your solutions every step of the way.
See results: Track your knowledge and monitor your progress.

Description

Amazon Redshift is a managed data warehouse that allows you to analyze all your data using standard SQL and your existing Business Intelligence (BI) tools. Redshift uses query optimization, columnar storage, parallel execution, and high-performance disks to query petabytes of data in seconds. It is a good fit for companies that already rely on SQL or BI tools and want to analyze large amounts of data without replacing them. In this lab, you will learn how to create, query, and resize a Redshift cluster.

Learning Objectives

Upon completion of this lab, you will be able to:

  • Log in to the AWS Management Console
  • Use EC2 Instance Connect to communicate with Redshift
  • Create and resize a Redshift cluster
  • Load data into Redshift
  • Query data with Redshift
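
As a rough preview of the create and resize objectives, the sketch below builds the request parameters that could be passed to the boto3 Redshift client's `create_cluster` and `resize_cluster` calls. The lab itself works through the AWS Management Console, so this is only an illustration. The function names, cluster identifier, database name, user name, and password are hypothetical placeholders; only the dc2.large node type comes from the lab's update notes.

```python
def create_cluster_params(cluster_id, node_count, password):
    """Build keyword arguments for a boto3 Redshift create_cluster call.

    All values except the node type are illustrative assumptions.
    """
    params = {
        "ClusterIdentifier": cluster_id,
        "NodeType": "dc2.large",  # default node type named in the lab updates
        "ClusterType": "multi-node" if node_count > 1 else "single-node",
        "DBName": "dev",              # hypothetical database name
        "MasterUsername": "awsuser",  # hypothetical admin user
        "MasterUserPassword": password,
    }
    # NumberOfNodes is only supplied when requesting a multi-node cluster.
    if node_count > 1:
        params["NumberOfNodes"] = node_count
    return params


def resize_cluster_params(cluster_id, node_count):
    """Build keyword arguments for a boto3 Redshift resize_cluster call.

    This sketch only covers resizing to a multi-node cluster.
    """
    if node_count < 2:
        raise ValueError("this sketch only handles multi-node targets")
    return {
        "ClusterIdentifier": cluster_id,
        "ClusterType": "multi-node",
        "NumberOfNodes": node_count,
    }


# Possible usage, assuming boto3 is installed and AWS credentials are configured:
#   import boto3
#   client = boto3.client("redshift")
#   client.create_cluster(**create_cluster_params("lab-cluster", 2, "Example-Passw0rd"))
#   client.resize_cluster(**resize_cluster_params("lab-cluster", 4))
```

Keeping the parameters in plain dictionaries makes them easy to inspect before any request is sent to AWS.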

Prerequisites

You should have:

  • A basic understanding of the Linux bash shell
  • A conceptual understanding of SQL and Redshift

Lab Environment

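Before completing the lab instructions, the environment will look as follows:

[Diagram: lab environment before completing the instructions]

After completing the lab instructions, the environment should look similar to:

[Diagram: lab environment after completing the instructions]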

Updates

May 18th, 2023 - Resolved an issue that caused the lab to fail to set up on rare occasions

May 17th, 2023 - Cluster creation is initiated once the lab is started to reduce wait times

March 20th, 2023 - Resolved an issue that caused the lab to fail to set up on rare occasions

November 18th, 2022 - Updated the instructions and screenshots to reflect the latest UI

January 13th, 2022 - Updated the instructions and screenshots to reflect the latest Redshift UI

September 10th, 2021 - Updated lab steps to reflect the new Amazon Redshift console experience and removed the ARN retrieval step

July 22nd, 2020 - Updated all instructions and screenshots

May 13th, 2020 - Migrated to Instance Connect

January 10th, 2019 - Added a validation Lab Step to check the work you perform in the Lab

November 21st, 2018 - Updated Lab to use the new default dc2.large instance type and modified the Lab permission policy to allow resizing the cluster to account for IAM changes in Redshift

Lab steps

Logging In to the Amazon Web Services Console
Creating the Redshift Cluster
Connecting to the Virtual Machine using EC2 Instance Connect
Loading data into Redshift
Running sample queries
Resizing the cluster
Cleaning up the environment
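
To give a sense of what the loading and querying steps involve, here is a minimal sketch that assembles a Redshift `COPY` statement for loading a delimited file from Amazon S3, plus a sample query. It is not the lab's actual script: the function name, table name, S3 path, and IAM role ARN are hypothetical, and the lab may authorize the load differently.

```python
import re


def build_copy_statement(table, s3_uri, iam_role_arn, delimiter="|"):
    """Return a Redshift COPY statement that loads a delimited S3 file.

    The table, S3 location, and role ARN are supplied by the caller;
    the values used in the example below are placeholders.
    """
    # Allow only simple identifiers so the table name cannot inject SQL.
    if not re.fullmatch(r"[A-Za-z_][A-Za-z0-9_]*", table):
        raise ValueError(f"invalid table name: {table!r}")
    if not s3_uri.startswith("s3://"):
        raise ValueError("s3_uri must start with 's3://'")
    return (
        f"COPY {table} FROM '{s3_uri}' "
        f"IAM_ROLE '{iam_role_arn}' "
        f"DELIMITER '{delimiter}';"
    )


# A sample query that could be run once the (hypothetical) users table is loaded.
SAMPLE_QUERY = "SELECT COUNT(*) FROM users;"

statement = build_copy_statement(
    "users",
    "s3://example-bucket/allusers_pipe.txt",                # placeholder path
    "arn:aws:iam::123456789012:role/ExampleRedshiftRole",   # placeholder ARN
)
print(statement)
print(SAMPLE_QUERY)
```

Both statements could then be run from a SQL client connected to the cluster, such as `psql` on the virtual machine.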