hands-on lab

Troubleshooting Amazon Athena Queries

Difficulty: Intermediate
Duration: Up to 1 hour
Students: 340
Rating: 4/5
Get guided in a real environment: Practice with a step-by-step scenario in a real, provisioned environment.
Learn and validate: Use validations to check your solutions every step of the way.
See results: Track your knowledge and monitor your progress.

Description

Amazon Athena is a serverless analytics service that you can use to query your data at scale without managing infrastructure. It is built on well-known, reliable open-source frameworks such as Apache Hive, Trino, Presto, and Apache Spark.

Learning to troubleshoot Amazon Athena queries will make you more effective at running data analysis workloads on AWS.

In this hands-on lab, you will use the Amazon Athena console to query data stored in Amazon S3 buckets.

Learning objectives

Upon completion of this intermediate-level lab, you will be able to:

  • Select an Amazon Athena workgroup
  • Query log data stored in plain text
  • Query log data stored in JSON
  • Use a CREATE TABLE AS SELECT (CTAS) statement
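As a rough illustration of the last objective, the sketch below assembles a CREATE TABLE AS SELECT statement of the kind Athena accepts. It is not taken from the lab itself: the table names, the S3 location, and the helper name `build_ctas` are all hypothetical, and the choice of Parquet as the output format is an assumption.

```python
def build_ctas(new_table, source_table, external_location, data_format="PARQUET"):
    """Return an Athena CTAS statement as a string.

    All arguments are illustrative placeholders, not values from the lab.
    """
    return (
        f"CREATE TABLE {new_table}\n"
        f"WITH (format = '{data_format}', external_location = '{external_location}')\n"
        f"AS SELECT * FROM {source_table}"
    )


# Hypothetical example: copy a JSON-backed log table into a Parquet-backed one.
ctas_sql = build_ctas(
    new_table="logs_parquet",
    source_table="logs_json",
    external_location="s3://example-bucket/logs-parquet/",
)
print(ctas_sql)
```

The resulting string could be pasted into the Athena query editor. Writing query results in a columnar format is one common reason to use CTAS, though the lab may use it differently.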

Intended audience

  • Candidates for the AWS Certified Data Engineer - Associate certification
  • Data Engineers
  • DevOps Engineers
  • Machine Learning Engineers

Prerequisites

Familiarity with the following will be beneficial but is not required:

  • Amazon Athena
  • Amazon Simple Storage Service (S3)
  • Structured Query Language (SQL)

Lab steps

Logging In to the Amazon Web Services Console
Configuring Amazon Athena
Troubleshooting Partial Results
Troubleshooting Query Performance