AWS Glue is a service that data analytics professionals can use to catalog, transform, and integrate data from different sources. By consolidating integration capabilities into a single centralized service, AWS Glue gives you the ability to discover, cleanse, catalog, and transform data in a single place.
Learning how to use AWS Glue to work with data will help you become more effective at creating and using data lakes in the public AWS cloud.
In this lab, you will implement an AWS Lambda function that processes order data as it is uploaded to Amazon S3, and you will see how to configure AWS Glue to make searching the data more efficient.
Upon completion of this beginner-level lab, you will be able to:
Familiarity with the following will be beneficial but is not required:
The following content can be used to fulfill the prerequisites:
June 5th, 2024 - Updated the instructions and screenshots to reflect the latest UI
February 15th, 2023 - Updated the Lambda implementation step with a test event