Big Data Training Library
Learn to architect for scale, get hands-on with the leading big data tools, and reveal meaningful insights from data using services on Amazon Web Services, Microsoft Azure, and Google Cloud Platform. Content added and updated weekly.
- HANDS-ON LAB: Configuring Distribution Styles and Table Access in Amazon Redshift. Author: Andrew Burchill; Difficulty: Beginner; Duration: Up to 1 hour; Content Topics: Amazon Web Services; This hands-on lab has: 5 Lab steps; Description: Learn how to create tables, set distribution styles, and configure fine-grained access on an Amazon Redshift cluster in this hands-on lab.
- LEARNING PATH: Coding with the Functional Programming Paradigm Using Scala. Author: Stefano Cascavilla; Difficulty: Intermediate; Duration: Up to 12 hours and 30 minutes; Content Topics: Development; This learning path has: 2 Lab challenges, 9 Hands-on labs; Description: Get a practical understanding of the Scala programming language, its features, and how to use it with this labs-based course!
- LEARNING PATH: Neo4j Graph Databases: A Firsthand Approach. Author: Stefano Cascavilla; Difficulty: Intermediate; Duration: Up to 6 hours and 58 minutes; Content Topics: Graph Databases; This learning path has: 1 Course, 1 Lab challenge, 4 Hands-on labs; Description: In this hands-on course, you will learn how to create and manage your own graph databases using Neo4j.
- HANDS-ON LAB: Defining and Working With dbt Tests. Author: Stefano Cascavilla; Difficulty: Advanced; Duration: Up to 1 hour and 15 minutes; Content Topics: Data build tool (dbt); This hands-on lab has: 7 Lab steps; Description: In this lab, you will learn what dbt tests are. You will then create a custom test and test your sources and a model with both native dbt tests and the custom dbt test.
- HANDS-ON LAB: Working With Incremental dbt Models. Author: Stefano Cascavilla; Difficulty: Beginner; Duration: Up to 1 hour; Content Topics: Data build tool (dbt); This hands-on lab has: 5 Lab steps; Description: In this lab, you will learn what incremental dbt models are and create two models using the incremental materialization type.
- HANDS-ON LAB: Create and Execute Your First dbt Models. Author: Stefano Cascavilla; Difficulty: Beginner; Duration: Up to 1 hour; Content Topics: Data build tool (dbt); This hands-on lab has: 5 Lab steps; Description: In this lab, you will create your first dbt models. You will create a model starting from the dbt sources, and you will create another one starting from the existing model. You will then execute and materialize them in the PostgreSQL database.
- HANDS-ON LAB: Configure a dbt Profile and Define Sources. Author: Stefano Cascavilla; Difficulty: Beginner; Duration: Up to 40 minutes; Content Topics: Data build tool (dbt); This hands-on lab has: 4 Lab steps; Description: In this lab, you will configure a dbt profile to connect to a PostgreSQL database, and you will define sources to be used in a dbt project.
- HANDS-ON LAB: Understand and Use dbt Jinja Macros. Author: Stefano Cascavilla; Difficulty: Intermediate; Duration: Up to 1 hour; This hands-on lab has: 6 Lab steps; Description: In this lab, you will learn what Jinja macros are and why they are helpful. You will then use the most important native macros, source and ref, and create a custom macro.
- HANDS-ON LAB: Create Your First dbt (Data Build Tool) Project. Author: Stefano Cascavilla; Difficulty: Beginner; Duration: Up to 40 minutes; Content Topics: Data build tool (dbt); This hands-on lab has: 4 Lab steps; Description: In this lab, you will learn what dbt is and why it is helpful for data transformations, then install it and create your first dbt project.
- HANDS-ON LAB: Working With Full-Refresh dbt Models. Author: Stefano Cascavilla; Difficulty: Beginner; Duration: Up to 1 hour; Content Topics: Data build tool (dbt); This hands-on lab has: 5 Lab steps; Description: In this lab, you will learn what full-refresh dbt models are and create two models using the table materialization type.
- HANDS-ON LAB: Combining and Enriching Data with Amazon Managed Workflows for Apache Airflow. Author: Andrew Burchill; Difficulty: Intermediate; Duration: Up to 2 hours; Content Topics: Amazon Web Services; This hands-on lab has: 5 Lab steps; Description: Learn about Amazon Managed Workflows for Apache Airflow in this hands-on lab as you create a Directed Acyclic Graph in Apache Airflow.
- LEARNING PATH: Google Professional Cloud Database Engineer Exam Preparation. Author: Daniel Mease; Difficulty: Intermediate; Duration: Up to 21 hours and 19 minutes; Content Topics: Google Cloud Platform; This learning path has: 1 Resource, 14 Courses, 2 Lab challenges, 1 Exam, 11 Hands-on labs; Description: This course is designed to help you prepare for the Google Cloud Certified - Professional Cloud Database Engineer Exam.
- HANDS-ON LAB: Working With Ephemeral dbt Models. Author: Stefano Cascavilla; Difficulty: Intermediate; Duration: Up to 1 hour; Content Topics: Data build tool (dbt); This hands-on lab has: 5 Lab steps; Description: In this lab, you will learn what ephemeral dbt models are and create an ephemeral dbt model that a full-refresh model then uses. You will then review what dbt has materialized and what it has not.
- HANDS-ON LAB: Introduction to Financial Data Manipulation with Python. Author: Andrea Giussani; Difficulty: Beginner; Duration: Up to 1 hour; Content Topics: Development, Analytics; This hands-on lab has: 2 Lab steps; Description: The goal of this lab is to consolidate your data management and manipulation skills using Python.
- LEARNING PATH: Oracle APEX: Foundations. Author: Oracle University; Difficulty: Beginner; Duration: Up to 3 hours and 48 minutes; Content Topics: SQL; This learning path has: 9 Courses; Description: Master the fundamentals of application development using Oracle APEX with this comprehensive course provided in partnership with Oracle University.
- LEARNING PATH: Oracle Autonomous Database Administration Workshop. Author: Oracle University; Difficulty: Intermediate; Duration: Up to 7 hours and 54 minutes; This learning path has: 9 Courses; Description: Learn the fundamentals of Autonomous databases and take your DBA skills to the next level as you learn to deploy and administer them.
- HANDS-ON LAB: Best Practices for Organizing dbt Models. Author: Stefano Cascavilla; Difficulty: Advanced; Duration: Up to 1 hour and 15 minutes; Content Topics: Data build tool (dbt); This hands-on lab has: 7 Lab steps; Description: In this lab, you will define some dbt models and organize them following best practices to keep the dbt project clear and maintainable.
- HANDS-ON LAB: Understand and Work With dbt Seeds. Author: Stefano Cascavilla; Difficulty: Intermediate; Duration: Up to 1 hour; Content Topics: Data build tool (dbt); This hands-on lab has: 5 Lab steps; Description: In this lab, you will learn what dbt seeds are, create and use them, and build a model based on a seed.
- HANDS-ON LAB: Leveraging the Yahoo Finance API to Retrieve and Visualize Financial Data. Author: Stefano Cascavilla; Difficulty: Beginner; Duration: Up to 1 hour; Content Topics: Development; This hands-on lab has: 2 Lab steps; Description: In this lab, you will leverage the Yahoo Finance API using the yfinance Python library, and you will visualize the market data using the plotly Python library.
- HANDS-ON LAB: Planning EC2 Capacity With Amazon Forecast. Author: Andrew Burchill; Difficulty: Beginner; Duration: Up to 1 hour; Content Topics: Amazon Web Services; This hands-on lab has: 4 Lab steps; Description: In this hands-on lab, you will see how to prepare web log data for use with Amazon Forecast's EC2 planning capability.
- HANDS-ON LAB: Working and Visualizing Cryptos Data with CoinGecko and Python. Author: Stefano Cascavilla; Difficulty: Beginner; Duration: Up to 1 hour; Content Topics: Development; This hands-on lab has: 2 Lab steps; Description: In this lab, you will use the CoinGecko Python API to retrieve and visualize cryptocurrency data.
- LEARNING PATH: Google Associate Data Practitioner Exam Preparation. Author: Daniel Mease; Difficulty: Intermediate; Duration: Up to 31 hours and 12 minutes; Content Topics: NoSQL, Encryption, Serverless, Object Storage, Containers, Machine Learning, Identity and Access Management, Messaging, Big Data, SQL; This learning path has: 27 Courses, 2 Resources, 5 Exams, 11 Hands-on labs; Description: This course is designed to help you prepare for the Google Cloud Certified Associate Data Practitioner Exam.