hands-on lab

Transcribing Audio Files with Google Cloud Speech-to-Text

Difficulty: Intermediate
Duration: Up to 1 hour
Students: 157
Rating: 5/5
Get guided in a real environmentPractice with a step-by-step scenario in a real, provisioned environment.
Learn and validateUse validations to check your solutions every step of the way.
See resultsTrack your knowledge and monitor your progress.

Description

Google Cloud Speech-to-Text allows you to transcribe audio files to text using speech recognition. Audio can be transcribed within the Google Cloud Platform console, or by calling the Speech-to-Text API using Cloud Functions. Speech-to-Text can be used to enable voice control and searching, add subtitles to streaming content in real-time, and also improve the customer experience by adding interactive voice response (IVR) to your applications.

In this lab, you will work with Google Cloud Speech-to-Text in the GCP console, as well as through the Speech-to-Text API to transcribe an audio file stored in a Cloud Storage bucket.

Learning Objectives

Upon completion of this beginner level lab, you will be able to:

  • Transcribe an audio file into text using Google Cloud Speech-to-Text
  • Incorporate the Speech-to-Text API into a Google Cloud Function

Intended Audience

  • Cloud Architects
  • Software Engineers
  • Machine Learning Engineers

Prerequisites

Familiarity with the following will be beneficial but is not required:

  • Google Cloud Functions

The following content can be used to fulfill the prerequisite:

Updates

December 19th, 2024 - Updated screenshots and instructions to reflect the latest UI

February 21st, 2024 - Resolved deployment issue

Environment before

Environment after

Covered topics

Lab steps

Signing In to the Google Cloud Console
Transcribing an Audio File Using Google Cloud Speech-to-Text
Creating a Speech Recognition Cloud Function using the Google Cloud Speech-to-Text API