Transcribing Audio Files with Google Cloud Speech-to-Text
Description
Google Cloud Speech-to-Text uses speech recognition to transcribe audio files into text. You can transcribe audio in the Google Cloud Platform (GCP) console or by calling the Speech-to-Text API, for example from a Cloud Function. Speech-to-Text can be used to enable voice control and voice search, add subtitles to streaming content in real time, and improve the customer experience by adding interactive voice response (IVR) to your applications.
In this lab, you will use Google Cloud Speech-to-Text, both in the GCP console and through the Speech-to-Text API, to transcribe an audio file stored in a Cloud Storage bucket.
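To give a sense of what an API-based transcription involves, here is a minimal sketch of the JSON body that the Speech-to-Text v1 REST method `speech:recognize` accepts, plus a helper that reads the transcript out of a response. It is not the lab's own code. The bucket and file name (`gs://example-bucket/audio.flac`) and the helper names `build_recognize_request` and `extract_transcript` are hypothetical, and the sketch assumes a FLAC or WAV file so that the encoding and sample rate can be read from the file header.

```python
import json

# Public REST endpoint for synchronous recognition (Speech-to-Text v1).
RECOGNIZE_URL = "https://speech.googleapis.com/v1/speech:recognize"


def build_recognize_request(gcs_uri, language_code="en-US"):
    """Build the JSON body for a synchronous recognize call.

    Assumes FLAC or WAV audio, so no explicit encoding or sample rate
    is set in the config. Other formats would need those fields.
    """
    if not gcs_uri.startswith("gs://"):
        raise ValueError("expected a Cloud Storage URI such as gs://bucket/object")
    return {
        "config": {"languageCode": language_code},
        "audio": {"uri": gcs_uri},
    }


def extract_transcript(response):
    """Join the top alternative of each result into a single string."""
    pieces = []
    for result in response.get("results", []):
        alternatives = result.get("alternatives", [])
        if alternatives:
            pieces.append(alternatives[0].get("transcript", "").strip())
    return " ".join(piece for piece in pieces if piece)


if __name__ == "__main__":
    # Hypothetical bucket and object name, for illustration only.
    body = build_recognize_request("gs://example-bucket/audio.flac")
    print("POST", RECOGNIZE_URL)
    print(json.dumps(body, indent=2))
```

Sending the request requires authenticated credentials, which the sketch leaves out. The response groups recognized speech into `results`, each with one or more `alternatives`; the helper keeps only the first alternative of each result.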
Learning Objectives
Upon completion of this beginner-level lab, you will be able to:
- Transcribe an audio file into text using Google Cloud Speech-to-Text
- Incorporate the Speech-to-Text API into a Google Cloud Function
Intended Audience
- Cloud Architects
- Software Engineers
- Machine Learning Engineers
Prerequisites
Familiarity with the following will be beneficial but is not required:
- Google Cloud Functions
Updates
December 19th, 2024 - Updated screenshots and instructions to reflect the latest UI
February 21st, 2024 - Resolved deployment issue