Section Intro

Key Capabilities

OpenAI Whisper is a powerful Foundation Model optimized for high-precision audio transcription and real-time speech translation. Whether you want to convert spoken content into text or translate non-English audio into English, Whisper delivers with a straightforward API interface.

Key Capabilities

Capability	Description
Transcription	Convert spoken content from supported formats (e.g., MP3, WAV, FLAC) into written text.
Translation	Transcribe audio recorded in a foreign language and translate the resulting text into English.

Whisper supports multiple audio formats and language pairs out of the box. For optimal accuracy, use audio files encoded at 16 kHz.

In this lesson, we’ll walk through:

Uploading an audio file to the Whisper API
Invoking the transcription endpoint
Processing and analyzing the transcription result
Translating non-English audio into English

Let’s get started!

Watch Video

Overview of Whisper

Creating a Chatbot for a Restaurant

Introduction

Audio Transcription Translation

Building an Interactive Chatbot

Exploring Chat GPT

Fine tuning GPT 3 with a Custom Dataset

Generating Images

Getting Started with Open AI

Implementing Code Completion

Implementing Word Completion

Moderating Prompts with Moderating API

Performing Text Processing and Analysis

Summary and Next Steps

Understanding Prompt Engineering

Understanding Tokens and API Parameters

Using Word Embeddings For Dynamic Context

What is Generative AI

Key Capabilities

Watch Video

Introduction

Audio Transcription Translation

Building an Interactive Chatbot

Exploring Chat GPT

Fine tuning GPT 3 with a Custom Dataset

Generating Images

Getting Started with Open AI

Implementing Code Completion

Implementing Word Completion

Moderating Prompts with Moderating API

Performing Text Processing and Analysis

Summary and Next Steps

Understanding Prompt Engineering

Understanding Tokens and API Parameters

Using Word Embeddings For Dynamic Context

What is Generative AI

​Key Capabilities

Watch Video

Key Capabilities