Azure AI Speech
Energize your apps with prebuilt, customizable, multilingual speech AI models.
OVERVIEW
Discover the latest Azure AI Speech capabilities
- Build voice-enabled, multilingual generative AI apps with fast transcriptions and natural-sounding voices.
- Enable AI agents with end-to-end speech, including customized transcription, voice, and avatars.
- Enable real-time, multi-language speech-to-speech translation and speech-to-text transcription of audio streams.
- Run AI models wherever your data resides. Deploy your apps in the cloud or at the edge with containers.
USE CASES
Develop generative AI apps with speech models
Build voice-enabled agents
Use foundation models along with customized audio-in and audio-out models to power agents with voice.
Transcribe speech to text
Transcribe call center or meeting conversations. Go global with audio captioning in more than 100 languages.
Convert text to speech
Build bots that speak naturally. Differentiate your brand with customized, realistic voices and speaking styles.
Use post-call analytics
Analyze audio or video call recordings to gain deep insights using foundation models in Azure AI Content Understanding.
Transcribe audio with OpenAI Whisper
Transform your call centers using the latest OpenAI Whisper model in Azure AI Speech or Azure OpenAI.
Build custom voices
Build natural-sounding voices with custom neural voice.
Build your avatars
Bring your brand to life using pre-built or custom avatars with natural-sounding voices.
Enable multilingual communication
Translate audio or video data from and into an ever-growing list of supported languages. Customize translations to your industry.
Embed speech
Use embedded speech to power on-device speech-to-text and text-to-speech scenarios where cloud connectivity is intermittent or unavailable.
SECURITY
Embedded security and compliance
34,000
Full-time equivalent engineers dedicated to security initiatives at Microsoft.
15,000
Partners with specialized security expertise.
>100
Compliance certifications, including over 50 specific to global regions and countries.
PRICING
Flexible pricing to meet your needs
Pay only for what you use, with no upfront costs. Azure AI Speech is billed on a pay-as-you-go basis.
RELATED PRODUCTS
Azure products work better together
Build comprehensive solutions using Azure AI Speech and other Azure AI products.
CUSTOMER STORIES
See what customers are building with Azure AI Speech
FAQ
Frequently asked questions
- Speech supports an ever-growing set of languages. For supported languages, please refer to the current list.
- Customers are building interesting applications using Azure AI services. Get started with Azure AI Speech analytics in Azure AI Foundry for use cases including conversational AI, post-call analytics, and video summarization.
Next steps
Choose the Azure account that’s right for you
Pay as you go or try Azure free for up to 30 days.