2,349 questions with Azure Speech in Foundry Tools tags

0 answers

Azure Batch Transcription: Consistent ~12h failure with "InvalidUri" for diarization-enabled jobs

Problem: I'm using the Azure Speech-to-Text Batch Transcription API with diarization enabled (maxSpeakers: 2, locale: ja-JP). A subset of jobs…

asked

Kajisha Hiroshi 0 Reputation points

answered

AI answer

2 answers

Unable to get Azure Speech Service authorization token. REST API always returns 401 for Japan West region.

I am trying to use Azure Speech Service Text-to-Speech through the REST API. My Speech resource was created in the Japan West region. I followed the official documentation and tried to get an authorization token with this endpoint: POST…

asked

Lancy Wu 0 Reputation points

answered

Karnam Venkata Rajeswari 3,830 Reputation points • Microsoft External Staff • Moderator

0 answers

Base Model 20250808 Internal Error when training with Structure Text file

When I train a custom model using a structured text file and an output file with the 20250808 build (English), the model fails with an internal error. The structured text and output files were error-free when I uploaded them. I was successfully…

asked

Kaiyip Ho 0 Reputation points

commented

Anshika Varshney 13,320 Reputation points • Microsoft External Staff • Moderator

2 answers

Azure AI Speech doesn't load at all, ever

I tried a hard reset and incognito mode; nothing works. I am using Google Chrome. I opened an Azure account today and have been struggling since the very beginning. I even tried to run it in VS Code or make my own small HTML program just so I wouldn't have to deal with this thing. I…

asked

AJ 0 Reputation points

commented

Anshika Varshney 13,320 Reputation points • Microsoft External Staff • Moderator

1 answer

How do I download a Speech Studio Voice Gallery voice just for my TTS programs?

I use text to speech programs to help my ADHD brain read long documents, like textbooks from back in college or contracts now that I'm a "real job" adult. The default TTS voices (Mark, David, and Zira) sound robotic and grating, but the…

asked

Andrew Welker 0 Reputation points

answered

Irfan Usman 0 Reputation points

3 answers One of the answers was accepted by the question author.

In the Azure TTS speech playground, there is no way to generate speech

I'm in the text to speech area of the speech playground. It feels like there should be a box to enter text, but there isn't. There is, however, a "try it out" option in each voice sample. Entering text in that area and clicking play produces no sound.…

asked

Peter 20 Reputation points

answered

SRILAKSHMI C 19,110 Reputation points • Microsoft External Staff • Moderator

2 answers

Azure Speech Service: ConversationTranscriber via Private Endpoint returns 0 segments with 140s session_stopped delay - canadacentral

Service: Azure Cognitive Services, Speech Service (azure-cognitiveservices-speech==1.46.0, Python), AKS canadacentral, Private Endpoint. Scenario: Using ConversationTranscriber with the universal/v2 real-time endpoint accessed via a Cognitive Services…

asked

Amandeep Sadioura 0 Reputation points

answered

Harshitha Eligeti 10 Reputation points • Microsoft External Staff • Moderator

3 answers One of the answers was accepted by the question author.

How to implement AEC on iOS using 'MicrosoftCognitiveServicesSpeechEmbedded-iOS' (1.49.1)

What is the recommended way to implement acoustic echo cancellation (AEC) on iOS? The SDK used is 'MicrosoftCognitiveServicesSpeechEmbedded-iOS' (1.49.1). The specific requirement is to keep continuous STT running at all times while TTS audio is playing through the speaker.…

asked

Yin Chenqiao 20 Reputation points

accepted

Yin Chenqiao 20 Reputation points

2 answers

Batch Transcription API – Excessive Processing Time (24+ hours for 2h 24m audio)

We are experiencing an unexpected and significant delay with the Azure Cognitive Services Batch Transcription API. Issue: An audio file of approximately 2 hours and 24 minutes in length has been submitted for batch transcription and remains in a…

asked

Yoichi NARUSE 0 Reputation points

answered

Manas Mohanty 17,185 Reputation points • Microsoft External Staff • Moderator

2 answers

AI Foundry Fine-Tuning: 403 Error When Uploading Output Format Training Data with BYOS Enabled

I'm using AI Foundry to fine-tune my custom speech model. I need content logging, and I want that data stored in my BYOS (Bring Your Own Storage) storage account. Once I enabled that, I became unable to upload training data of type Output…

asked

Hristijan Stefov 0 Reputation points

answered

Manas Mohanty 17,185 Reputation points • Microsoft External Staff • Moderator

2 answers

Improving the speed of speech recognition processing in Disconnected Azure Speech Containers

I am using a disconnected container for Azure Speech. Please let me know if there is a way to improve the response time of speech-to-text processing. The current system returns the final results 3 seconds after a 5-second speech segment when performing…

asked

tomoe 25 Reputation points

commented

tomoe 25 Reputation points

1 answer

Azure OpenAI Realtime client_secrets returns 500 when input_audio_transcription is included (Sweden Central)

We are seeing a consistent server-side failure in Azure OpenAI Realtime when requesting client secrets with input_audio_transcription enabled. Environment: Region: Sweden Central; Resource: LBBD-OpenAI-Sweden-Dev; Subscription:…

asked

James Morgan 0 Reputation points

commented

James Morgan 0 Reputation points

2 answers

AudioEchoCancellation with PersonalVoice is not working on the Voice Live API

AudioEchoCancellation works with AzureStandardVoice but not with PersonalVoice when using the azure.ai.voicelive.models Voice Live Python SDK.

asked

Arne De Proft 0 Reputation points • Microsoft Employee

commented

Manas Mohanty 17,185 Reputation points • Microsoft External Staff • Moderator

4 answers

Crash when app enters background

#37
0 libc++abi.dylib __cxa_get_exception_ptr +88
1 libc++abi.dylib __cxa_throw +92
2 libc++.1.dylib std::__1::__throw_system_error[abi:ne190102] +92
3 libc++.1.dylib std::__1::__throw_system_error(int, char const*)…

asked

Li Wangbiao 0 Reputation points

commented

Thanmayi Godithi 10,570 Reputation points • Microsoft External Staff • Moderator

2 answers One of the answers was accepted by the question author.

zh-CN voices: mstts:express-as styles and paralinguistic tags produce identical output regardless of value

curl --location --request POST "https://${SPEECH_REGION}.tts.speech.microsoft.com/cognitiveservices/v1" \
  --header "Ocp-Apim-Subscription-Key: ${SPEECH_KEY}" \
  --header 'Content-Type: application/ssml+xml' \
  --header…

asked

Ming-Li Lin 20 Reputation points

accepted

Ming-Li Lin 20 Reputation points

1 answer

Cognitive services STT batch transcription: incomplete/cut-off transcripts

Hi, The URLs/sources in this ticket have been replaced with placeholders, but are of course available for support upon request. We are using batch transcription through the endpoint:…

asked

STT 0 Reputation points

edited a comment

STT 0 Reputation points

1 answer One of the answers was accepted by the question author.

Japanese Voice-to-Text: Preventing Unwanted Kanji Transcription for Names

When using Azure Speech to Text for batch transcription of conversations in Japanese, there is an issue with person names being transcribed into incorrect Kanji characters. A custom speech model has been created to handle specific industry terms, but…

asked

Thierry Tropée 40 Reputation points

edited a comment

danniel lee 0 Reputation points

3 answers

Loading subscriptions

I have set up an account and I am trying to use Azure Speech Studio. But when I go there, it just gets stuck on 'Loading subscriptions'. Is it related to conflicts between multiple accounts or something similar?

asked

Eddie Geadley 5 Reputation points

answered

K. Borowski 0 Reputation points

2 answers One of the answers was accepted by the question author.

Custom Avatar

I am unable to create a custom avatar for my subscription. I filled out the access approval form and haven't received a response yet.

asked

Amrutha M 20 Reputation points

accepted

Amrutha M 20 Reputation points

2 answers One of the answers was accepted by the question author.

Changing ProfanityFilter Settings in Disconnected Azure Speech Containers

I am using a disconnected container for Azure Speech.
docker image ls
REPOSITORY TAG IMAGE ID CREATED …

asked

tomoe 25 Reputation points

commented

Karnam Venkata Rajeswari 3,830 Reputation points • Microsoft External Staff • Moderator

URL: https://learn.microsoft.com/en-us/answers/tags/55/azure-speech

Azure Speech in Foundry Tools - Microsoft Q&A
