1,737 questions with Azure AI Speech tags

Sort by: Updated
0 answers

Confused on custom neural voice docs

Hi all, I'd like to implement custom text to speech voices and I was pleased to see that azure offers different solutions, but i'm confused on the types of services. In the docs page I saw 3 types of custom speech methods: pro, lite and personal but in…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-14T16:18:01.1266667+00:00
Talkkit 0 Reputation points
edited a comment 2024-10-14T19:06:45.3633333+00:00
Talkkit 0 Reputation points
1 answer

When is the GA release for customization of Avatars in Microsoft TTS Avatar service and non-photorealistic options for custom avatars

Two part question on the TTS Avatars When is the TTS Avatar - customization of Avatars going to be GA ? How can we do non-photorealistic Avatars (i.e, Animated/Cartoon characters like the Microsoft Mesh)?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-02T16:52:53.35+00:00
Asgar Ali 0 Reputation points
commented 2024-10-14T15:50:53.53+00:00
AshokPeddakotla-MSFT 34,016 Reputation points
0 answers

How to use private endpoint for Azure speech resource in realtime

We have used below documenation which provide details of realtime transcription using speech SDK. https://zcusa.951200.xyz/en-us/azure/ai-services/speech-service/get-started-stt-diarization?tabs=linux&pivots=programming-language-python We…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-09T09:20:51.8533333+00:00
Ulhas Hulyal, Nilesh 0 Reputation points
commented 2024-10-14T13:22:29.3733333+00:00
santoshkc 8,945 Reputation points Microsoft Vendor
0 answers

Delay in Transcription (Multi-Device Conversation) Cognitive Speech

As the documentation says that multi device conversation is real time but running the sample code there is delay of about 12 - 15 second in transcribing. How can i make it real time??

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-14T09:27:08.78+00:00
Basil Ali Khan 0 Reputation points
commented 2024-10-14T13:17:14.1533333+00:00
santoshkc 8,945 Reputation points Microsoft Vendor
1 answer

Azure AI Speech Studio TextToSpeech with voice "AlloyTurboMultilingual" shows "Error 400 Synthesis failed. StatusCode: NotFound"

Inside of Azure AI Speech Studio when trying to generate speech with the voice "AlloyTurboMultilingual" or "NovaTurboMultilingual" the following error occurs: Response status code does not indicate success: 400 (Synthesis failed.…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-10T11:29:28.04+00:00
NSM 0 Reputation points
edited an answer 2024-10-14T11:36:21.2+00:00
santoshkc 8,945 Reputation points Microsoft Vendor
0 answers

Is there GRPC support for Speech to Text in Azure Speech SDK in java?

Hi, Is there GRPC support for Azure speech SDK? We are looking for this support for the Realtime Speech to Text feature. Is that support available in Java? If there is no GRPC support, what is the underlying architecture, and how is the voice streamed to…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,857 questions
asked 2024-10-10T12:56:31.7733333+00:00
Sai Vishnu Soudri 0 Reputation points
commented 2024-10-14T09:22:36.5533333+00:00
kothapally Snigdha (Quadrant Resource LLC) 10 Reputation points Microsoft Vendor
0 answers

Issue with Continuous Language Identification in Azure Speech SDK for Angular Application

We are currently using the "microsoft-cognitiveservices-speech-sdk" in our Angular application (version 14) for speech transcription and translation. The transcription and translation functionality is working as expected. However, we are…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-14T04:41:10.32+00:00
sanjay.bisht 0 Reputation points
commented 2024-10-14T07:33:53.7833333+00:00
romungi-MSFT 46,141 Reputation points Microsoft Employee
0 answers

Adjusting Audio Speed in Azure AI Speech

Is it possible to adjust the speed of audio generated using an OpenAI voice? I know that OpenAI's REST API supports a parameter for speed, but I couldn't find anything similar in the Azure AI Speech documentation. Thanks in advance!

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-09T09:09:12.2466667+00:00
RN Dev 0 Reputation points
commented 2024-10-14T06:49:34.7066667+00:00
Avinash Devarakonda 0 Reputation points Microsoft Vendor
1 answer

Can I set maximum number of participants to real-time diarization?

Hi, I follow the document below and success to distinguish the speaker with audio streaming by ConversationTranscriber Class. (I don't use voice signature so it shows Guest-1,…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-10T08:27:10.33+00:00
RES 0 Reputation points
commented 2024-10-14T06:45:11.5933333+00:00
santoshkc 8,945 Reputation points Microsoft Vendor
0 answers

speech api fails where Speech Studio succeeds?

I am using the Standard Tier, with a couple paragraphs of text and only a few ssml tags. The ssml pasted into Speech Studio renders correctly, even exports to an audio file correctly. The same ssml rendered through the python API causes the error below.…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-10T18:09:20.84+00:00
Jory 0 Reputation points
commented 2024-10-11T13:12:03.3933333+00:00
Jory 0 Reputation points
1 answer

I can't create a simple support ticket to increase my text to speech concurrency request limit

I want to increase my text to speech concurrency request and TPS limit but Microsoft wouldn't allow me to create a support ticket but instead wants me to buy a plan to contact support. I am already on Standard S0 Tier.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-10T06:55:21.78+00:00
Abas Oladosu 0 Reputation points
commented 2024-10-11T05:11:19.88+00:00
AshokPeddakotla-MSFT 34,016 Reputation points
0 answers

Why is speakTextAsync() occasionally silent?

I am using the following Javascript code to test Azure TTS. It works well, but about 1 in 10 times, when called, it produces no sound despite logging "Synthesis finished" as expected. The next time it is called, the sentence "Good morning,…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-08T23:31:19.63+00:00
Praxis Labs 20 Reputation points
commented 2024-10-11T04:42:37.2466667+00:00
navba-MSFT 24,465 Reputation points Microsoft Employee
1 answer

I am locked out of my account. I don't know what to do.

I just tried to log into my account and it says i am locked out and I should contact my support person. First, i do not know why i am being locked out. Second, i do not have a support person or who to contact about this. Any help please? See attached…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-10T22:43:58.57+00:00
Abas Oladosu 0 Reputation points
edited an answer 2024-10-11T03:04:23.6133333+00:00
navba-MSFT 24,465 Reputation points Microsoft Employee
0 answers

Transcription Denormalization.

Is there a way to "denormalize" Azure speech transcription, so it provides verbatim transcription (as close as possible, with word fillers, hesitations, repeats, etc)? I will also need word level timestamping and diarization. I am hoping there…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,857 questions
asked 2024-07-19T15:31:59.8+00:00
Alex Cohen 10 Reputation points
edited a comment 2024-10-10T22:21:48.79+00:00
hotrod 1 Reputation point
0 answers

I can't create a simple support ticket to increase my text to speech concurrency request limit

I want to increase my text to speech concurrency request and TPS limit but Microsoft wouldn't allow me to create a support ticket but instead wants me to buy a plan to contact support. I just need to increase my limit. Can someone please here from here?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-07T11:38:24.1+00:00
Abas Oladosu 0 Reputation points
edited a comment 2024-10-10T19:49:06.49+00:00
VasaviLankipalle-MSFT 17,281 Reputation points
0 answers

Do we have any API Support to get the cost estimation of Audio Translation?

Do we have any API Support to get the cost estimation of audio translation from language to another language by taking input as audio duration, source language, target language, etc., required input. If any API support in java, please help me Thanks…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-09T04:28:42.14+00:00
Ganesh P 0 Reputation points
commented 2024-10-10T14:16:10.7133333+00:00
Ganesh P 40 Reputation points
1 answer

looking for python code examples for fast transcription api

can you please share some python code examples for fast transcription api ? I can't find much...

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-09-24T20:17:31.5333333+00:00
WolfK001-2904 0 Reputation points
commented 2024-10-10T12:41:45.52+00:00
Pavankumar Purilla 105 Reputation points Microsoft Vendor
1 answer

azure speech to text only transcribe about 50% of the audio

I am using the speech to text but only able to transcribe about 50% of the audio. (ms word is able to transcribe the entire audio file) audio file is wav, only 2 min and 15 sec, size: 2112 kb this is my first time using this. Is there any setting that…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-03T23:54:13.0833333+00:00
Dan Nguyen 0 Reputation points
commented 2024-10-10T12:11:14.5633333+00:00
romungi-MSFT 46,141 Reputation points Microsoft Employee
0 answers

How to Stream Chat API Response Directly to Azure TTS Avatar for Continuous Speech?

I'm currently working on integrating Azure Text-to-Speech (TTS) with an avatar system. Right now, the avatar only starts speaking after receiving the entire chat response. Here is the relevant part of the code I'm using: const handleSpeak = () => { …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-10-08T07:25:20.47+00:00
Anandu K B 0 Reputation points
commented 2024-10-10T11:47:46.39+00:00
romungi-MSFT 46,141 Reputation points Microsoft Employee
0 answers

[Multi Device Conversation] - [Multi-Device Conversation][DotNet] Cannot set display name/Nickname when join conversation.

Hi, I tried to implement the code according to the example at the link: https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/quickstart/csharp/dotnet/multi-device-conversation/helloworld/Program.cs. But when I set display name…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,737 questions
asked 2024-09-26T03:49:53.6466667+00:00
Văn Chương Mai 0 Reputation points
commented 2024-10-10T11:42:08.4266667+00:00
romungi-MSFT 46,141 Reputation points Microsoft Employee