Note

See the Azure Cognitive Services for Speech documentation for the latest supported speech solutions.

Benefits of Using the Microsoft Speech Platform SDK 11

Speech is an effective and natural way for people to interact with applications, complementing or even replacing the use of mice, keyboards, controllers, and gestures. Because it is hands-free yet accurate, speech lets people stay productive and informed in situations where other interfaces are impractical.

Microsoft Speech Platform SDK 11

The Microsoft Speech Platform SDK 11 includes both managed-code and native-code application programming interfaces (APIs).

The Microsoft.Speech managed-code namespaces give you easy access to the advanced speech recognition and speech synthesis technologies supported by the Microsoft Speech Platform Runtime 11. The managed-code APIs expose a subset of the functionality of the native-code APIs, and that subset satisfies the majority of speech application scenarios.

The native-code APIs implement all the low-level details needed to control and manage the real-time operations of various speech engines.

You do not need to create your own engines for speech recognition and text-to-speech. Instead, you can download and install the Speech Platform Runtime 11 and any of the 26 Runtime Languages for speech development. See What’s New in the Microsoft Speech Platform SDK 11.

Speech Recognition

Speech recognition allows users to interact with and control your applications by speaking. Using the APIs in the Speech Platform SDK 11, you can acquire and monitor speech input, create speech recognition grammars that produce both literal and semantic recognition results, capture information from events that the speech recognizer generates, and configure and manage speech recognition engines. See Speech Recognition (Microsoft.Speech).
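To make the idea of a grammar with both literal and semantic results more concrete, the sketch below generates a small XML grammar. It assumes the XML format in question is the W3C Speech Recognition Grammar Specification (SRGS) 1.0 with "semantics/1.0" tags; the source does not name the format here. Python is used only to produce the XML text. The helper name build_grammar and the sample phrases are hypothetical, and the code does not call any Speech Platform API.

```python
import json
import xml.etree.ElementTree as ET

# W3C SRGS 1.0 namespace (assumed to be the grammar format meant above).
SRGS_NS = "http://www.w3.org/2001/06/grammar"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def build_grammar(rule_id, choices, lang="en-US"):
    """Return an SRGS XML grammar as a string.

    `choices` maps each spoken phrase (the literal result) to the value
    returned as its semantic result. Hypothetical helper for illustration.
    """
    # Serialize SRGS elements without a prefix (xmlns="...").
    ET.register_namespace("", SRGS_NS)

    def q(name):
        return f"{{{SRGS_NS}}}{name}"

    grammar = ET.Element(q("grammar"), {
        "version": "1.0",
        f"{{{XML_NS}}}lang": lang,
        "root": rule_id,
        "tag-format": "semantics/1.0",
    })
    rule = ET.SubElement(grammar, q("rule"), {"id": rule_id, "scope": "public"})
    one_of = ET.SubElement(rule, q("one-of"))
    for phrase, value in choices.items():
        item = ET.SubElement(one_of, q("item"))
        item.text = phrase  # what the user says
        tag = ET.SubElement(item, q("tag"))
        # json.dumps yields a safely quoted string literal for the tag script.
        tag.text = f"out = {json.dumps(value)};"
    return ET.tostring(grammar, encoding="unicode")


if __name__ == "__main__":
    print(build_grammar("direction", {"go up": "up", "go down": "down"}))
```

In this sketch the phrase "go up" is the literal result and the string "up" is the semantic result, so application logic can branch on a stable value even if the spoken wording changes later.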

Speech Synthesis (text-to-speech or TTS)

The speech synthesis APIs in the Speech Platform SDK 11 will have your application talking with just a few lines of code. You get control over many aspects of voice expression, including volume, speaking rate, and pitch. See Speech Synthesis (Microsoft.Speech).
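As a rough illustration of controlling volume, speaking rate, and pitch, the sketch below builds a short XML prompt. It assumes the markup is W3C Speech Synthesis Markup Language (SSML) 1.0, whose prosody element carries those three attributes; the source does not name the format here. Python is used only to generate the XML text. The helper name build_ssml and the sample sentence are hypothetical, and the code does not call any Speech Platform API.

```python
import xml.etree.ElementTree as ET

# W3C SSML 1.0 namespace (assumed to be the synthesis markup meant above).
SSML_NS = "http://www.w3.org/2001/10/synthesis"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def build_ssml(text, volume="medium", rate="medium", pitch="medium", lang="en-US"):
    """Return an SSML document (as a string) that wraps `text` in one <prosody>.

    Hypothetical helper for illustration. SSML 1.0 label values include
    "soft"/"loud" for volume, "slow"/"fast" for rate, "low"/"high" for pitch.
    """
    # Serialize SSML elements without a prefix (xmlns="...").
    ET.register_namespace("", SSML_NS)
    speak = ET.Element(f"{{{SSML_NS}}}speak", {
        "version": "1.0",
        f"{{{XML_NS}}}lang": lang,
    })
    prosody = ET.SubElement(speak, f"{{{SSML_NS}}}prosody", {
        "volume": volume,
        "rate": rate,
        "pitch": pitch,
    })
    prosody.text = text
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml("Your order has shipped.", volume="loud", rate="slow", pitch="high"))
```

Keeping the prompt as a document rather than as a sequence of API calls means the wording and voice settings can be edited without recompiling the application.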

Standards Compatible

For both speech recognition and speech synthesis, you get your choice of authoring formats. You can author content programmatically, using the members of the Microsoft.Speech namespaces. Or you can create industry-standard, XML-format speech documents for consumption by the Speech Platform Runtime 11. You can even alternate authoring formats based on your preferences and combine programmatic content with XML-format content.

Grammar Development Tools

The Speech Platform SDK 11 provides tools that you can use to debug and tune grammars that you create for use with the Speech Platform Runtime 11. The functionality of the Microsoft Grammar Development Tools includes grammar validation, phrase generation, confusability detection, batch-mode simulated speech recognition, analysis of simulated recognition results including graphs, and grammar compiling. The Grammar Development Tools will help you to identify problem areas that are hard to detect through standard speech application testing, and to identify problems earlier in the application development life cycle. See Microsoft Grammar Development Tools.

Cost Efficient

The Speech Platform SDK 11 can significantly reduce the amount of work necessary to implement speech in applications for Windows Server or other platforms that can leverage Microsoft's redistributable speech engines. By providing access to the Speech Platform Runtime 11 and engines for speech recognition and speech synthesis, the Speech Platform SDK 11 gives you the tools to create sophisticated speech applications while shortening development cycles and reducing development costs.