Perform intent recognition with the Azure Speech SDK

Category: Other | Posted 2021-11-18 16:20:09

To use a Language Understanding model from the Speech SDK, your code should follow this pattern:

[Diagram: a SpeechConfig and an AudioConfig are used by an IntentRecognizer object, which consumes a Language Understanding model.]

1. Use a SpeechConfig object to encapsulate the information required to connect to your Language Understanding prediction resource (not a Speech resource). Specifically, the SpeechConfig must be configured with the location and key of the Language Understanding prediction resource.
2. Optionally, use an AudioConfig to define the input source for the speech to be analyzed. By default, this is the default system microphone, but you can also specify an audio file.
3. Use the SpeechConfig and AudioConfig to create an IntentRecognizer object, and add the model and the intents you want to recognize to its configuration.
4. Use the methods of the IntentRecognizer object to submit utterances to the Language Understanding prediction endpoint. For example, the RecognizeOnceAsync() method submits a single spoken utterance.
5. Process the response. In the case of the RecognizeOnceAsync() method, the result is an IntentRecognitionResult object that includes the following properties:
- Duration
- IntentId
- OffsetInTicks
- Properties
- Reason
- ResultId
- Text
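
The first four steps above might be sketched in Python roughly as follows. This is a minimal sketch, not the article's own code: it assumes the `azure-cognitiveservices-speech` package is installed, and the function name `recognize_intent_once` and its parameters are hypothetical. The Python SDK uses snake_case names, so `RecognizeOnceAsync()` appears here as `recognize_once_async()`.

```python
def recognize_intent_once(key, region, app_id, intent_names, audio_file=None):
    """Submit one spoken utterance and return the SDK's intent recognition result.

    key, region: key and location of the Language Understanding prediction resource.
    app_id: ID of the Language Understanding app (model).
    intent_names: names of the intents to recognize, as defined in that app.
    audio_file: optional path to an audio file; the default microphone is used if omitted.
    """
    # Imported lazily so this module can be loaded where the SDK is not installed.
    import azure.cognitiveservices.speech as speechsdk

    # Step 1: connection details for the prediction resource (not a Speech resource).
    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)

    # Step 2: choose the audio input source.
    if audio_file:
        audio_config = speechsdk.audio.AudioConfig(filename=audio_file)
    else:
        audio_config = speechsdk.audio.AudioConfig(use_default_microphone=True)

    # Step 3: create the recognizer, then add the model and the intents to recognize.
    recognizer = speechsdk.intent.IntentRecognizer(
        speech_config=speech_config, audio_config=audio_config
    )
    model = speechsdk.intent.LanguageUnderstandingModel(app_id=app_id)
    recognizer.add_intents([(model, name) for name in intent_names])

    # Step 4: submit a single utterance and wait for the result.
    return recognizer.recognize_once_async().get()
```

A call such as `recognize_intent_once(key, region, app_id, ["GetTime", "GetDate"])` would listen on the default microphone; the intent names in that call are placeholders and must match intents defined in your own model.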

If the operation was successful, the Reason property has the enumerated value RecognizedIntent, and the IntentId property contains the top intent name. Full details of the Language Understanding prediction can be found in the Properties property, which includes the full JSON prediction.
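
As an illustration of working with the full JSON prediction, the sketch below pulls out the top intent and any entities. The sample payload is hypothetical, and its field names (`query`, `topScoringIntent`, `entities`) are an assumption about the response shape; check them against what your own model actually returns. In the Python SDK, the JSON string is exposed on the result as `intent_json`.

```python
import json

# Hypothetical payload for illustration only; real responses depend on your model.
SAMPLE_PREDICTION = """
{
  "query": "what time is it in London",
  "topScoringIntent": {"intent": "GetTime", "score": 0.97},
  "entities": [{"entity": "london", "type": "Location"}]
}
"""


def summarize_prediction(prediction_json):
    """Extract the query, top intent, score, and entities from a prediction JSON string."""
    prediction = json.loads(prediction_json)
    top = prediction.get("topScoringIntent", {})
    # Each entity is reduced to a (type, value) pair; missing fields become None.
    entities = [(e.get("type"), e.get("entity")) for e in prediction.get("entities", [])]
    return {
        "query": prediction.get("query"),
        "intent": top.get("intent"),
        "score": top.get("score"),
        "entities": entities,
    }


summary = summarize_prediction(SAMPLE_PREDICTION)
print(summary["intent"], summary["entities"])
```

Using `.get()` throughout keeps the helper from raising when a field is absent, which matters if no entities were detected.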

Other possible values for Reason include RecognizedSpeech, which indicates that the speech was successfully transcribed (the transcription is in the Text property) but no matching intent was identified. If the Reason is NoMatch, the audio was successfully parsed but no speech was recognized. If the Reason is Canceled, an error occurred; in that case, you can check the Properties collection for the CancellationReason property to determine what went wrong.
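
The four outcomes described above can be handled with a simple dispatch on the reason. The sketch below is an illustration under stated assumptions: `describe_result` is a hypothetical helper, and `FakeReason` is a stand-in enum used only so the example runs without the SDK or a microphone. It relies on the reason being an enum whose member names match the values listed above.

```python
from enum import Enum
from types import SimpleNamespace


def describe_result(result):
    """Return a short message for each possible recognition outcome."""
    reason = result.reason.name  # assumes the reason is an Enum member
    if reason == "RecognizedIntent":
        return f"Intent '{result.intent_id}' recognized for: {result.text}"
    if reason == "RecognizedSpeech":
        return f"Speech transcribed but no intent matched: {result.text}"
    if reason == "NoMatch":
        return "Audio was processed, but no speech was recognized."
    if reason == "Canceled":
        return "Recognition was canceled; inspect the cancellation details."
    return f"Unhandled reason: {reason}"


# Stand-in for the SDK's reason enum (hypothetical, for demonstration only).
class FakeReason(Enum):
    RecognizedIntent = 1
    RecognizedSpeech = 2
    NoMatch = 3
    Canceled = 4


fake = SimpleNamespace(
    reason=FakeReason.RecognizedIntent, intent_id="GetTime", text="What time is it?"
)
print(describe_result(fake))
```

With the real Python SDK you would more likely compare `result.reason` against `speechsdk.ResultReason` members directly, and read `result.cancellation_details` in the Canceled branch.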

Use the Speech and Language Understanding Services

https://docs.microsoft.com/en-us/learn/modules/use-language-understanding-speech/4-exercise-use-speech-services

Reposted from blog.csdn.net/figosoar/article/details/119754694
