laiyangdeli

浏览: 1515759 次
性别:
来自: 南京

最近访客更多访客>>

bitzgx

u012363178

二冲2010

u012361334

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

Nuance - Dragon Mobile SDK - Speech Kit

博客分类：

TTS&ASR
iOS

Speech Kit Basics

The Speech Kit framework allows you to add voice recognition and text-to-speech services to your applications easily and quickly. This framework provides access to speech processing components hosted on a server through a clean asynchronous network service API, minimizing overhead and resource consumption. The Speech Kit framework lets you provide fast voice search, dictation, and high-quality, multilingual text-to-speech functionality in your application.

Speech Kit Architecture

The Speech Kit framework is a full-featured, high-level framework that automatically manages all the required low-level services.

Speech Kit Architecture

At the application level, there are two main components available to the developer: the recognizer and the text-to-speech synthesizer.

In the framework there are several coordinated processes:

The framework fully manages the audio system for recording and playback.

The networking component manages the connection to the server and, at the start of a new request, automatically re-establishes connections that have timed-out.

The end-of-speech detector determines when the user has stopped speaking and automatically stops recording.

The encoding component compresses and decompresses the streaming audio to reduce bandwidth requirements and decrease latency.

The server system is responsible for the majority of the work in the speech processing cycle. The complete recognition or synthesis procedure is performed on the server, consuming or producing the streaming audio. In addition, the server manages authentication as configured through the developer portal.

Using Speech Kit

You can use the Speech Kit framework in the same way that you use any of the standard iPhone frameworks such as Foundation or UIKit. The only difference is that the Speech Kit framework is a static framework and is entirely contained in your compiled application. This does not affect you as a developer except that you must be certain that you and any other developers working on your application all use the same release of Speech Kit. You can easily ensure this by including the entire framework in your application and your source control.

The Speech Kit framework depends on some core iPhone OS frameworks that you must include as dependencies in your application so that they are available at run time. In addition to Foundation, you must add the System Configuration and Audio Toolbox frameworks to your Xcode project, as follows:

Start by selecting the Frameworks group within your project.

Then right-click or command-click Frameworks and, from the menu, select Add ‣ Existing frameworks... .

Finally, select the required frameworks and click Add . The frameworks appear in the Frameworks folder (see figure below).

To start using the SpeechKit framework, add it to your new or existing project, as follows:

Open your project and select the group where you want the Speech Kit framework to be stored, for example:file:Frameworks .

From the menu select Project ‣ Add to Project....

Then find the framework SpeechKit.framework where you extracted the Dragon Mobile SDK and select Add .

To ensure that the Speech Kit framework is stored in your project and is not referencing the location where you found it, select Copy items... and then select Add .

You should now see the Speech Kit framework in your project, which you can expand to view the public headers.

Frameworks Required for Speech Kit

The Speech Kit framework provides one top-level header, which provides access to the complete API including classes and constants. You should import the Speech Kit header in all source files where you intend to use Speech Kit services:

#import <SpeechKit/SpeechKit.h>

You are now ready to start using recognition and text-to-speech services.

Speech Kit Errors

While using the Speech Kit framework, you will occasionally encounter errors. In this framework there is a custom NSError domain, SKSpeechErrorDomain , which includes special error codes and messages to support your development and use. In all cases, errors have a valid localized description set, which may prove useful in development and, in some cases, may be presented to the user.

There are effectively two types of errors that can be expected in this framework.

The first type are service connection errors and include the SKServerConnectionError and SKServerRetryError codes. These errors indicate that there is some kind of failure in the connection with the speech server. The failure may be temporary, and it can be solved by retrying the query. The error may be the result of an authorization failure or some other network problem.

The second type are speech processing errors and include the SKRecognizerError and SKVocalizerError codes. These errors indicate a problem with the speech request, ranging from a text format issue to an audio detection failure.

It is essential to always monitor for errors, as signal conditions may generate errors even in a correctly implemented application. The application’s user interface needs to respond appropriately and elegantly to ensure a robust user experience.

分享到：

Nuance - Dragon Mobile SDK - Speech Kit ... | Nuance’s Dragon ID Lets You Unlock Your ...

2012-07-02 15:57
浏览 1441
评论(0)
分类:移动开发
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

Nuance - Dragon Mobile SDK - Speech Kit

Speech Kit Basics

Speech Kit Architecture

Using Speech Kit

Speech Kit Errors

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

Nuance - Dragon Mobile SDK - Speech Kit

Speech Kit Basics

Speech Kit Architecture

Using Speech Kit

Speech Kit Errors

评论

发表评论

相关推荐

Voice detection for Android

Google hired one of Nuance soft engineers to help work around all Nuance patents

The Voice Browser Working Group

Nuance网站

Nuance HTTP Services

Nuance - Dragon Mobile SDK - Speech Kit Library Guide (for Android)

Nuance’s Dragon ID Lets You Unlock Your Smartphone Or Tablet By Talking To It

Android 4.1 Jelly Bean adds Offline Voice Typing

iOS的开源库和开源项目

[iOS开发教程-5]Create Indexed UITableView

[iOS开发教程-4]Create a UITabBarController from scratch

[iOS开发教程-3]Create a Detail View Page using UIImageView, UITextView and UILabel

[iOS开发教程-2]Customize that UIViewCell – Part 1: Using Interface Builder

[iOS开发教程-1]Hello UITableView!

Window 7主机与VMware中Mac虚拟机共享文件夹

VMware 8.02虚拟机安装MAC lion 10.7.3教程 附送原版提取镜像InstallESD.iso！

[iOS]深入浅出 iOS 之多线程 NSThread

Object-C中的Selector概念

CodeBlocks集成Objective-C

用ultraEdit打造自己的Objective-C IDE for Windows

最近访客更多访客>>

VMware 8.02虚拟机安装MAC lion 10.7.3教程附送原版提取镜像InstallESD.iso！