http://dmfs.org/handsfree/?engines
http://dmfs.org/handsfree/?assistants
Most, if not all, recent Android phones offer at least two speech recognition engines. The first is the Nuance SREC package, which is included in the Android sources and hopefully on all Android devices. The second is Google Voice Search. If you haven't installed any voice recognition software, chances are good that your device uses Google Voice Search by default, but some users may have to install it themselves (look for Google Voice Search in the Market).
Note: Handsfree-Lite does not feature the native engine.
The following comparison does not claim to be complete or even true under all circumstances. It is just what I noticed during development and may help the user to decide which engine to use.
In this comparison I will refer to the Nuance engine as the native engine and to the Google engine as the default engine, regardless of what is actually installed on your device. In the following list, the first paragraph under each heading always refers to the native engine and the second one to the default engine.
languages
As of Android 2.2, the native engine ships with the en-US locale only, so only English words are reliably recognized. You can, however, try to transcribe a word from your language using English pronunciation (e.g. 'unnamen' works amazingly well for the German word 'annehmen' to answer calls ;-) ).
Handsfree sets the default engine to your default language (the one configured in your phone's settings). This should work with most languages, but please understand that I can't guarantee that any particular language is supported. You may try installing 3rd party software to add support for your language.
offline recognition
The native engine is implemented on your device and does not need a working internet connection.
The default engine uses Google's computing power to recognize your spoken words, so it doesn't work without an internet connection. This can be a problem in areas with weak infrastructure. Also watch your costs if you don't have a flat rate!
accuracy
The native engine is grammar based and will always return one of the commands you entered, even if you didn't say anything or said something completely different. Handsfree accepts the result only if the engine is very confident that the words match. Still, under some circumstances (e.g. music, traffic noise, people talking, reverb or echoes) this engine may understand the wrong command and Handsfree will take the wrong action. This may also happen when it is quiet but your pronunciation is too fast or not clear enough. If this happens to you too often, please contact me.
The default engine is a free text engine. This means it tries to recognize literally what you said. Under bad conditions (see the previous paragraph), and even under good conditions, it can understand a different word than the one you said. That's why Handsfree features assistants to learn all those alternatives (see assistants for details). The bright side is that the probability of false actions is very low (depending on the commands you've chosen).
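The two acceptance strategies described above can be sketched roughly as follows. This is only an illustration in Python, not Handsfree's actual code: the function names, the 0-to-1 confidence scale, the 0.8 threshold and the case-insensitive comparison are all assumptions made for the example.

```python
from typing import Dict, Optional, Set

# Hypothetical value: the text only says the engine must be "very confident".
CONFIDENCE_THRESHOLD = 0.8


def accept_grammar_result(command: str, confidence: float,
                          threshold: float = CONFIDENCE_THRESHOLD) -> Optional[str]:
    """Grammar-based (native) engine: it always returns one of the configured
    commands, so low-confidence matches have to be rejected by the caller."""
    return command if confidence >= threshold else None


def accept_free_text_result(recognized: str,
                            learned: Dict[str, Set[str]]) -> Optional[str]:
    """Free-text (default) engine: the transcript can be anything, so it only
    triggers a command if it was previously learned as an alternative for it."""
    text = recognized.strip().lower()
    for command, alternatives in learned.items():
        if text in alternatives:
            return command
    return None


if __name__ == "__main__":
    # Made-up alternatives for a single "answer" command.
    learned = {"answer": {"answer", "anser", "and sir"}}
    print(accept_grammar_result("answer", 0.95))
    print(accept_grammar_result("answer", 0.40))
    print(accept_free_text_result("And sir", learned))
    print(accept_free_text_result("hello there", learned))
```

The sketch shows why the failure modes differ: the grammar-based path can only fail by being too confident about the wrong command, while the free-text path ignores anything that was never learned.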
speed
The native engine is pretty fast.
The default engine has to transmit some data over the internet and wait for the results. This takes some time.
user interface
The native engine has no user interface by default and does not show any annoying popups.
The default engine always opens a popup dialog (you know, the one with the microphone). While an incoming call is ringing, this dialog blocks the accept and reject buttons on the screen, so you first have to cancel the recognition dialog in order to manage your call manually. Even worse, if it didn't understand you (because you didn't say anything), it sometimes shows a dialog you have to acknowledge, which prevents Handsfree from starting another recognition.
recommendation
If you're comfortable with using English words, try the native engine. It does its job pretty well. If you get too many wrong results, use the default engine.
assistants
The default engine on most Android devices (see engines) is a free text engine. It tries to recognize exactly what you said, word for word. Often this doesn't work. Reasons might be fast or unclear pronunciation, or music, traffic or any other noise in the background. The result is a variety of recognized words even though you always say the same command.
Handsfree features three assistants to learn all the words the recognition engine returns for your commands (see the screenshot below).
To teach Handsfree your commands as well as possible, start one of the assistants and press the 'Learn command' button. Then say your command as if you were receiving a call. The assistant shows the recognized word and adds it to the list.
Repeat the procedure multiple times and under different conditions: indoors, outdoors, in your car, on your bike, with soft music in the background, with and without a wired headset, in the morning, during a marathon and so on. You will end up with a long list of words. The more you train Handsfree, the better the result will be during an actual call.
If you accidentally recorded something that wasn't meant as a command, you can always remove it from the list by touching and holding it until a context menu pops up. This menu has an option to delete the word or even the whole list (in case you want to change your command).
If you get warnings about connection problems, wait a few seconds or minutes and try again.
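The learning workflow above amounts to maintaining a list of accepted transcripts per command. Here is a minimal sketch in Python, assuming one list per assistant. The class name `CommandAssistant`, its methods, and the lowercase normalization are hypothetical and are not taken from Handsfree itself.

```python
from typing import List


class CommandAssistant:
    """Hypothetical model of one assistant: it collects every transcript the
    engine returned while the user was training a single command."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.words: List[str] = []  # kept in learning order, like the on-screen list

    @staticmethod
    def _normalize(text: str) -> str:
        # Assumption: comparison ignores case and surrounding whitespace.
        return text.strip().lower()

    def learn(self, recognized: str) -> bool:
        """Add a recognized transcript; return False if it is empty or already known."""
        word = self._normalize(recognized)
        if not word or word in self.words:
            return False
        self.words.append(word)
        return True

    def remove(self, word: str) -> None:
        """Delete a single entry, e.g. something recorded by accident."""
        word = self._normalize(word)
        if word in self.words:
            self.words.remove(word)

    def clear(self) -> None:
        """Delete the whole list, e.g. before switching to a new command."""
        self.words.clear()

    def matches(self, recognized: str) -> bool:
        """True if the transcript is one of the learned alternatives."""
        return self._normalize(recognized) in self.words


if __name__ == "__main__":
    assistant = CommandAssistant("answer")
    for heard in ["answer", "Anser", "and sir", "answer"]:
        assistant.learn(heard)
    print(assistant.words)
    assistant.remove("and sir")
    print(assistant.matches("and sir"))
```

Each training round under new conditions simply calls `learn` again, so the list only grows with transcripts that have not been seen before.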