List of speech recognition software
The following list presents notable speech recognition software with a brief synopsis of characteristics.
Open source
- CMU Sphinx – open source under a BSD license.
- Julius – BSD-style license; distributed mainly with Japanese language models.
- simon – GPL; uses Julius and HTK.
- iATROS – released under a GPL license.
- RWTH ASR – QPL-style license.
Open source acoustic models and speech corpus
Macintosh
- Dragon Dictate for Mac – From Nuance Communications, renamed from MacSpeech Dictate ca. 2010.
- MacSpeech Dictate Medical – Dictation product for Intel-based Macintosh with included vocabularies for 54 medical and dental specialties.[1] Developed by MacSpeech; acquired by Nuance Communications in 2010.
- MacSpeech Dictate Legal – Dictation product for Intel-based Macintosh with a vocabulary of legal terms.[2] Developed by MacSpeech; acquired by Nuance Communications in 2010.
- MacSpeech Scribe – By Nuance Communications. Transcription product for automatically transcribing recorded dictation into text.
- iListen – Product from MacSpeech, developed and supported for PowerPC-based Macintosh until ca. 2009.
- Speakable items – Included with Mac OS ca. 1993 onward. Apple's speech synthesis and recognition technology is collectively called PlainTalk.
- ViaVoice – Product from IBM, developed and supported until ca. 2007.
- Voice Navigator – First voice control system for a graphical user interface, released by Articulate Systems in 1989.
Mobile devices / smartphones
Many cell phone handsets have basic dial-by-voice features built in, as do smartphones such as the iPhone and BlackBerry. A number of third-party apps add natural-language speech recognition, including:
- Sonic Cloud Online Speech to text services
- Dragon Dictation
- Dragon Search
- Google Voice Search
- Bing voice search
- Siri Personal Assistant
- Shoutout
- DriveSafe.ly Speech Recognition
- Vlingo
- Jeannie (Voice Actions) by Pannous for Android
- Speaktoit Assistant for Android
- Ziri Assistant Beta for Android
Windows
Windows 7
Windows 7 built-in speech recognition
Windows Speech Recognition, by Microsoft, is the speech recognition system built into Windows Vista and Windows 7. Both operating systems include version 8.0 of the Microsoft speech recognition engine. Speech recognition is available only in English, French, Spanish, German, Japanese, Simplified Chinese, and Traditional Chinese.
Add-ons for Windows 7 speech recognition
- VoiceAttack[5] – used primarily by the gaming community for hands-free keyboard and mouse input in Windows 7, Windows Vista and Windows XP. Its popularity lies mainly in its ease of use and its extended feature set, which includes the ability to create multi-threaded macros.
- Voice Finger – software for Windows Vista and Windows 7 that improves the Windows speech recognition system by adding several extensions to accelerate and improve the mouse and keyboard control.
- WSRToolkit – adds dictionaries, macros and other features similar to Dragon
- Trigramtech – adds features for medical users similar to Dragon[6]
- Vocola – a macro language [7]
Windows Speech Recognition is available only when its language matches the language of the operating system. In Windows Vista Ultimate, the operating system language can be changed by installing a language pack from Windows Update. If the installed language pack is for a language that Windows Speech Recognition supports, speech recognition can be used in that language once it is also the operating system language.
See the Microsoft support article "The Windows Speech Recognition language must be the same as the operating system language in Windows Vista": http://support.microsoft.com/kb/934377
See also "How do I get additional language files?": http://windows.microsoft.com/en-us/windows-vista/How-do-I-get-additional-language-files
Note: Multilingual User Interface packs (MUIs) require a license and are available only with Windows Vista Ultimate and Windows Vista Enterprise. On Windows Vista Ultimate, MUIs can be downloaded through Windows Update. On Windows Vista Enterprise, the system administrator can provide information about installing additional languages. Windows Speech Recognition cannot switch between languages.
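The availability rule described above can be summarized as a small check. The following Python sketch is only an illustrative model of that rule, not part of any Microsoft API; the function name `speech_recognition_available` and the plain-English language labels are assumptions made for the example, and the supported-language set is taken from the list given for the built-in recognizer.

```python
# Languages named in this article as supported by Windows Speech Recognition.
SUPPORTED_LANGUAGES = frozenset({
    "English", "French", "Spanish", "German", "Japanese",
    "Simplified Chinese", "Traditional Chinese",
})


def speech_recognition_available(os_language: str, recognizer_language: str) -> bool:
    """Model the rule: the recognizer language must be supported
    and must equal the operating system language.

    Hypothetical helper for illustration; it does not query Windows.
    """
    return (
        recognizer_language in SUPPORTED_LANGUAGES
        and os_language == recognizer_language
    )


if __name__ == "__main__":
    # Matching, supported language: the rule is satisfied.
    print(speech_recognition_available("German", "German"))
    # Supported recognizer language, but the OS language differs.
    print(speech_recognition_available("English", "German"))
    # Languages match, but the language is not on the supported list.
    print(speech_recognition_available("Italian", "Italian"))
```

Under this model, installing a language pack matters only because it allows the operating system language to be changed to one of the supported languages.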
Windows 7 third-party speech recognition
- Sonic Extractor from Digital Syphon – Supports 22 languages. Focus on broadcasting and telephony. Version 3.0 supports 64-bit Windows XP, Vista, and Windows 7, as of January 2012. Performs high-speed automated speech transcription and is multithreaded, processing 4 or 8 files or audio streams at a time on a single appliance.
- Dragon NaturallySpeaking from Nuance Communications – Successor to the older DragonDictate product. Focus on dictation. 64-bit Windows support since version 10.1.
- SpeechMagic – Formerly owned by Philips; acquired by Nuance Communications. Medical industry focus according to Frost & Sullivan. Standalone or embedded.[8][9]
- VoxCommando – Controls many media programs on Windows Vista and Windows 7, including XBMC, iTunes, MediaMonkey, Windows Media Center, and Skype, as well as many others through EventGhost. All spoken commands can be customized, and users can create macros, launch applications, and perform web searches. VoxCommando scans the media library for music (and, with XBMC, TV and movie titles), so media can be requested by name or partial name. Free to try; the full version costs $25. The demo mode is fully functional but must be closed and restarted after a fixed number of commands are issued.
- Tazti – Multi-function software with versions for Windows 7, Vista and Windows XP. Facilitates controlling almost any desktop application via the tazti speech recognition API; creating custom speech commands; playing multimedia; playing PC games by talking to a PC; voice search; voice bookmark management; internet navigation; and voice control of the iTunes music player. Includes a lite dictation capability.[10]
Windows XP or 2000 only
- e-Speaking[11] – Software for Windows XP that builds on the Microsoft Speech API, adding the ability to create commands that perform custom actions.
- Microsoft Speech API – Speech recognition functionality included as part of Microsoft Office and on Tablet PCs running Microsoft Windows XP Tablet PC Edition. It can also be downloaded as part of the Speech SDK 5.1 for Windows applications, but since that is aimed at developers building speech applications, the pure SDK form lacks any user interface, and thus is unsuitable for end users.
Programs for controlling a computer's screens and desktop applications with claps or words
- Clap Commander – Human-computer interface based on hand-clap recognition, for controlling a computer from another part of the room by clapping. Runs on Windows XP, Windows Vista and Windows 7.
- Tazti – Multi-function software with versions for Windows 7, Vista and Windows XP. Facilitates controlling almost any desktop application via the tazti speech recognition API; creating custom speech commands; playing PC games by talking to a PC; voice search; voice bookmark management; internet navigation; and voice control of the iTunes music player. Includes a lite dictation capability.[12][13]
Interactive voice response
The following are interactive voice response (IVR) systems:
- AT&T Watson
- CSLU Toolkit
- HTK — copyrighted by Microsoft, but altering the software for the Licensee's internal use is allowed.
- iSpeech ASR API
- Loquendo ASR
- Nuance Recognizer ASR
- Rubidium Ltd. ASR
- Proteus Conversational Interface
- Simmortel Voice
- Tellme Networks (acquired by Microsoft)
- Parlance nameConnector
- Verbyx Inc
Unix-like x86 and x86_64 speech transcription software
- Vocapia Research's VoxSigma software suite[14]
Discontinued software
- SpeechWorks from Nuance Communications.
- Quack.com – Acquired by AOL. The name has since been reused for an iPad search app.
- IBM ViaVoice – Embedded version still maintained by IBM.[15] No longer supported for versions above Windows Vista.[16] Untested above Mac OS X 10.4 or on Macintoshes with an Intel chipset.[17]
- Game Commander 2 by Mindmaker – Gaming-oriented voice recognition. Voice commands can be assigned to issue keystrokes and key combinations. Computer Gaming World reviewed it in its March 2001 issue, giving it a 5/5 score.
See also
References
1. MacSpeech Dictate Medical product page
2. MacSpeech Dictate Legal product page
3. "PowerSecretary Announcement".
4. [1]
5. http://www.voiceattack.com/
6. http://www.trigramtech.com/products.htm
7. http://vocola.net/
8. [2]
9. Philips SpeechMagic named European Technology Leader by Frost & Sullivan
10. http://thegamefanatics.com/tazti-brings-voice-recognition-to-pc-games/
11. http://www.e-speaking.com/
12. http://thegamefanatics.com/tazti-brings-voice-recognition-to-pc-games/
13. http://www.popsci.com/gear-%2526-gadgets/article/2008-09/hands-free-computing
14. http://www.vocapia.com
15. http://www-01.ibm.com/software/pervasive/embedded_viavoice/
16. http://nuance.custhelp.com/app/answers/detail/a_id/5775/p/31/c/980/r_id/100023
17. http://nuance.custhelp.com/app/answers/detail/a_id/4987/related/1/p/31/c/980/r_id/100023
External links
- SpeechGear
- Auditory Sciences
- Sonic Extractor
- e-speaking
- iSpeech Speech Recognition and Text-to-Speech
- AT&T Watson
- Loquendo ASR
- Tazti – free speech recognition software by Voice Tech Group, Inc.
- Clap Commander
- Voice Finger
- simon – GPL; intended to help people with disabilities
- VoxSigma speech-to-text software
- iATROS speech recognition software
- RWTH ASR
- Speech To Text Software Information
- Speech Recognition Software Tools Directory