OS X’s built-in speech recognition gets better the more it is used

Written by Writer on Sunday, October 12th, 2008

FX tensions

OS X’s built-in speech recognition gets better the more it is used

GRAHAM K. ROGERS

in OS X, Leopard, includes to use special features like Speech Preferences which works with voice and hearing. As with many such features, there is : Speech Recognition and .

When OS X first appeared it included , called Speakable Items. It wasn’t all that good. It is still not perfect, but has improved. It requires training; the more it is used, the better it becomes.

The Speech Recognition panel has selection for On or Off at the top and below that a for Settings and Commands. When active, a small appears on the desktop remaining on top of windows, so is always visible. A list of is also shown. As the input microphone picks up sound, so bars and arrows on that icon show this.

The computer’s is the , but this can be changed in the “Microphone” button below commands. Line-in is also available, but like the Sound preferences this needs a device attached. In the same way, if I attach a , that is also listed. To the right of the is a Calibrate button.

displays a and a list of phrases. As we read these out so they flash when recognised. I don’t know if it is my accent, but the system only recognised six out of the first time, and never identified my input on one of them even when I changed input speed, emphasis and volume. I will have to keep working on this. I am unsure if it is the computer or user that needs training.

The feature works either with a listening key or a keyword. The is ESC and this can be changed. The is “Computer” so commands are preceded with this, like “Computer, what time is it?” Precisely when the keyword is used may also be changed. I find that using the key is more reliable than the keyword.

When a command is recognised, the computer acknowledges this and plays a sound. The acknowledgement can be disabled (there is a check box) and the sound can be selected from one of the 16 system sounds, or None.

The Commands button, next to Settings, allows selection of which parts of the system can be used by Speech recognition. Of these, the top pair, Address Book and Global Speakable Items are configurable. At the bottom of the panel is a link to the Speakable Items Folder. This contains files in XML format so it may be possible to add to these.

Although this has a novelty aspect, there are values. For example if I have my hands full, I can access the AddressBook and require a specific phone number to be displayed on the screen. I can also switch to another application (for example iTunes), or I can make it check for email. Once turned on, it remains active if we quit .

The other half of the Speech preference deals with output from text: More effective and usable than Speech Recognition. The feature has been available via application menus for a while under the Services sub-menu. Highlighted text is read out in the selected computer voice.

Voices have ranged from the sublime (Victoria) to the ridiculous (Deranged or Bubbles). A new voice, Alex, was added with Leopard, which gives clear output and is almost in the class of HAL in 2001: A Space Odyssey.

Beneath the voice selector, is a speed control. Slowing the voice improves output although the fastest setting is still understandable. To test each voice or speed setting, there is a Play button alongside.

The feature is good for proof reading, but its primary value is for those with seeing difficulties. Apple recently added features to the new iPod nano which use similar technology which was applauded by the US Society for the Blind but there has been a long-time inclusion of such features in Apple systems. Cupertino has also increased its facilities for those with disabilities, some of which may be seen on the Accessibility pages. There are also applications like ZappTek’s iSpeakIt that will create a sound file and put it in iTunes for iPod use. Visit http://www.apple.com/accessibility and http://www.zapptek.com/ispeak-it for more information.

For more accessibility options there are three check-boxes beneath: Announce when alerts are displayed; Announce when an application requires attention; and Speak selected text when keys are pressed. This is easier than the current menu access. At the bottom are two buttons that bring in features from other preferences; to have the clock announce the time; and to change VoiceOver settings.

Graham K. Rogers has OS X-flavoured web pages, with links to an RSS feed and a weekly podcast at http://www.extensions.in.th/index4.html.

News Topics Related Posts :

News Topics : , , , , , , , , , , , , , , , , , , , ,

This entry was posted on Sunday, October 12th, 2008 and is filed under Database. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.

Asia News Reports

News Headlines

Advertisement

Bookmarks Me

del.icio.us Digg Furl Reddit Ask BlinkList Bloglines blogmarks BUMPzee Blogg-Buzz DZone Facebook Google Ma.gnolia Mixx MisterWong muti Newsvine PlugIM ppnow Propeller Rojo Shadows Simpy Slashdot Socializer Sphere Sphinn Spurl StumbleUpon Tailrank Technorati ThisNext Twitter Windows Live Wists YahooMyWeb

Thailand News Update

Asia News Update

World News Update