AT&T releasing Watson voice recognition APIs to developers in June

13

AT&T's research arm has spent over two decades developing its Watson speech and language engine, which translates spoken words into text. Now, AT&T is planning to release a number of Watson APIs for developers in June, in an effort to accelerate development and innovation in the voice recognition space. Instead of having to develop their own speech recognition software, developers will now be able to plug AT&T's Watson APIs into their apps to more easily include voice recognition features.

AT&T's first APIs will be focused around seven different areas: web search, local business search, Q&A, voice mail to text, SMS, AT&T's U-verse video programming guide, a general-purpose dictation API. AT&T has found that speech recognition works best when focused on specific categories, so these categories will help Watson know what types of words to expect. Unsurprisingly, AT&T's informational video (included below) focused on the example of building a Watson-enabled U-verse programming guide, so you could tell it what channel, movie, or actor you were looking for. While these seven categories will be part of the initial release, it sounds like AT&T plans to add more and more categories over time.

Additionally, AT&T will also be releasing an SDK it's calling the Speech Kit; it'll allow developers to create software that captures spoken words and sends them into a network for transcription. There's minimal details on where exactly the captured words are sent, but we expect we'll hear more when this SDK is released. As a way of showing off Watson software in action before the API release, AT&T recently launched the AT&T Translator app for Android and iOS. It purports to translate your speech into another language of your choice, but the few reviews in iTunes make it sound like there's a few bugs that need to be worked out still.

Historically, AT&T has used Watson internally for interactive voice response for things like the automated customer care systems we've grown to know (and possibly hate) over the years, as well as voicemail-to-text, voice search, and many other applications for translating a human's voice to a computer. While that usage may have its place, we're happy to see Watson's technology get into the hands of developers who will hopefully apply it to situations beyond yelling at voice-automated menus for an operator.

Update: AT&T had plenty of Watson-enabled experiments to show us at an event in New York today, the most flashy of which was a QNX-equipped Porsche 911 that used the carrier's cloud-based service to handle voice commands. The convertible had its top down and unfortunately had trouble picking up commands accurately and reliably with the din of New York City around it, but be sure to check our impressions of the car (which appears to be unchanged from CES 2012) here.

Qnx-porsche-911_1020

Back to top ^
X
Log In Sign Up

forgot?
Log In Sign Up

Please choose a new Verge username and password

As part of the new Verge launch, prior users will need to choose a permanent username, along with a new password.

Your username will be used to login to Verge going forward.

I already have a Vox Media account!

Verify Vox Media account

Please login to your Vox Media account. This account will be linked to your previously existing Eater account.

Please choose a new Verge username and password

As part of the new Verge launch, prior MT authors will need to choose a new username and password.

Your username will be used to login to Verge going forward.

Forgot password?

We'll email you a reset link.

If you signed up using a 3rd party account like Facebook or Twitter, please login with it instead.

Forgot password?

Try another email?

Almost done,

By becoming a registered user, you are also agreeing to our Terms and confirming that you have read our Privacy Policy.
Spinner.vc97ec6e

Authenticating

Great!

Choose an available username to complete sign up.

In order to provide our users with a better overall experience, we ask for more information from Facebook when using it to login so that we can learn more about our audience and provide you with the best possible experience. We do not store specific user data and the sharing of it is not required to login with Facebook.

tracking_pixel_5345_tracker