Commentary

Music Recognition App Moves To Be Consumer Translator For Internet Of Things

More people are going to be speaking to more things.

While the Internet of Things will bring the addition of new screens into the home and present the opportunity for more and varied forms of advertising, the power of voice may become front and center when it comes to consumer interactions.

And some of these new forms of interactions may be facilitated by what essentially started out as a music listening mobile app.

Much like Shazam, the SoundHound app quickly identifies a song being played. It launched back in 2009. I’ve regularly used the app since its early days, primarily because it works well in terms of both speed and accuracy.

The music recognition app has been downloaded more than 280 million times.

“It’s been all viral, with no advertising dollars spent,” SoundHound CEO and founder Keyvan Mohajer told me yesterday.

But the music recognition capability was just a stop on the road to the Internet of Things.

After many months of beta testing, the company’s new app called Hound was recently released to the public.

Rather than listening for music to identify, Hound is a voice-driven search app, squarely targeted at the Internet of Things.

SoundHound started more than 10 years ago on a long-term mission to perfect voice interactions.

“We concluded there would be a day that we talk to everything around us, which was not obvious at the time,” Mohajer said.

And that time is arriving now, with many consumers already using voice commands in their homes.

In one recent study, a majority (64%) of smart home product owners use voice commands, and 61% of them want to use them even more, as I wrote here recently (Consumer Voice Commands Rule For Smart Home Objects; 61% Want Even More).

The idea for Hound is for it to be embedded into connected objects so that consumers can control those by simply speaking.

“This has the potential to be extremely disruptive,” said Mohajer.

One requirement for voice commands to be effective with connected objects will be speed.

Voice-activated search has two basic components: speech-to-text recognition, and then the translation of that text into meaning so that an answer to a question can be returned.

Hound combines the two functions into one, so that a search gets underway even while the question is still being asked.

One of the other elements of Hound is the ability to understand more complex questions.

For the connected home, this capability could allow rapid understanding of what a consumer is asking of a connected object.

Another possibility down the road would be for Hound to recognize who is speaking when someone in the home says something such as ‘play music I like.’

And the conversation between consumer and smart object is where brands likely can work their way in.

The dynamic of people speaking to activate and control products and appliances is at the starting gate.
