Watching Your Words

Techcrunch’s Marshall Kirkpatrick reveals an inter­esting new tech­no­logy devel­op­ment designed to improve the podcast format:

Seattle based podcast dis­covery and man­age­ment service Pluggd is unveiling a major new feature at DEMO this weekend that combines speech recog­ni­tion and semantic analysis to let users search for and skip to parts of an audio file that are related to topics of interest to them. It’s more than just speech recognition.

This is one of the most com­pel­ling examples I’ve seen lately of a growing trend: making mul­ti­media content more granular and letting users take even greater control over the media we consume. We don’t just want to consume what we wish, we want to consume it in the way we wish.

Podcasts have a number of ease-​​of-​​use problems that this helps with. Only Apple’s podcast format can be properly split into chapters, for example, but those are less popular because you can’t play them on non-​​Apple devices. On an iPod, you can’t fast-​​forward an MP3 file. Lastly, podcasts can only be consumed in real-​​time — you can’t skim through it the way you can with printed publications.

However, while ease of use is high­lighted, I think the impetus for search tech­no­lo­gies in audio comes from other motiv­a­tions too. For pub­lishers, podcasts (and video) have proven dif­fi­cult to monetise, with most opting for a flat spon­sor­ship deal, if anything. However, if you know what words are in a piece of audio, or video, then auto­mat­ic­ally targeted advert­ising works a lot better. You could opt, for example, for your adverts to be appended to podcasts that contain the word ‘iMovie’, the same way that advert­isers buy these words from AdSense. Advertisers might also be able to generate a black­list of offensive terms and com­pet­itors’ names to avoid being asso­ci­ated with content they would rather not be.

There has been talk of meta­tag­ging innov­a­tions in audio and video, but the problem with all metatags is that they can be spammed. True audio search is superior in every way. The video equi­valent is already underway at Dave.TV which will appar­ently monitor the words in a clip to decide on appro­priate ads:

We will be using the same tech­no­logy used by Homeland Security to monitor [tele­phone] chatter. Audio keywording will allow us to con­tex­tu­ally figure out where to sell ads and to place more than just pre– and post-​​roll ads.

Share this post:

Digg This
Reddit This
Stumble Now!
Buzz This
Share on Facebook
Bookmark this on Delicious
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter
Google Buzz (aka. Google Reader)

Possibly related:

Leave a Reply

  

  

  

You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>