PCs Powered by the Wisdom of Crowds

Physorg.com — daily reading in the Delaney house­hold — has posted an inter­esting piece on how researchers at the Technion-​​Israel Institute of Technology have fed their com­puters a diet of Wikipedia to make them more clever. While Web 2.0 sceptics might have pre­ferred the Britannica, the point of the exercise is to make the machines capable of forming asso­ci­ations, the same way you and I do when we think.

It seems to me like one more step towards the Holy Grail next-​​gen Google-​​killer search engine. Google can already do synonyms: when you search for cars, it also returns results that talk about auto­mo­biles. What it can’t do is make any asso­ci­ations. If you search for ‘Iraq War’, it won’t return results on ‘axis of evil’, ‘bush foreign policy’ or ‘mutually-​​assured destruc­tion’. It doesn’t know what topics are asso­ci­ated with your search term. By giving a machine a diet of (mainly) intel­li­gent dis­cus­sion about as many topics as possible, they’ll be able to find pages that are relevant to your search term, but which aren’t keyword heavy.

The second example given is a spam filter. A simple filter might block anything con­taining the word ‘vitamin’ that comes from a stranger. A filter which has been taught a little more would know that ‘B12’ is a vitamin and be able to dis­tin­guish sci­entific dis­cus­sion from a sales pitch.

The method could also appar­ently be used to improve auto­mated trans­la­tions. When a simple trans­lator comes across the word ‘mouse’, it doesn’t know if it’s a rodent or a computer peri­pheral. If it knew enough about the context in which the word appeared, though, it would be able to dis­am­big­uate the passage it was working on.

The article doesn’t mention the expres­sion, but this is very much along the lines on which the semantic web is supposed to work. Wikipedia provides an ontology for machines to have some under­standing of human text. It isn’t quite arti­fi­cial intel­li­gence, but it’s quite close: the machines use our intel­li­gence to simulate their own.

N.B. Interestingly, my friend Marc Fawzi described exactly this idea in a piece he posted on the subject last June.

Share this post:

Digg This
Reddit This
Stumble Now!
Buzz This
Share on Facebook
Bookmark this on Delicious
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter
Google Buzz (aka. Google Reader)

Possibly related:

5 comments to PCs Powered by the Wisdom of Crowds

Leave a Reply

  

  

  

You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>