Reuters tests facial recognition video search

Cory Bergman September 18th, 2007

Powered by Viewdle, Reuters Labs has debuted a facial recognition search feature that scans across 609 hours of video for the beta test. For example, I typed in Amy Winehouse, and it returned a list of clips making up 18 appearances for a total on-screen time of 1:06. Click a clip, and it plays it from the moment she appears. Mouse over the video, and a box outlines her face with the words “Amy Winehouse.” Very, very cool.

3 Comments Add your own

  • 1. discreet_chaos  |  September 18th, 2007 at 3:58 pm

    Neat idea, but still a little buggy.

    As my first attempt, I typed Jewel hoping for video of Jewel Kitchner, the singer from Homer Alaska. The autocomplete dropped-down Richard Jewell, Jewel Sansom and the target of my search (sans last name) and no matter if I left plain-old Jewel in the searchbox or if I selected her name, Richard Jewell was the one that would always come up.

    Sticking with just one-named people, I tried Cher, but they possibly don’t have any video and they had Elvis listed with his first and last. They also didn’t have any Bruce Springsteen, but I didn’t try any newsmakers. I’ll have to give them a whirl later.

  • 2. Safran  |  September 18th, 2007 at 7:27 pm

    In fairness, a TRS-80 could recognize Amy Winehouse.

  • 3. baker  |  September 19th, 2007 at 10:08 am

    in all likelihood, they’re balancing this “facial recognition” parlor trick stuff with some good ol’ fashion metadata crawling and voice-to-text transcription. and without any data that i can point to for support, i bet audio can provide a way better ID print of a person than some facial recognition software. in fact, that’s probably the last determining factor that constitutes a match. in the case of a musician, it’s probably more like:

    1. Metadata crawl for relevant info
    2. Speech to text; index and search against lyrics to determine if this is performance or if keywords indicate this video is about or featuring the musician
    3. Audio sampling to compare voice print to database of voice print samples
    4. Image relevancy scan to determine if the video sections with the highest probability of the musician appearing (based on 2,3 above) meet some criteria to confirm that yes, they’re 70% sure this person actually appears in the video you’re watching.

    it’s like magic, except a lot less magical, and way cooler when you call it “facial recognition” technology.

Leave a Comment

(Please keep URLs out of the comment body or the spam filter will block you.)

hidden

Subscribe to the comments via RSS Feed


Most Recent Stories



 

Calendar

September 2007
M T W T F S S
« Aug   Oct »
 12
3456789
10111213141516
17181920212223
24252627282930

Posts by Month

Posts by Category