computer vision group department of computer science university of illinois at urbana-champaign

12
Computer Vision Group Department of Computer Science University of Illinois at Urbana- Champaign

Upload: brianne-leonard

Post on 18-Jan-2018

219 views

Category:

Documents


0 download

DESCRIPTION

Multimodal Information Access & Synthesis  Abstract  Process  Demo  Results  Future Work  Questions Outline

TRANSCRIPT

Page 1: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Computer Vision GroupDepartment of Computer Science

University of Illinois at Urbana-Champaign

Page 2: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

Sang Hyun ParkJoel QuintanaRobert Rand

David Forsyth

Recognition and Efficient Retrieval of Recognition and Efficient Retrieval of Similar Images in Large Datasets Similar Images in Large Datasets

Using Visual WordsUsing Visual Words

Page 3: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

Abstract Process Demo Results Future Work Questions

OutlineOutline

Page 4: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

Problem: We want to...Identify pictures by content rather than color.Compare large sets of images to find near duplicates.Recognize similar pictures despite small changes.

Challenges: Similar Images can be...in different filenames.in different formats.in different sizes and arrangements.stretched, skewed, colored and otherwise altered.

AbstractAbstract

Page 5: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

Why would we want to find near duplicate images?News ReportsForged PhotosSocial NetworksPicture/Mugshot MatchingWeapon & Symbol ID

ApplicationApplication

Page 6: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

ProcessProcessImages Database

Part1: Get the Visual Words

Extract SIFT Features

Interest Points Database

Group Similar Interest Points (Kmeans)

List of General Points (Visual Words)

Page 7: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

ProcessProcessOriginal Images

Database

Part2: Store Histograms of Visual Words of the Images on the Database

Image

Add Histogram to the Histograms Database Histograms Database

Calculate Histogram of Visual Words

Histogram of Visual Words

Page 8: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

ProcessProcessPart3: Retrieval of Similar Images

New Image

Histograms Database

Calculate Histogram of Visual Words

Histogram of Visual Words

Query For Nearest Neighbors

List Of Nearest Neighbors

Page 9: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

DemoDemo

Page 10: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

Current Configuration– 10 Interest points per image– 3000 Visual words (K-mean Clustering)– KDTree to get approximate nearest neighbors– Precision : ~0.50

Future Configuration– All Interest points from images– More Clustering Algorithms (Hierarchical K-means / KDTree)– Usage of Full Potential of FLANN

ResultsResults

Page 11: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

Bigger Database–  Web (Image Search Engine / Flicker / Facebook)

  Multiple Queries

– Parallelized Processing (Efficient Processing of Queries)

Other Application– Detection of objects inside images: logos, symbols, tattoos,

weapons, etc– Finding relationships between people according to their common

pictures on social networks

Future WorkFuture Work

Page 12: Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign

Multimodal Information Access & Synthesis

QuestionsQuestions