qingqun kong 2011.7 - iavision.ia.ac.cn/zh/senimar/reports/visnet.pdf · visnet a model of...

Qingqun Kong 2011.7.12


Page 1: Qingqun Kong 2011.7 - IAvision.ia.ac.cn/zh/senimar/reports/visnet.pdf · Visnet A model of invariant object recognition Edmund T. Rolls and Gustavo Deco, ”Computational Neuroscience

Qingqun Kong

2011.7.12


Visnet A model of invariant object recognition

Edmund T. Rolls and Gustavo Deco, "Computational Neuroscience of Vision", Oxford University Press, 2002


Visnet A model of invariant object representation

Hierarchical network


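Outline

Introduction to object recognition

Physiological mechanisms of object recognition

Approaches to object recognition

Visnet

Implementation of Visnet and analysis of results

Next steps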

Invariant object recognition

[Diagram, built up over five slides. Labels: Inputs (images of different objects at different positions), model (Visnet), invariant object representation, labels, Outputs.]


Invariant object recognition

Solving translation (view, size, ...) invariance:

responding to the same local spatial arrangement while ignoring the global position of the object

Recognizing an object under different transforms after just a few seconds of inspecting it

Outline

Introduction to object recognition

Physiological mechanisms of object representation

Approaches to object representation

Visnet

Implementation of Visnet and analysis of results

Next steps


Neurophysiological mechanisms

Hierarchical network

Feedforward connections

Lateral connections


Neurophysiological mechanisms

Hierarchical network

Sparse representation

Local representation

Distributed representation

Representing similarity by vector correlation;

Exponential coding capacity;
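
As a small illustration of the two properties listed for distributed representations, the sketch below compares a local (one-neuron-per-stimulus) code with a distributed binary code over the same number of neurons. The neuron count, the example firing vectors and the helper name `correlation` are illustrative assumptions, not values from the slides.

```python
import numpy as np

def correlation(a, b):
    """Pearson correlation between two firing-rate vectors."""
    return float(np.corrcoef(a, b)[0, 1])

n_neurons = 8  # illustrative size, not from the slides

# Local code: one neuron per stimulus, so capacity grows linearly.
local_capacity = n_neurons
# Distributed binary code: every on/off pattern can stand for a stimulus,
# so capacity grows exponentially with the number of neurons.
distributed_capacity = 2 ** n_neurons

# In a distributed code, similar stimuli share most of their active neurons.
stim_a = np.array([1, 1, 1, 1, 0, 0, 0, 0], dtype=float)
stim_b = np.array([1, 1, 1, 0, 1, 0, 0, 0], dtype=float)  # overlaps with a
stim_c = np.array([0, 0, 0, 0, 1, 1, 1, 1], dtype=float)  # disjoint from a

sim_ab = correlation(stim_a, stim_b)  # high: the two patterns overlap
sim_ac = correlation(stim_a, stim_c)  # low: the two patterns are opposite
```

With a local code, any two different stimuli activate different single neurons, so the code itself carries no graded measure of how similar they are.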

Neurophysiological mechanisms

Hierarchical network

Sparse coding

Temporal properties

When an object is translated to a nearby position, the shift occurs within a short period, so the membrane of the postsynaptic neuron is still in its 'Hebb-modifiable' state, and the presynaptic afferents activated by the object in its new position can be strengthened onto that still-active neuron.


Approaches to invariant object recognition

Feature spaces

Ignore the relative arrangement of the features

Some birds (pigeons)

Structural descriptions and syntactic pattern recognition

3D descriptions

Necessary for language to provide descriptions of objects

Template matching and the alignment approach

Active vision (some invertebrates)

Feature hierarchies and 2D view-based object recognition

Visnet


Visnet


Architecture of Visnet

The forward connections to each cell are drawn from a topologically corresponding region of the preceding layer, using a Gaussian distribution of connection probabilities.
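
One way to picture this connectivity is to sample, for each cell, a set of presynaptic positions scattered around the matching location in the layer below. The following is a minimal sketch under stated assumptions: the function name `sample_afferents`, the layer size, the number of connections, the Gaussian width and the wrap-around at the layer edges are all hypothetical choices, not values given in the slides.

```python
import numpy as np

def sample_afferents(cell_rc, layer_size, n_connections, sigma, rng):
    """Pick presynaptic cells for one postsynaptic cell.

    Offsets from the topologically corresponding position are drawn from
    a 2D Gaussian. Coordinates wrap around the layer edges (an assumption),
    and the same presynaptic cell may be drawn more than once.
    """
    offsets = rng.normal(loc=0.0, scale=sigma, size=(n_connections, 2))
    coords = (np.asarray(cell_rc) + np.rint(offsets).astype(int)) % layer_size
    return coords

rng = np.random.default_rng(0)
# Afferents for the cell at row 16, column 16 of a 32 x 32 layer.
afferents = sample_afferents((16, 16), layer_size=32,
                             n_connections=100, sigma=3.0, rng=rng)
```

Each row of `afferents` is a (row, column) position in the preceding layer; most positions fall close to (16, 16), with fewer connections further away.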


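Input to Visnet

[Diagram labels: Camera, Retina (images of different objects at different positions), filter (V1), Inputs, Visnet, Outputs]

Input image: $I(x, y)$

Filter: $\Gamma_{xy}(\rho, \theta, f)$

Filtered input: $I(x, y) * \Gamma_{xy}(\rho, \theta, f)$

$$\Gamma_{xy}(\rho, \theta, f) = \rho \left[ e^{-\left(\frac{x\cos\theta + y\sin\theta}{\sqrt{2}/f}\right)^2} - \frac{1}{1.6}\, e^{-\left(\frac{x\cos\theta + y\sin\theta}{1.6\sqrt{2}/f}\right)^2} \right] e^{-\left(\frac{x\sin\theta - y\cos\theta}{3\sqrt{2}/f}\right)^2}$$

$\rho = \pm 1$; $f = 0.5,\ 0.25,\ 0.125,\ 0.0625$; $\theta = 0^\circ,\ 45^\circ,\ 90^\circ,\ 135^\circ$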
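
The filtering stage can be sketched in a few lines of NumPy. This is an illustrative sketch, not the presenter's implementation: the difference-of-Gaussians form follows the published Visnet filter, while the 13 x 13 kernel size, the circular FFT-based convolution, the toy input image and the names `visnet_filter` and `filter_image` are assumptions made for the example.

```python
import numpy as np

def visnet_filter(size, rho, theta_deg, f):
    """Oriented difference-of-Gaussians kernel (size is assumed odd)."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1].astype(float)
    theta = np.deg2rad(theta_deg)
    # Rotated coordinates used by the two parts of the filter.
    u = x * np.cos(theta) + y * np.sin(theta)
    v = x * np.sin(theta) - y * np.cos(theta)
    s = np.sqrt(2.0) / f
    centre = np.exp(-(u / s) ** 2)
    surround = np.exp(-(u / (1.6 * s)) ** 2) / 1.6
    envelope = np.exp(-(v / (3.0 * s)) ** 2)
    return rho * (centre - surround) * envelope

def filter_image(image, kernel):
    """Circular convolution of image with kernel, computed via the FFT."""
    padded = np.zeros_like(image, dtype=float)
    kh, kw = kernel.shape
    padded[:kh, :kw] = kernel
    # Shift so that the kernel centre sits at the array origin.
    padded = np.roll(padded, (-(kh // 2), -(kw // 2)), axis=(0, 1))
    return np.real(np.fft.ifft2(np.fft.fft2(image) * np.fft.fft2(padded)))

# One kernel per combination of sign, orientation and spatial frequency.
bank = {
    (rho, theta, f): visnet_filter(13, rho, theta, f)
    for rho in (1, -1)
    for theta in (0, 45, 90, 135)
    for f in (0.5, 0.25, 0.125, 0.0625)
}

image = np.zeros((128, 128))
image[60:68, 64] = 1.0  # toy input: a short vertical bar
responses = {key: filter_image(image, k) for key, k in bank.items()}
```

The two signs, four orientations and four frequencies give 32 filtered copies of the image, which together play the role of the V1-like input to the first Visnet layer.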

Learning process (taking layer 1 as an example)

1. The activation of each neuron, $h_i$:

$$h_i = \sum_j x_j w_{ij}$$

2. Competition and lateral inhibition (the activations are convolved with the lateral-inhibition filter $I$):

$$r = h * I$$

3. Contrast enhancement

The threshold of the contrast-enhancement function is used to control the sparseness of firing rates within each layer.

4. Updating weights, where $\alpha$ is the learning rate and $\eta$ sets the contribution of the preceding trace:

$$\delta w_{ij} = \alpha\, \bar{y}_i^{\tau} x_j^{\tau}$$

$$\bar{y}_i^{\tau} = (1 - \eta)\, y_i^{\tau} + \eta\, \bar{y}_i^{\tau - 1}$$

The weights are then updated, with the increment written on the last build of the slide as:

$$w_{ij} \leftarrow w_{ij} + \delta w_{ij}, \qquad \delta w_{ij} = \alpha\, y_i\, (x_j - w_{ij})$$

5. Return to step 1
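
The five steps above can be sketched as a single function applied once per input. This is a simplified sketch under stated assumptions, not the presenter's code: lateral inhibition is reduced to subtracting the mean activation instead of convolving with a spatial filter, the sigmoid form and its slope `beta`, the percentile-based threshold, the weight normalisation and all parameter values are assumptions, and `layer_step` is a hypothetical name.

```python
import numpy as np

def layer_step(x, w, y_trace, alpha=0.1, eta=0.8, beta=10.0, sparseness=0.1):
    """One learning step for a single layer (simplified sketch)."""
    # 1. Activation: h_i = sum_j x_j w_ij
    h = w @ x
    # 2. Lateral inhibition, simplified here to subtracting the mean
    #    activation (the slides convolve h with a spatial filter I).
    r = h - h.mean()
    # 3. Contrast enhancement: a sigmoid whose threshold is placed so that
    #    roughly a fraction `sparseness` of the cells lies above it.
    threshold = np.quantile(r, 1.0 - sparseness)
    y = 1.0 / (1.0 + np.exp(-2.0 * beta * (r - threshold)))
    # 4. Trace rule: blend the current firing with the previous trace,
    #    then apply a Hebbian update that uses the trace.
    y_trace = (1.0 - eta) * y + eta * y_trace
    w = w + alpha * np.outer(y_trace, x)
    # Keep each cell's weight vector at unit length (an assumption).
    w = w / np.linalg.norm(w, axis=1, keepdims=True)
    return y, w, y_trace

rng = np.random.default_rng(0)
w = rng.random((20, 50))                       # 20 cells, 50 inputs
w /= np.linalg.norm(w, axis=1, keepdims=True)
y_trace = np.zeros(20)
for _ in range(5):        # 5. return to step 1 for the next input
    x = rng.random(50)
    y, w, y_trace = layer_step(x, w, y_trace)
```

Because `y_trace` carries over between calls, inputs presented close together in time strengthen connections onto the same cells, which is what lets successive views of one object become associated.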

Testing process (taking layer 1 as an example)

1. The activation of each neuron, $h_i$:

$$h_i = \sum_j x_j w_{ij}$$

2. Competition and lateral inhibition:

$$r = h * I$$

3. Contrast enhancement


Experiment

Each image is 64×64 pixels and is shown at different positions on the 128×128 "retina".

The image was translated by 8 pixels for each move.
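
To make the stimulus geometry concrete, the sketch below pastes a 64×64 image onto a 128×128 retina at successive 8-pixel offsets. It assumes that the image must stay fully inside the retina and that only horizontal shifts are shown; neither is stated in the slides, and `place_on_retina` is a hypothetical helper.

```python
import numpy as np

RETINA, IMAGE, STEP = 128, 64, 8

def place_on_retina(image, top, left, retina_size=RETINA):
    """Return a blank retina with `image` pasted at (top, left)."""
    retina = np.zeros((retina_size, retina_size), dtype=image.dtype)
    h, w = image.shape
    retina[top:top + h, left:left + w] = image
    return retina

# Offsets at which the whole image still fits on the retina.
offsets = list(range(0, RETINA - IMAGE + 1, STEP))
image = np.ones((IMAGE, IMAGE))  # placeholder for an object image
views = [place_on_retina(image, 0, left) for left in offsets]
```

Under these assumptions there are (128 - 64) / 8 + 1 = 9 positions along one axis.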


Experiment 1


Experiment 2

Learning rule:

$$\delta w_{ij} = \alpha\, \bar{y}_i^{\tau - 1} x_j^{\tau}$$


Conclusion

Feature combinations are formed at an early stage of processing

Trace learning rule

Solves translation invariance (responding to the same local spatial arrangement while ignoring the global position of the object)

Recognizes an object under different transforms after just a few seconds of inspecting it

It would be less suitable for guiding actions in 3D space


Analysis of results

Input

Filter:

$$\Gamma_{xy}(\rho, \theta, f) = \rho \left[ e^{-\left(\frac{x\cos\theta + y\sin\theta}{\sqrt{2}/f}\right)^2} - \frac{1}{1.6}\, e^{-\left(\frac{x\cos\theta + y\sin\theta}{1.6\sqrt{2}/f}\right)^2} \right] e^{-\left(\frac{x\sin\theta - y\cos\theta}{3\sqrt{2}/f}\right)^2}$$

Input connections

Determination of two parameters (their symbols are not recoverable from the transcript)

The output of Visnet, taken as the output of a competitive network, is used for classification


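Next steps

Continue looking for the cause and get Visnet's translation invariance working;

Test Visnet's invariance to view and size

Consider the role of feedback

The relationship between object recognition and stereo vision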


Thanks!