text recognition techniques

17
Text Recognition Techniques Group #2 Di Wu (d8wu) Ehren Choy (e3choy) Muhammad Qureshi (m2quresh) Mohammad Talha Khalid (mtkhalid)

Upload: cael

Post on 23-Feb-2016

64 views

Category:

Documents


0 download

DESCRIPTION

Text Recognition Techniques. Group #2 Di Wu (d8wu) Ehren Choy (e3choy) Muhammad Qureshi (m2quresh) Mohammad Talha Khalid ( mtkhalid ). Problem. Recognize hand-written characters. Motivation. Hand-writing is a complex problem N eed AI techniques to help solve it. Remember A4?. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Text Recognition Techniques

Text Recognition Techniques

Group #2Di Wu (d8wu)

Ehren Choy (e3choy)Muhammad Qureshi (m2quresh)

Mohammad Talha Khalid (mtkhalid)

Page 2: Text Recognition Techniques

Problem

• Recognize hand-written characters

Page 3: Text Recognition Techniques

Motivation• Hand-writing is a complex problem• Need AI techniques to help solve it

Page 4: Text Recognition Techniques

Remember A4?

Page 5: Text Recognition Techniques

Preprocessing

• Colour Conversion

• Edge DetectionoCanny Algorithm

• Thinning & SkeletonizationoGenerate Predictor

Variables

Page 6: Text Recognition Techniques

Feature Extraction

• Predictor Variables oAspect Ratio

o Junction and End Points

oLoop

oAscenders and Descenders

Page 7: Text Recognition Techniques

Application to Radial Basis

• Input LayeroOne neuron per predictor variable 

• Hidden LayeroCalculate distance from center point

• Output LayeroOne output neuron for every category

Page 8: Text Recognition Techniques

Fuzzy Systems

• Pre-processing• Feature Extraction

– “Structural recognition”– Extracting individual features

• Fuzzy classification– Compare word structure with

reference words

Page 9: Text Recognition Techniques

Identifying structural features - English

• Micro Vertical Line• Micro Horizontal Line• Micro Positive Slant• Micro Negative Slant

Page 10: Text Recognition Techniques

Identifying structural features - Arabic

• 4 sub-words• 1 ascender• 1 dot above• 2 loops• 2 descenders

Numerical decomposition of the word

Page 11: Text Recognition Techniques

Compare word structure with reference words

• Fuzzy classifier classifies word’s membership in different classes

• Uses Fuzzy K nearest neighbor algorithm– Calculate distance between word & training samples

Page 12: Text Recognition Techniques

Fuzzy Network: Pros and Cons

• Advantages– Lower computational requirements

• Disadvantages– Not widely used in handwriting recognition

problems

Page 13: Text Recognition Techniques

Genetic Programming

• Generating graph from input image – Break down image to line segments

• Compute fitness using fitness function – Edge Deviation – Graph Deviation

• Crossover operation – Replace a path between two vertices in one graph

with a path between two matching vertices in another graph

Page 14: Text Recognition Techniques

Pros and Cons

• Advantages – Easier to generate a large solution set by creating

hybrids from a smaller initial set • Disadvantages

– Long running time due to high number of possible combinations of graph pairs for computing fitness

Page 15: Text Recognition Techniques

Demo

Page 16: Text Recognition Techniques

Neural Networks: Pros and Cons

• Advantages– Automatic Learning– Quick Classification

• Disadvantages– Efficiency– Locality

Page 17: Text Recognition Techniques

Questions?