634350765869323109seminartongquanocr
TRANSCRIPT
-
8/3/2019 634350765869323109SeminarTongQuanOCR
1/29
1
I HC DUY TNKHOA CNG NGH THNG TIN
Ch
NHN DNG CHV
CC HNG NGHIN CU
Ngi trnh byTS. PHM ANH PHNG
B mn C s Tin hc
Friday, March 04, 2011
-
8/3/2019 634350765869323109SeminarTongQuanOCR
2/29
-
8/3/2019 634350765869323109SeminarTongQuanOCR
3/29
3
GII THIU
Friday, March 04, 2011
Nhn dng ch l mt lnh vc c quan tm nghin cu vng dng tnhiu nm nay theo hai hng chnh:
Nhn dng ch vit tay: vi nhng mc rng buckhc nhau v cch vit, kiu ch... phc v cho cc ng dng
c v x l cc chng t, ha n, phiu ghi, bn vit taychng trnh... Nhn dng ch vit tay c tch ra haihng pht trin: nhn dng ch vit tay trc tuyn (on-line) v chvit tay ngoi tuyn (off-line).
Nhn dng ch in: phc v cho cng vic t ng ha cti liu, tng tc v cht lng nhp thng tin vo my
tnh trc tip tcc ngun ti liu.
-
8/3/2019 634350765869323109SeminarTongQuanOCR
4/294
GII THIU (tt)
Friday, March 04, 2011
Nhn dng chvit tay: vn cn l vn thch thc ln
i vi cc nh nghin cu. Bi ton ny cha thgii quyttrn vn c v n hon ton ph thuc vo ngi vit vs bin i qu a dng trong cch vit v trng thi sckhe, tinh thn ca tng ngi vit.
Nhn dng ch in: c gii quyt gn nh trn vn(sn phm FineReader 9.0 ca hng ABBYY c thnhn dngch in theo 192 ngn ng khc nhau, phn mm nhn dngchVit in VnDOCR 4.0 ca Vin Cng ngh Thng tin H Ni
c th nhn dng c cc ti liu cha hnh nh, bng vvn bn vi chnh xc trn 98%).
-
8/3/2019 634350765869323109SeminarTongQuanOCR
5/295
LCH SPHT TRIN
Friday, March 04, 2011
Cc sn phm nhn dng ch thng mi c t nhng nm1950, khi my tnh ln u tin c gii thiu tnh nng mi
v nhp v lu trd liu hai chiu bng cy bt vit trn mttm bng cm ng. Cng ngh mi ny cho php cc nhnghin cu lm vic trn cc bi ton nhn dng ch vit tayon-line.
Nhn dng ch c bit n t nm 1900, khi nh khoa hcngi Nga Tyuring pht trin mt phng tin tr gip cho
nhng ngi m.
Giai on 1: (1900 1980)
-
8/3/2019 634350765869323109SeminarTongQuanOCR
6/296
LCH SPHT TRIN
Friday, March 04, 2011
Nm 1954, my nhn dng ch u tin c pht trin biJ. Rainbow dng c ch in hoa nhng rt chm.
M hnh nhn dng chvit c xut tnm 1951 do phtminh ca M. Sheppard c gi l GISMO, mt robot c-vit.
Giai on 1: (1900 1980) (tt)
Nm 1967, Cng ty IBM thng mi ha h thng nhndng ch.
-
8/3/2019 634350765869323109SeminarTongQuanOCR
7/297
LCH SPHT TRIN
Friday, March 04, 2011
Cc hng tip cn theo cu trc v i snh c p dngtrong nhiu h thng nhn dng ch.
Vi s pht trin ca cc thit b phn cng my tnh v ccthit b thu thu nhn d liu, cc phng php lun nhn dng c pht trin trong giai on trc c c mi trng
l tng trin khai cc ng dng nhn dng ch.
Giai on 2: (1980 1990)
Trong giai on ny, cc hng nghin cu ch tp trung vocc k thut nhn dng hnh dng chcha p dng cho thngtin ngngha. iu ny dn n shn chv hiu sut nhn
dng, khng hiu qu trong nhiu ng dng thc t.
-
8/3/2019 634350765869323109SeminarTongQuanOCR
8/29
8
LCH SPHT TRIN
Friday, March 04, 2011
Cc k thut nhn dng kt hp vi cc phng php luntrong lnh vc hc my (Machine Learning) c p dngrt hiu qu.
Cc h thng nhn dng thi gian thc c ch trng tronggiai on ny.
Giai on 3: (T1990 n nay)
Mt s cng c hc my hiu qu nh mng n ron, m hnhMarkov n, SVM (Support Vector Machines) vx l ngnngtnhin...
-
8/3/2019 634350765869323109SeminarTongQuanOCR
9/29
-
8/3/2019 634350765869323109SeminarTongQuanOCR
10/29
10
Giai on tin xl
Friday, March 04, 2011
Nh phn ha nh Lc nhiu Tm xng
Hiu chnh nghing
-
8/3/2019 634350765869323109SeminarTongQuanOCR
11/29
11
Giai on tch ch
Friday, March 04, 2011
Tch dng Tch t, k t
-
8/3/2019 634350765869323109SeminarTongQuanOCR
12/29
12Friday, March 04, 2011
CC PHNG PHP TRCH CHN C TRNG
Bin i ton cc v khai trin chui
c trng thng k
c trng hnh hc v hnh thi
-
8/3/2019 634350765869323109SeminarTongQuanOCR
13/29
-
8/3/2019 634350765869323109SeminarTongQuanOCR
14/29
14Friday, March 04, 2011
c trng thng k
Phn vng (Zone) Cc giao im v khong cch
-
8/3/2019 634350765869323109SeminarTongQuanOCR
15/29
15Friday, March 04, 2011
c trng thng k (tt)
Chu tuyn (Contour Profile) Projection histograms
-
8/3/2019 634350765869323109SeminarTongQuanOCR
16/29
16Friday, March 04, 2011
c trng thng k (tt)
c trng hng (Direction Features)
Cc k t c m t nhcc vectm cc phn tca n l cc gi trthng k vhng.
-
8/3/2019 634350765869323109SeminarTongQuanOCR
17/29
17Friday, March 04, 2011
c trng hnh hc v hnh thi
Cc cu trc hnh thi: da trn cc cu trc nguyn thy(on thng, cung) to ra k t.
Cci lng hnh hc: cc k tc biu din bng oca cc i lng hnh hc nh t sgia chiu rng v chiu caoca hp cha k t, quan h khong cch gia hai im, so snh di gia hai nt, rng ca mt nt, khi lng ch hoa vchthng ca cc t, di t...
thv cy: u tin, cc t hoc cc k tc phn chiathnh mt tp cc i tng nguyn thy nh cc nt, cc imchc... Sau , cc thnh phn nguyn thy c s dng trongcc th lin quan.
-
8/3/2019 634350765869323109SeminarTongQuanOCR
18/29
18
CC HNG TIP CN NHN DNG
Friday, March 04, 2011
i snh mu
Tip cn cu trc
Cu trc ngphp(Grammatical Methods)
Phng php th
(Graphical Methods)
Ch p dng tt i vi nhn
dng ch in, cn chvit tay tht ra km hiu qu.
S dng trong giai on hu
x l sa cc li m khinhn dng thc hin sai
Ch vit c m t bi cc th, mi th l skt hp ca
cc dng nguyn thu: onthng, cung
-
8/3/2019 634350765869323109SeminarTongQuanOCR
19/29
19
CC HNG TIP CN NHN DNG (tt)
Friday, March 04, 2011
Tip cn thng k da trn c s ba gi thuyt chnh:
1. Phn b ca tp c trng l phn b Gauss hoctrong trng hp xu nht l phn b u.
2. C cc s liu thng k y c th dng chomi lp.
3. Tp nh {I} c th trch chn mt tp c trng
{fi}F, i{1,...,n} m tp c trng ny i dincho mi lp mu ring bit.
k-NNk-NN BayesBayes
-
8/3/2019 634350765869323109SeminarTongQuanOCR
20/29
20
CC HNG TIP CN NHN DNG (tt)
Friday, March 04, 2011
Cc phng php hc my tin tin
M hnh Markov n
(HMM Hidden MarkovModel)
M hnh Markov n
(HMM Hidden MarkovModel)
Mng n ron
(NN - Neural Network)
Mng n ron
(NN - Neural Network)
My vect ta
(SVM - Support Vector Machines)
My vect ta
(SVM - Support Vector Machines)
-
8/3/2019 634350765869323109SeminarTongQuanOCR
21/29
21
CC HNG TIP CN NHN DNG (tt)
Friday, March 04, 2011
Kt hp cc chin lc nhn dng
Mi k thut phn lpu c nhng u im
v nhc im ring.
Kt hp vi nhau theo mtcch no nng cao
hiu qu nhn dng
Xy dng cc kin trckt hp phn lp
Kin trc tun tKin trc tun t
Kin trc song songKin trc song song
Kin trc lai ghpKin trc lai ghp
-
8/3/2019 634350765869323109SeminarTongQuanOCR
22/29
22
CC HNG TIP CN NHN DNG (tt)
Friday, March 04, 2011
Kin trc tun tKin trc tun t
Kin trc song songKin trc song song
Kin trc lai ghpKin trc lai ghp
Chuyn kt qu u ra ca mt my
phn lp thnh u vo ca myphn lp tip theo, cc chin lctiu biu: Boosting, thc nc
Kt ni kt qu ca cc myphn lp c lp ca nhiuchin lc khc nhau. Tiu biu
nht l chin lc b phiu vlut quyt nh Bayes
Lai ghp gia hai kin trc tun t
v song song.
-
8/3/2019 634350765869323109SeminarTongQuanOCR
23/29
23
CC KIN THC CN THIT NGHIN CU
Friday, March 04, 2011
X l nh (Image Processing)
Hc my (Machine Learning)
Xc sut thng k v ton ng dng
Ngn nghc v ngn nghc tnh ton
(Linguistic and Computational Linguistic)
Mng n ron, HMMSVM
Boosting,..
Kernel method
Bayes
k-NN,..
n-Gram
http://www.kernel-machines.org/
-
8/3/2019 634350765869323109SeminarTongQuanOCR
24/29
24
CC B DLIU CHUN PHC V NGHIN CU
Friday, March 04, 2011
B dliu USPS (United States Postal Service)
B dliu MNIST (National Institute of Standardand Technology of the United States)
gm 7291 mu dng Train v 2007 mu khc test,mi mu l mt nh a cp xm kch thc 1616.
gm 60.000 mu dng Train v 10.000 mu khc test,mi mu l mt nh a cp xm kch thc 2828.
-
8/3/2019 634350765869323109SeminarTongQuanOCR
25/29
25
MT S KT QU THC NGHIM TRN TP MNIST
Friday, March 04, 2011
CLASSIFIER ERROR (%) Reference
K-nearest-neighbors, L3 1.22 Kenneth Wilder, U. Chicago
K-NN, shape context matching 0.63 Belongie et al. IEEE PAMI 2002
K-NN with non-linear deformation
(P2DHMDM)0.52 Keysers et al. IEEE PAMI 2007
SVM deg 4 polynomial 1.1 LeCun et al. 1998
Reduced Set SVM deg 5 polynomial 1.0 LeCun et al. 1998
Virtual SVM deg-9 poly [distortions] 0.8 LeCun et al. 1998
Trainable feature extractor + SVMs 0.54 Lauer et al., Pattern Recognition 40-6, 2007
3-layer NN, 500+300 HU 1.53 Hinton, unpublished, 2005
2-layer NN, 800 HU, MSE 0.9 Simard et al., ICDAR 2003
2-layer NN, 800 HU, cross-entropy 0.7 Simard et al., ICDAR 2003
NN, 784-500-500-2000-30 + nearest neighbor,
RBM + NCA training1.0 Salakhutdinov and Hinton, AI-Stats 2007
http://yann.lecun.com/exdb/mnist/
-
8/3/2019 634350765869323109SeminarTongQuanOCR
26/29
26Friday, March 04, 2011
KT LUN
Nhn dng ch in c gii quyt gn nhtrn vn Nhn dng chvit tay (online/Offline) vn l bi ton m
Trong nc: cc gii php nhn dng chvit tay tingVit vn ang c quan tm, nghin cu.
Xu hng s dng cc kin trc lai ghp gia cc
phng php nhn dng, Boosting tng tc cngnh chnh xc nhn dng.
M hnh ngn ng thng k N-Gram trong giai on hu
x l cng l ch rt ng quan tm.
-
8/3/2019 634350765869323109SeminarTongQuanOCR
27/29
27Friday, March 04, 2011
KT LUN (tt)
Pht trin cc ng dng nhn dng trn cc Form chvit tay
-
8/3/2019 634350765869323109SeminarTongQuanOCR
28/29
Friday, March 04, 2011 28
-
8/3/2019 634350765869323109SeminarTongQuanOCR
29/29
Friday, March 04, 2011 29
Cm nquv ch lng nghe!