programming for linguistics: evaluating parser output
Post on 26-Dec-2014
431 Views
Preview:
DESCRIPTION
TRANSCRIPT
Programming for Linguistics December 16, 2013
Evaluating parser output
Outline of topics
1. Preprocessing (10 slides)2. Processing (6 slides)3. Evaluation (1 slide)4. Results (1 slide)5. Improvements (1 slide)
Programming for Linguistics
1 Preprocessing
Penn Treebank trees as NLTK Tree objects
Programming for Linguistics
1 Preprocessing
Programming for Linguistics
1 Preprocessing
Programming for Linguistics
1 Preprocessing
Programming for Linguistics
1 Preprocessing
Programming for Linguistics
1 Preprocessing
Programming for Linguistics
1 Preprocessing
Programming for Linguistics
1 Preprocessing
Programming for Linguistics
2 Processing
D. Klein and C. Manning 2002. A generative constituent-context model for improved grammar induction. In Proceedings of the ACL.
Programming for Linguistics
2 Processing
Does this work for original trees and processed trees?
Programming for Linguistics
2 Processing
Spans represented as label-span tuples:(‘label’, start, end)
Programming for Linguistics
2 Processing
Programming for Linguistics
2 Processing
Programming for Linguistics
2 Processing
Programming for Linguistics
2 Processing
Now what can we do with this information?
Programming for Linguistics
4 Evaluation
Programming for Linguistics
4 Evaluation
Programming for Linguistics
5 Results
Evaluation after removing all complex labels and empty categories.GL = gold labelsAL = auto labelsM = matches
Programming for Linguistics
5 Improvements
● Cross brackets● Remove only labels/categories● Consider context● Constituent types● Depends on task…
Thanks!
top related