Language Testing (Second Bimester, April – August 2011)
Universidad Técnica Particular de Loja. Academic Term: April – August 2011. Program: English. Instructor: Mgs. Orlando Lizaldes E. Semester: Sixth. Bimester: Second.
![Page 1: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/1.jpg)
LANGUAGE TESTING (ENGLISH)
Second Bimester
Language School
Teacher: Orlando V. Lizaldes E.
April – August 2011
![Page 2: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/2.jpg)
Second Bimester
• 5 Validity
• 6 Measurement
• 7 The social character of language tests
• 8 New directions – and dilemmas?
http://www.google.com/imgres
![Page 3: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/3.jpg)
Testing is a matter of using data to establish evidence of learning.
![Page 4: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/4.jpg)
What makes a good test good?
Its qualities: Reliable, Valid, Practical.
There’s no such thing as a “good test”.
![Page 5: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/5.jpg)
Validity
• Validity
• Reliability (standardized tests)
• Inference
• Judgment
• Test Validation
![Page 6: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/6.jpg)
Testing the test
Validity
Reliability
![Page 7: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/7.jpg)
Key questions in assessment
Validity: does this test measure what it is supposed to measure?
Reliability: does this test or instrument consistently measure what it is supposed to measure?
![Page 8: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/8.jpg)
The harder of the two concepts is…
Reliability. It doesn’t really apply very often to classroom teachers or classroom-based tests.
![Page 9: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/9.jpg)
Reliability. Conceptual understanding
• It may refer to a complete test or to individual items on the test.
• It deals with consistency of measurement: giving the same test to the same group of students should produce the same results.
• Reliability is rarely checked in classroom-based teaching. We really don’t have time to give the same test over and over to the same person to see whether the test is reliable or not. For high-stakes tests: YES.
![Page 10: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/10.jpg)
VALID TEST
Remember:
T: V = R (a valid test is also reliable)
T: R ≠ V (a reliable test is not necessarily valid)
![Page 11: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/11.jpg)
EXAMPLE:
![Page 12: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/12.jpg)
Validity: the degree to which the test actually measures what it is intended to measure.
VALIDITY
• Content
• Construct
• Face
• Consequential
![Page 13: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/13.jpg)
If there is no validation:
There is potential for unfairness and injustice.
The potential is in proportion to what is at stake.
The validation procedure guarantees the FACE VALIDITY of the test.
![Page 14: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/14.jpg)
MEASUREMENT
What is measurement? It is the estimation of a physical quantity such as distance, energy, temperature, or time. A measurement finds the ratio of some physical quantity to a standard quantity of the same type; thus a measurement of length is the ratio of a physical length to some standard length, such as a standard meter.
![Page 15: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/15.jpg)
MEASUREMENT
Assessment usually involves allocating a score, an attractively simple number.
“A rose is a rose is a rose” (Gertrude Stein, Sacred Emily)
A score is not a score is not a score, because different raters may give the same performance different scores.
Measurement = a dauntingly technical field = means, percentiles, standard deviations and statistics.
![Page 16: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/16.jpg)
Measurement always involves some error, and so in science measurements are accompanied by error bounds.
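One common way to attach an error bound to a test score is the standard error of measurement from classical test theory, SEM = SD × √(1 − reliability). The sketch below is only an illustration of that idea: the scores, the reliability value of 0.84, and the function name are all made up for the example and do not come from the slides.

```python
import math
import statistics


def standard_error_of_measurement(scores, reliability):
    """Classical test theory estimate: SEM = SD * sqrt(1 - reliability)."""
    sd = statistics.pstdev(scores)  # population standard deviation of the observed scores
    return sd * math.sqrt(1.0 - reliability)


# Hypothetical class scores (out of 20) and an assumed reliability coefficient.
scores = [20, 18, 17, 14, 12, 16, 15, 19]
reliability = 0.84

sem = standard_error_of_measurement(scores, reliability)
observed = 16
print(f"Observed score {observed}: error band roughly "
      f"{observed - sem:.1f} to {observed + sem:.1f}")
```

The lower the reliability, the wider the band around any single observed score.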
![Page 17: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/17.jpg)
QUANTIFICATION
The assigning of numbers and scores.
MATH PROCEDURES
Used to look for various kinds of mathematical and statistical patterning within the matrix of scores, in order to investigate the extent to which necessary properties are present in the assessment.
![Page 18: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/18.jpg)
Investigating the properties of individual test items
Investigating rater characteristics is important for guaranteeing the meaningfulness and fairness of performance assessment. (ITEM ANALYSIS).
Item analysis is a normal part of test development
PILOT OPERATIONAL
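As a rough illustration of what item analysis on pilot data can look like, the sketch below computes two widely used statistics for a single item: facility (the proportion of candidates answering correctly) and a simple discrimination index (facility in the top-scoring half minus facility in the bottom-scoring half). The data and function names are hypothetical, and real test development may use other indices.

```python
def item_facility(responses):
    """Proportion of candidates who got the item right (1 = correct, 0 = wrong)."""
    return sum(responses) / len(responses)


def item_discrimination(responses, total_scores):
    """Facility in the top half of candidates minus facility in the bottom half.

    Candidates are ranked by total test score; needs at least two candidates.
    """
    ranked = sorted(zip(total_scores, responses), key=lambda pair: pair[0], reverse=True)
    half = len(ranked) // 2
    top = [response for _, response in ranked[:half]]
    bottom = [response for _, response in ranked[-half:]]
    return item_facility(top) - item_facility(bottom)


# Hypothetical pilot data: one item's responses and each candidate's total score.
item = [1, 1, 1, 0, 1, 0, 0, 0]
totals = [19, 18, 17, 15, 14, 11, 9, 8]

print(item_facility(item))                # 0.5
print(item_discrimination(item, totals))  # 0.5
```

An item that nearly everyone gets right or wrong, or that weaker candidates answer better than stronger ones, would be a candidate for revision before the operational version.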
![Page 19: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/19.jpg)
Correlation coefficient r
It expresses the extent to which one set of scores is predictable from another. In general r ranges from −1 to 1; reliability coefficients are reported on a scale from 0 to 1.
• Reliability coefficient
• Inter-rater reliability
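To make the idea concrete, here is a minimal sketch that computes Pearson's r between the scores two raters gave to the same six performances. The ratings are invented for illustration, and Pearson's r is only one of several ways inter-rater agreement can be estimated.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equally long score lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariance / math.sqrt(spread_x * spread_y)


# Hypothetical scores (out of 20) given by two raters to the same six essays.
rater_a = [14, 17, 12, 19, 15, 10]
rater_b = [13, 18, 12, 20, 14, 11]

print(round(pearson_r(rater_a, rater_b), 2))
```

When two raters broadly agree on who performed better and who performed worse, the coefficient comes out close to 1.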
![Page 20: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/20.jpg)
Norm-referenced and Criterion-referenced MEASUREMENT
Norm-referenced Measurement (N-R-M) adopts a framework of comparison between individuals for understanding the significance of any single score.
In Criterion-referenced Measurement (C-R-M), individual performances are evaluated against a verbal description of a satisfactory performance at a given level.
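The contrast can be sketched in a few lines. In this illustration a norm-referenced reading places a score relative to the rest of the group (a percentile rank), while a criterion-referenced reading compares it with a fixed standard. Note the simplification: the slides describe the criterion as a verbal description of performance, whereas this sketch stands in a numeric cut score for it. The group scores, the cut score, and the function names are hypothetical.

```python
def percentile_rank(score, group_scores):
    """Norm-referenced view: percentage of the group scoring below this score."""
    below = sum(1 for s in group_scores if s < score)
    return 100.0 * below / len(group_scores)


def meets_criterion(score, cut_score):
    """Criterion-referenced view: compare with a fixed standard, ignoring other candidates."""
    return score >= cut_score


# Hypothetical group results (out of 20) and a hypothetical cut score.
group = [19, 18, 17, 15, 14, 11, 9, 8]
CUT_SCORE = 16

candidate = 15
print(percentile_rank(candidate, group))      # 50.0
print(meets_criterion(candidate, CUT_SCORE))  # False
```

The same score of 15 looks average against this group yet falls short of the fixed standard, which is why the two frameworks can lead to different decisions.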
![Page 21: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/21.jpg)
Criterion-referenced
They are not always easily defined in a yes/no judgment.
![Page 22: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/22.jpg)
Norm-referenced
Scores may not be consistent across instruments.
www.utpl.edu.ec
![Page 23: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/23.jpg)
Bell curve of a normal distribution
http://www.google.com/imgres?imgurl=http://classes
![Page 24: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/24.jpg)
CENTRAL TENDENCY
The Central Tendency of a distribution is an estimate of the “center” of a distribution of values.
http://www.google.com/images?imgurlstr=http://centraltendency
![Page 25: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/25.jpg)
CENTRAL TENDENCY
There are three major types of estimates of Central Tendency:
- Mean
- Median
- Mode
![Page 26: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/26.jpg)
CENTRAL TENDENCY
The Mean or average is probably the most commonly used method of describing central tendency.
![Page 27: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/27.jpg)
CENTRAL TENDENCY
The Mean
To compute the mean, add up all the values and divide by the number of values.
![Page 28: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/28.jpg)
CENTRAL TENDENCY
The Mean
For example: 20, 20, 20, 18, 17, 14, 14, 12
The sum of these 8 values is 135.
135 / 8 = 16.875
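The same calculation can be checked in a couple of lines. One assumption is needed: the slide states 8 values with a sum of 135, and those two figures hold only if the eighth score is 12, so 12 is used here.

```python
# Eight scores from the example; the final value 12 is an assumption
# implied by the stated count (8) and sum (135).
scores = [20, 20, 20, 18, 17, 14, 14, 12]

total = sum(scores)          # add up all the values
mean = total / len(scores)   # divide by the number of values

print(total, len(scores), mean)  # 135 8 16.875
```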
![Page 29: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/29.jpg)
CENTRAL TENDENCY
The Median
It is the score found at the exact middle of the set of values. One way to compute the median is to list all scores in numerical order, and then locate the score in the center of the sample.
![Page 30: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/30.jpg)
The Median
EXAMPLES:
15, 15, 15, 15, 15, 17, 18, 20
There are 8 scores, and scores #4 and #5 represent the halfway point. Since both these scores are 15, the median is 15.
Example: find the median of {12, 3 and 5}.
Put them in order: 3, 5, 12
The middle number is 5, so the median is 5.
![Page 31: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/31.jpg)
CENTRAL TENDENCY
If the two middle scores have different values, you have to interpolate to determine the median.
Suppose there are fourteen numbers, so we don't have just one middle number; we have a pair of middle numbers:
3, 5, 7, 12, 13, 14, 21, 23, 23, 23, 23, 29, 40, 56
In this example the middle numbers are 21 and 23.
To find the value halfway between them, add them together and divide by 2:
21 + 23 = 44
44 ÷ 2 = 22
And so the median in this example is 22.
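The median procedure described on these slides can be sketched as a short function: sort the scores, take the middle one when the count is odd, and average the two middle ones when it is even. The function name is illustrative; Python's standard `statistics` module provides the same result through `statistics.median`, and `statistics.mode` gives the most frequent value, the third estimate listed earlier.

```python
import statistics


def median(values):
    """Middle score of the sorted list; average of the two middle scores if the count is even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


fourteen = [3, 5, 7, 12, 13, 14, 21, 23, 23, 23, 23, 29, 40, 56]

print(median([12, 3, 5]))                        # 5
print(median([15, 15, 15, 15, 15, 17, 18, 20]))  # 15.0
print(median(fourteen))                          # 22.0
print(statistics.mode(fourteen))                 # 23
```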
![Page 32: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/32.jpg)
The social character of language tests
Educational assessment has traditionally drawn its concepts and procedures from the field of Psychology.
When test reforms are introduced within the educational system, they are likely to figure prominently in the press and become matters of public concern.
![Page 33: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/33.jpg)
Conventional proficiency tests have been used for purposes of exclusion.
Industrialized countries have developed more flexible policies for the recognition and certification of specific work-related skills (competencies).
International students need to meet a standard on a language test for academic purposes.
![Page 34: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/34.jpg)
Computers and Language Testing
The proponents of computer-based testing can point to a number of advantages. First, scoring of fixed-response items can be done automatically, and the candidate can be given a score immediately. Second, the computer can deliver tests that are tailored to the particular abilities of the candidate.
![Page 35: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/35.jpg)
It seems inefficient for all candidates to take all the questions on a test; clearly some are so easy for some candidates that they provide little information on their abilities; others are too hard to be of use. It makes sense to use the very limited time available for testing to focus on those items that are just within, and just beyond, a candidate’s threshold of ability.
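The idea of tailoring items to the candidate can be sketched as a simple loop: always give the unused item whose difficulty is closest to the current ability estimate, then move the estimate up after a correct answer and down after a wrong one. This is a deliberately simplified illustration, not how any particular operational test works; real adaptive tests typically rely on statistical models to estimate ability. The item bank, difficulty scale, step sizes, and function names are all hypothetical.

```python
def run_adaptive_test(item_bank, answers_correctly, start_ability=0.0, step=1.0, max_items=3):
    """Give the unused item closest in difficulty to the current ability estimate."""
    ability = start_ability
    remaining = dict(item_bank)  # item id -> difficulty
    administered = []
    for _ in range(min(max_items, len(remaining))):
        # Choose the item just within or just beyond the current estimate.
        item_id = min(remaining, key=lambda i: abs(remaining[i] - ability))
        difficulty = remaining.pop(item_id)
        correct = answers_correctly(difficulty)
        administered.append((item_id, difficulty, correct))
        # Raise the estimate after a correct answer, lower it after a wrong one.
        ability += step if correct else -step
        step = max(step / 2, 0.125)  # smaller adjustments as evidence accumulates
    return ability, administered


# Hypothetical item bank and a simulated candidate who can handle difficulty up to 1.5.
bank = {"q1": -2.0, "q2": -1.0, "q3": 0.0, "q4": 1.0, "q5": 2.0, "q6": 3.0}
estimate, log = run_adaptive_test(bank, lambda difficulty: difficulty <= 1.5)

print(estimate)                     # 1.25
print([item for item, _, _ in log]) # ['q3', 'q4', 'q5']
```

In this run the very easy items q1 and q2 and the very hard item q6 are never used, which is the efficiency argument made above.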
![Page 36: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/36.jpg)
The use of computers for the delivery of test materials raises questions of validity. For example, different levels of familiarity with computers will affect people’s performance with them, and interaction with the computer may be a stressful experience for some students or candidates (McNamara, 2000, pp. 79-81).
![Page 37: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/37.jpg)
New directions
• Computer-based tests (CBT)
• Do raters react differently to printed versus handwritten texts?
• Semi-direct tests of speaking (cheaper to administer, but they raise questions of validity since there’s no COMMUNICATION at all)
![Page 38: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/38.jpg)
Summing up
Language testing remains a complex and perplexing activity.
Language testing is an uncertain and approximate business at the best of times, even if to the outsider this may be camouflaged by its impressive, even daunting, technical trappings (McNamara, Language Testing, p. 86).
![Page 39: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/39.jpg)
Consulted Bibliography
McNamara, T. (2000). Language Testing. Oxford University Press. London.
Heaton, J. B. (1998). Classroom Testing. Keys to Language Teaching. Longman. New York (USA).
Richards, J. C. (2005). Communicative Language Teaching. Cambridge Univ. Press.
Brown, H. D. (2004). Language Assessment. Principles and classroom practices. Longman. United States.
IBT Tests (2004). McGraw-Hill.
Freeman, D., Richards, J. C. (2001). Teacher Learning in Language Teaching. Pearson. USA.
O’Malley, J. M., Valdez Pierce, L. (1996). Authentic assessment for English language learners. Practical approaches for teachers. Longman. USA.
![Page 40: LENGUAGE TESTING (II Bimestre Abril Agosto 2011)](https://reader035.vdocuments.site/reader035/viewer/2022062319/55493493b4c905144d8b45ce/html5/thumbnails/40.jpg)
THANK YOU