model selection for adaptive testing · data set paper test of mathematical knowledge (domain of...

22
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Model Selection For Adaptive Testing Martin Plajner, Jirka Vomlel The Institute of Information Theory and Automation Czech Academy of Sciences 16.7.2015 Bayesian Applications Workshop UAI Amsterdam Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 1 / 18

Upload: others

Post on 21-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Model Selection For Adaptive Testing

Martin Plajner, Jirka Vomlel

The Institute of Information Theory and AutomationCzech Academy of Sciences

16.7.2015Bayesian Applications Workshop

UAIAmsterdam

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 1 / 18

Page 2: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Adaptive testing

Selecting subset of questionsShorter test versionsIndividual sets of questions

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 2 / 18

Page 3: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Data set

Paper test of mathematical knowledge (domain of functions)

4 grammar schools

281 test subjects (students)

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 3 / 18

Page 4: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Test description

29 questions (mathematical problems) - 53 sub questions

Example question

Decide which of the following functions

f (x) = x2 − 2x − 8

g(x) = −x2 + 2x + 8

is decreasing in the interval (−∞,−1].

Rated correct(0) / incorrect(1)or graded 0 - 4

Total maximum of 120 points

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 4 / 18

Page 5: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Test results, correlations

Table: Average test scores of the four grammar schools.

GS1 GS2 GS3 GS4

42.76 46.68 46.35 43.65 44.53

Table: Correlation of the grades and the test total score.

Mathematics Physics Chemistry

-0.60 -0.42 -0.41

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 5 / 18

Page 6: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Bayesian network models

Each model consist of

A set of n variables {S1, . . . , Sn} - skills.

S will denote the multi variable (S1, . . . , Sn) taking statess = (s1, . . . sn).

A set of m variables {X1, . . . ,Xm} - questions (mathematicalproblems)

X will denote the multi variable (X1, . . . ,Xm) taking statesx = (x1, . . . , xm).

A set of arcs between variables

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 6 / 18

Page 7: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Models’ options

Options summary

Question grading scales Boolean numericNumber of skill nodes 1 7Skill nodes states 2 3Additional information NO YES

Bayesian network models7 models for Boolean scale7 models for numeric scale

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 7 / 18

Page 8: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Network model with one skill node and additionalinformation

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 8 / 18

Page 9: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Expert network model

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 9 / 18

Page 10: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Adaptive test

Select a next question

Ask the question

Update the network

Next question selection

Selection based on information gaini.e. reduction of expected entropy

Cumulative Shannon entropy

H(e) =n∑

i=1

∑si

−P(Si = si |e) · logP(Si = si |e)

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 10 / 18

Page 11: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Next question selection

Expected entropy for the selection of X ′

EH(X ′, e) =

p∑j=1

P(X ′ = x ′j |e) · H(e ∪ {X ′ = x ′j})

Question X ∗ minimizing the expected entropy

X ∗ = argminX ′∈Xs

EH(X ′, e)

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 11 / 18

Page 12: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Model evaluation

10 fold cross-validation

For every (281) test repeat:

Select a questionAsk the selected questionInserted answer as evidence into the BNDistribute and collect evidenceEstimate subsequent answers

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 12 / 18

Page 13: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Model evaluation

We measure the success rate of a test

⇒ By comparing the estimated value to the observed value

Success rate

SRts =

∑Xi∈Xs+1

I (x∗i = x ′i )

|Xs+1|, where

I (expr) =

{1 if expr is true0 otherwise

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 13 / 18

Page 14: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Boolean graded question answers

0 10 20 30 40

0.65

0.75

0.85

0.95

step

succ

ess

rate

b2+b3b2e

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 14 / 18

Page 15: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Numeric graded question answers

0 10 20 30 40

0.65

0.70

0.75

0.80

0.85

0.90

0.95

step

succ

ess

rate

n2+n3n2e

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 15 / 18

Page 16: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Question selection order

0 5 10 15 20 25 30

010

2030

40

steps

vars

Step

Que

stio

n ID

0 5 10 15 20 25 30

steps

vars

Step

Que

stio

n ID

0 5 10 15 20 25 30

stepsva

rs

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 16 / 18

Page 17: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Conclusions

Proposed models provide reasonable results

3-stated skill variables provide better results

Expert model does not score as good as expected

Additional information is not as important

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 17 / 18

Page 18: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Future research

Explore more options of parameters selection

Comparison with other model types

standard Rash, IRT, other methods (neural network)

Practical issues (fairness, constraints,. . . )

−4 −2 0 2 4

0.2

0.6

1.0

x

f

−4 −2 0 2 40e+

003e

−10

x

f

−4 −2 0 2 40e+

004e

−21

x

f

−4 −2 0 2 40e+

006e

−26

x

f

−4 −2 0 2 40e+

004e

−30

8e−

30

x

f

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 18 / 18

Page 19: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Future research

Explore more options of parameters selection

Comparison with other model types

standard Rash, IRT, other methods (neural network)

Practical issues (fairness, constraints,. . . )

−4 −2 0 2 4

0.2

0.6

1.0

x

f

−4 −2 0 2 40e+

003e

−10

x

f

−4 −2 0 2 40e+

004e

−21

x

f

−4 −2 0 2 40e+

006e

−26

x

f

−4 −2 0 2 40e+

004e

−30

8e−

30

x

f

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 18 / 18

Page 20: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Future research

Explore more options of parameters selection

Comparison with other model types

standard Rash, IRT, other methods (neural network)

Practical issues (fairness, constraints,. . . )

−4 −2 0 2 4

0.2

0.6

1.0

x

f

−4 −2 0 2 40e+

003e

−10

x

f

−4 −2 0 2 40e+

004e

−21

x

f

−4 −2 0 2 40e+

006e

−26

x

f

−4 −2 0 2 40e+

004e

−30

8e−

30

x

f

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 18 / 18

Page 21: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Future research

Explore more options of parameters selection

Comparison with other model types

standard Rash, IRT, other methods (neural network)

Practical issues (fairness, constraints,. . . )

−4 −2 0 2 4

0.2

0.6

1.0

x

f

−4 −2 0 2 40e+

003e

−10

x

f

−4 −2 0 2 40e+

004e

−21

x

f

−4 −2 0 2 40e+

006e

−26

x

f

−4 −2 0 2 40e+

004e

−30

8e−

30

x

f

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 18 / 18

Page 22: Model Selection For Adaptive Testing · Data set Paper test of mathematical knowledge (domain of functions) 4 grammar schools 281 test subjects (students) Martin Plajner, Jirka Vomlel

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

Future research

Explore more options of parameters selection

Comparison with other model types

standard Rash, IRT, other methods (neural network)

Practical issues (fairness, constraints,. . . )

−4 −2 0 2 4

0.2

0.6

1.0

x

f

−4 −2 0 2 40e+

003e

−10

x

f

−4 −2 0 2 40e+

004e

−21

x

f

−4 −2 0 2 40e+

006e

−26

x

f

−4 −2 0 2 40e+

004e

−30

8e−

30

x

f

Martin Plajner, Jirka Vomlel Model Selection For Adaptive Testing 16.7.2015 18 / 18