patient reported outcome measures (proms) for …pediatric diabetes 11, 364–375cooper h., cooper...
TRANSCRIPT
-
Patient Reported Outcome
Measures (PROMs) for use
with children and adolescents:
a view across disciplines
Dr Gillian Lancaster
Postgraduate Statistics Centre
Lancaster University, UK [email protected]
-
Funded by Diabetes UK
Supported by NIHR Research Network: Children
-
Acknowledgments
Principal Investigator: Prof Helen Cooper
Research Associates: Joy Spencer, Sara Wheeler Department of Community Health and Wellbeing, University of Chester
Statisticians: Gill Lancaster, Andrew Titman Department of Mathematics and Statistics, Lancaster University
IT Specialist: Mark Johnson Dept of Games Computing/Creative Technologies, University of Bolton
Clinical Psychologist: Rebekah Lwin Alder Hey Children’s NHS Trust and University of Chester
-
DIFFERENT by Clere Parsons (1908-31)
“Not to say what everyone else was saying
not to believe what everyone else believed
not to do what everybody did,
then to refute what everyone else was saying
then to disprove what everyone else believed
then to deprecate what everybody did,
was his way to come by understanding
how everyone else was saying the same as he was saying
believing what he believed
and did what doing”.
-
-ICT improved knowledge, psychosocial well-being and self-care skills, with trends toward improving glycaemic control.
4
3
1Cooper et al (2009) Pediatric Diabetes; 2Spencer, Cooper & Milton (2009) Pediatric Diabetes; 3Spencer, Cooper & Milton (2013). Diabetic Medicine; 4Spencer & Cooper (2011) Practical Diabetes.
-
Adolescent Diabetes Management: Complex Adaptive System (CAS)
Blood Glucose Friends
Puberty
Exercise
Family
Alcohol
School
Emotions
Body image
Blood glucose tests
Lifestyle
Health &
illness
Independence &
autonomy
Insulin Diet
-
Friends Family
Alcohol Emotions
Blood glucose tests
Insulin Diet
Successive UK National Paediatric DiabeBlood Gtes lucoseAudits show (here for 2013-14):
Only Independence 15.8% achieved recomme& nded glycaemic targets, with 25.9% at high risk of future complicaautonomytions Puberty
90% of glycaemic variability is due to service related factors, inc. standards Body imageand delivery of diabetes self-care Lifestyleeducation, only 45.2% receiving structured education
27.5% Exerciseof those aged 12 or over had high blood Schoolpressure, > 7% had excess protein in their urine, >14% had early signs of eye disease, and nearly 1 in 4 was obese Health &
Just 16% received all 7 of the recommended health care processes illness
Adolescent Diabetes Management: Complex Adaptive System (CAS)
-
5
COMPLEXITY SCIENCE/EXPERIENTIAL LEARNING/CHANGE CYCLE
Cooper & Geyer. (2007) Riding the Diabetes Roller Coaster: Oxford: Radcliffe Publishing.
Cooper & Geyer (2008) Social Science and Medicine.
-
Fitness Landscape
Troughs Poor fitness
(poor control)
Peaks
High fitness
(good control)
In-between Neutral fitness (mod. Control)
Designed to model
chemical & species
interactions as
evolutionary processes
Basic rules for survival
on a fitness landscape:
– Adaptability
– Flexibility
– Learning
– Balance
Gill T.G. (2008) Reflections on Researching the Rugged Fitness Landscape, Int. Jl. Emerging Transdiscipline, vol.11.
-
Study Aims • To create a Diabetes Fitness Landscape
• Primary goal = adaptation + balance
• To develop capability to manage condition
where
• Main participants are the patients/families
• Experts are helpers on their journeys
• No endpoint to learning
• Mistakes are a learning tool i.e. learning from
experience is key
-
Paediatric Diabetes Education*
Age and maturity appropriate programmes
Individual Needs Assessment → Tailored Diabetes Self Management Education
Onset of DM
Paediatric long term care
< 5 years, primary, adolescents
Adult
long term care
*Adapted from a slide by Sheridan Waldron
-
SE
D
EV
EL
OP
ME
NT
PH
A
TESTING PHASE
RESPONDANT VALIDATION PHASE
-
SE
D
EV
EL
OP
ME
NT
PH
A
70 items
7 domains
109 items
8 domains
112 items
7 domains
194 items
6 domains
129 items
6 domains
TESTING PHASE
117 items
6 domains
RESPONDANT VALIDATION PHASE
-
Study participants
Sample: • Mersey Region Children's Diabetes Network - 4 paediatric diabetes
centres: Alder Hey, Chester, Leighton and Whiston
Inclusion criteria: • Age 12-18 years
• Able to read and write
• Minimum diabetes duration of 1 year
• Ability to complete a questionnaire unaided
• Computer literate
Exclusion criteria: • Significant medical disease in addition to diabetes
• Diagnosed psychiatric disorder or neuro-cognitive disorder
• Drug and/or alcohol dependency
-
Data Analysis
Pilot study (n=52) graphs to assess item performance, HbA1c
Main study (n=155) • Item performance eg. graphs, levels of missing data, skewness of
responses (low and high response rates)
• Devised 3-point Likert scale to give traffic-light feedback (red, amber, green), simple signposting
Reliability • Internal consistency – item total correlations
• Test-retest reliability in subgroup of young people (kappa statistics)
Validity • Concurrent - correlation with SMOD-A tool domains
• Convergent – correlation with HbA1c
• Item response analysis to validate additive score
• Respondent – focus groups at end of study
-
ADNAT: 117 questions divided between 6 domains
No. of quests. answered depends on insulin regime and lifestyle choices
Questions prompt self-awareness: education focus
Assessment of core needs: 20 questions = overall self-care score & 16 self-
perception questions = self-evaluation of psycho-social health
Positively evaluated for appropriateness & readability.
Tests for validity: SMOD-A* (r = -0.41, p < 0.001); weak correlation with
HbA1c (r = -0.16, p=0.056) but this compared favourably with SMOD-A (r= -
0.06, p=0.52).
Item Response Analysis validated use of a simple additive score
ADNAT DOMAINS 1. ALL ABOUT ME 2. PHYSICAL ACTIVITY 3. EATING 4. MEDICATION
5. BLOOD GLUCOSE MONITORING 6. LIVING WITH DIABETES.
*Self-management of Diabetes in Adolescence/USA
-
BG testing,
managing BG,
hypo/hyperglycaemia
Frequency,
responsibilities,
forgetfulness, motivators
& barriers to injecting,
use of set programs
(pump)
Illness, going out for the day,
sleep over, smoking, alcohol,
going to parties, eye & foot
care, personal support &
understanding, self-appraisal
of life quality
Age, gender,
diagnosis, home
situation, height,
weight, insulin
regime, feelings
Types, intensity,
duration, setting,
barriers, reasons,
managing blood
glucose levels
Meal & food
choices,
responsibilities,
management of
diabetes, weight
Consensus Meeting
117 Items:
81 Educational
20 self-care score
16 Psychosocial
(self-perception)
-
20 Self-care score items
-
20 Self-care score items
Previously asked:
How many portions of fruit and vegetables do you normally eat in a day?
-
ADNAT SCORING ALGORITHM FOR 20 GENERIC QUESTIONS
Low score = low educational need, high score = high educational need Scoring range: 0 – 40
Do
ma
in Item
No.
Question Red (score = 2)
>25
Overall = 26-40
Amber (score = 1)
>12
Overall = 13-25
Green (score = 0)
Overall = 0-12
2
16 How many hours of pulse-raising exercise
or physical activity did you do last week?
Less than 1 hour
1-2 hours
3-4 hours 5-6 hours
7-8 hours More than 8 hours
18 What stops or prevents you from starting
to do exercise or physical activity?
I am not very good at it
I don’t like/enjoy
exercise or sport
Would rather watch TV
or play computer
games
My weight
I don’t have time
If no red options ticked + at
least one of:
It is difficult to manage my diabetes when I do
exercise
I don’t have the opportunity
It is difficult to do exercise with an insulin
pump
If no red or amber options
ticked + at least one of:
Nothing Being Ill eg. infection If my blood glucose is
low
21 What makes it difficult to manage your
blood glucose levels when exercising or
doing physical activity?
Exercise makes my
diabetes control worse
I am frightened of
getting a hypo
I’d need to take more
insulin
It is difficult with my
pump
If no red options ticked + at
least 1 of:
I’d need to eat more If my blood sugar is low
Only ticked:
I don’t find it difficult
22 What usually happens to your blood
glucose levels when you do exercise or
physical activity?
They usually go higher
I don’t know
I end up having a hypo
They vary They stay the same
They usually go lower but not so I am hypo
3
34 Do you eat fruit and/or vegetables? No or rarely Sometimes but not every day
Yes, usually every day
35 Do you eat treats such as sweets,
chocolate, fast food, takeaways?
Yes usually every day Sometimes but not every day
No or rarely
-
Proposed Path of learning
-
ADNAT combines reflective awareness raising with needs assessment to
facilitate patient-centred clinical consultations.
“I think it raised awareness cause obviously I know where I’m going wrong but I think it raised further awareness/you need to properly register and accept you need to change something about your diet or your lifestyle”
“a private space to address my illness”.
“..when I’m on the computer I feel as though it’s just me on that computer, not everyone’s got access to it....so I feel as though I’m safe”
“We had an actual area to talk on instead of going in, just having a chit chat about general health”
-
6. CREATING AN APP
http://blog.williamferriter.com/2013/07/11/technology-is-a-tool-not-a-learning-outcome/
http://blog.williamferriter.com/2013/07/11/technology-is-a-tool-not-a-learning-outcome/
-
6. CREATING AN APP
http://blog.williamferriter.com/2013/07/11/technology-is-a-tool-not-a-learning-outcome/
Assess Needs
(ADNAT)
Carers/Choice/
Talking Diabetes
etc.
Young person
led
http://blog.williamferriter.com/2013/07/11/technology-is-a-tool-not-a-learning-outcome/
-
Version 1
Website: www.adnat.co.uk
http://www.myadnat.co.uk/
-
Version 2
ADNAT App Website: www.myadnat.co.uk
http://www.myadnat.co.uk/
-
Version 3
Future development
Red Ninja Studios - Design and Technology
We
design
the
future
http://www.redninja.co.uk/
-
7. FUTURE PLANS
DEVELOPMENT PHASE √
TESTING PHASE √
RESPONDANT VALIDATION PHASE √
ADNAT √
Website √ www.myadnat.co.uk
APP for mobile devices √
EVALUATION PHASE √
Longitudinal cohort study of ADNAT
-
References • Cooper H, Spencer J, Lancaster G, Titman A, Johnson M, Wheeler S, Lwin R. (2014)
Development and psychometric testing of the on-line ‘Adolescent Diabetes Needs
Assessment Tool’ instrument. Journal of Advanced Nursing 70(2), 454–468.
• Cooper H., Spencer J., Lancaster J., Johnson M. & Lwin R. (2012) Perceptions of the clinical usefulness of the Adolescent Diabetes Needs Assessment Tool (ADNAT).
Diabetes Care for Children and Young People 1(2), 55–61.
• Spencer J., Cooper H. & Milton B. (2010) Qualitative studies of Type 1 diabetes in adolescence: a systematic literature review. Pediatric Diabetes 11, 364–375Cooper H.,
Cooper J. & Milton B. (2009) Technology-based approaches to patient education for
young people living with diabetes: a systematic literature review. Pediatric Diabetes 10,
474–483.
• Spencer J. & Cooper H. (2011) A multidisciplinary paediatric diabetes health care team: perspectives on adolescent care. Practical Diabetes 28, 210–2.
• Spencer J., Cooper H. & Milton B. (2013) A qualitative phenomenological study to explore the lived experiences of young people (13-16 years) with type 1 diabetes and
their parents. Diabetic Medicine 30, e17–e24.
• Cooper H. & Geyer R. (2007) Riding the Diabetes Roller Coaster: A New Perspective for Patients, Carers and Health Professionals. Radcliffe Publishing, Oxford.
• Cooper H. & Geyer R. (2008) Using ‘Complexity’ for improving educational research in health care. Social Science and Medicine 67, 177–182.
-
Malawi Developmental
Assessment Tool (MDAT)
-
Acknowledgments Principal Investigator: Melissa Gladstone
MD supervisors: Gill Lancaster, Prof Ros Smyth Institute of Child Health, University of Liverpool
Ashley Jones Centre for Medical Statistics and Health Evaluation, University of Liverpool
Ed Umar, Maggie Nyirenda, Edith Kayira, Ken Maleta Depts of Comm Health/Paediatrics/Wellcome Trust lab, Blantyre College of Medicine
APPLe trial: Prof Nynke van den Broek Liverpool School of Tropical Medicine
Lungwena Cohort study: Prof Per Ashorn, Ed Mtitimila Dept International Health, Tampere University Medical School, Finland
-
1. Background
•80% of the worlds disabled population (600 million) live in low income countries, many in Africa
•Early identification of children with disabilities can reduce the impact of impairment later in life –World Health Organisation - priority area
•Developmental delays in infants are often impossible to detect through routine physical examination –may only come to light once children start school
• Developmental assessment tools are therefore vital for identifying these children as early as possible
-
Developmental assessment
•Developmental tools in current use are: –Denver Developmental Screening Test, Denver II –Bayley’s Scales of Infant Development (BSID) –Griffith’s Developmental Scales –Kaufman Assessment Battery for Children (K-ABC)
•But they are designed and mainly validated in Western countries – may include tasks and materials alien to African
children
•Tools may fail to identify and assess children appropriately in an African setting
-
Example: Denver II tool American tool assessing 4 main sections:
a) Gross Motor skills eg. rolls over (3 months), walks
well (12 months), hops (3-4 years)
b) Fine Motor skills eg. grasps rattle (3 months),
scribbles (12 months), copies circle (3-4 years)
c) Social skills eg. smiles responsively (1-2 months),
drinks from cup (12 months), plays board/card
games (3-4 years)
d) Language eg. laughs (2 months), speaks 3 words
(12 months), names 4 colours (3-4 years)
-
Setting milestones in rural Africa
Culturally important milestones may differ in Africa from the West
For example:
In Malawi: tells stories to friends and parents, knows songs and dances, carries water on head (Kambalametore et al., 2000)
In Kenya: works in fields, integrates well into community (Levine et al., 1994)
More emphasis on social intelligence (eg. social duties and roles, obedience and good behaviour)
-
Aims Aim: To develop a more culturally acceptable child developmental assessment
tool for use in rural Africa
Preliminary Study: Lungwena in 2000/1
(n=1130 children, aged 0-6 years)
Main Study: Blantyre area in 2006/7
(n=1446 children)
-
Research Teams
Main Study: Blantyre area (as part of Azithromycin Presumptive
Preterm Labour Trial (APPLe trial))
Preliminary Study: Lungwena (as part of Lungwena Child Survival
Cohort Study)
-
2. Methodology Item selection
Examine existing evidence & tools in literature;
focus groups to develop themes; translation of
items; build on work done in preliminary study.
Item review and reduction Group items into hypothesised domains; PPI
involvement – research midwives and villagers; review
items by language expert and research team.
Pre-testing (n=6-19) Face and content validity and
comprehensiveness of items; Item
modification/elimination.
Piloting (n=80) Testing procedures with 6 research midwives;
review meeting every 2 weeks for 6 weeks;
wording, ceiling effects, item gaps, use timers.
Quantitative Item Reduction and Validation (n=1446) Assess ch ildren; Item by ite m plots, item reduction, item invariance;
reliability; consensus meeting; produce ‘norms’; scoring and validity
testing.
-
Main stages of development
-
Preliminary study • Western tools (Denver, Griffiths) assessed by MG, Malawian
doctors and language expert from University of Malawi
– culturally appropriate questions translated into Yao; some items modified
• Focus groups (8 female research workers) - new items added
• Piloting - items removed, modified, re-translated
Items excluded: ‘prepares cereal’, ‘uses a spoon and fork’
Items modified: ‘uses stick to draw on ground’ or ‘piece of chalk on
concrete floor’ instead of ‘uses a pencil and paper’; used objects such
as plastic cup, spoon, plate for recognition rather than pictures of them
• 1130 children 0-6 yrs were assessed; methodology developed
• Validity (face and content, respondent) was evaluated
• 97% GM, 79% FM, 91% Language, 49% Social items reliable
-
Qualitative study - focus groups • Malawian Professionals in English (physiotherapy, occupational health, nursing, special needs, paediatrics)
• Villagers in Yao or Chichewa (men, women, mothers, fathers, grandparents)
Mothers focus group in Namitambo Grandmothers focus group
-
Area of
development
discussed:
Quotes relating to social development from village
focus groups:
Area of
village:
Social duties and
chores:
“If it is a girl, can go and fetch water. If it is a boy, can go to
buy salt from the grocery” (Grandparents)
“When it’s a male child and you have taken him to the
garden, he will cry for the hoe and you know that he is
really a man. If you have not taken 2 hoes, you can not
work”. (Grandparents)
1
1
Community
roles:
“my child is 2 years 8 months....everything that has
happened at the house he tells the relatives when they are
back from school... showing that he is growing well”
(women).
2
Good behaviour
and obedience:
“When a child of this age is sent to draw drinking water in
the house and when giving it, the child kneels down, you
know that this child is intelligent (women)”
“When you call the child to come, she kneels down and
says what are you calling me for?” (women)
2
3
Feeding and
toileting:
“At 1 year 2 months, any mother knows that her child is
giving signs of visiting the toilet, so you undress the child
and tell them to pass stool” (mothers)
“you go with the child to the toilet and hold the child to
avoid him falling inside” (grandparents).
3
4
Intelligence and
cognition:
“If it is a female child, when sees a mother putting a pot on
the fire, she takes a tin and places it somewhere and
pretends cooking nsima and we also see that she is
intelligent”(mothers)
2
1 Bangwe (semi–urban)
2 Nguludi (rural)
3 Namitambo (rural)
4 Mikolongwe (rural)
-
Main study participants • Quota sample – 1446 children aged 0-6 years from Blantyre
in Southern Malawi (preparation on pilot sample of n=80)
• Recruited from Azithromycin Presumptive Preterm Labour Trial (APPLe) trial
– Pregnant women presenting to local antenatal clinics between June 2006 and July 2007 with normal healthy child aged 0-6 years; asked to bring child to next clinic
–3 rural areas, 1 semi-urban area
• Children assessed at clinic by 6 local research assistants
• Exclusion criteria: children born prematurely (
-
Data Analysis
Pilot study (n=80) over 6 weeks, discussions with research midwives
Main study (n=1446) • Item by item performance - logistic regression graphs, covariate gender
• Pass/fail scoring for each item and domain
Reliability
• Intra-observer reliability, same child, same observer, 2 weeks apart (n=124)
• Inter-observer reliability (immediate) same child, same day (n=56)
• Inter-observer reliability (delayed) same child, later same day (n=52)
Validity
• Face validity - Do the items look acceptable to untrained judges?
• Content validity - Does the tool examine all the domains it is meant to measure?
• Construct validity – extreme groups (disabled n=80, malnourished n=120, compared to age matched normal children)
-
3. Outcomes and results • 1446 of 1657 children approached, were eligible & assessed
• Focus groups (professionals and village) useful for
developing the gross motor, language and social domains
• Piloting - questions removed, modified, re-translate
• Excellent reliability (inter-observer immediate kappa>0.75
for 99% items, inter-observer delayed for 89% items, intra-
observer 2-wk for 71% items; remaining items k>0.4, 2 poor)
• Consensus meeting - final set of items agreed (34 items in
each of the 4 domains)
• Validity - high sensitivity/specificity across domains; disabled
vs normals 63.9 points lower, malnourished 14.9 points lower
http:kappa>0.75
-
Gross motor skills (draft MDAT III)
-
Assessment Basket of props
-
Item by Item Analysis • Logistic regression was carried out for each item with decimal age as a covariate in the model to obtain
predicted probabilities
• The observed percentage of children in each of the 33
age groups passing an item was found to obtain
observed grouped probabilities.
• Graphs were then drawn of decimal age (for the 33 age
groups) against the predicted probability of passing, the
observed probability of passing and the observed 0/1
responses.
• This was done for all the question in the four domains
-
Example: Fine Motor Question ‘Transfers from hand to hand’
0.2
.4.6
.81
Pro
bability
0 1 2 3 4 5 6 7Age (years)
Predicted Observed grouped Observed
FM08
-
Assessing the fit of the models
•For each question the ‘goodness of fit’ of the model was tested using the Hosmer and Lemeshow test statistic.
•A p-value of less than 0.05 was taken as statistically significant.
• If the model fitted the data well, then the ages corresponding to 25%, 50%, 75% & 95% percent passing, were determined
•Questions that gave a poor fit to the data, were refitted using triple split joined spline regression.
-
Example: Gross Motor Question
‘Walks backwards’
0.2
.4.6
.81
Pro
bab
ility
0 1 2 3 4 5 6 7Age (years)
Predicted Observed grouped Predicted Spline
GM17
-
Examples of poor questions
0.2
.4.6
.81
Pro
bability
0 1 2 3 4 5 6 7Age (years)
Predicted Observed grouped Observed
FM17
0.2
.4.6
.81
Pro
ba
bili
ty
0 1 2 3 4 5 6 7Age (years)
Predicted Observed grouped Observed
GM08
‘Stands holding on’ ‘Moulds ball with clay’
0.2
.4.6
.81
Pro
ba
bili
ty
0 1 2 3 4 5 6 7Age (years)
Predicted Observed grouped Observed
SOC 16
0.2
.4.6
.81
Pro
babili
ty
0 1 2 3 4 5 6 7Age (years)
Predicted Observed grouped Observed
SOC 04
‘Can put clothes on with help’ ‘Spends most of time on mum’s back’
-
Reliability
-
Consensus Meeting
•Once all questions had been analysed, an ‘expert’ panel met to review which questions
should remain, be modified or removed
•Each question was judged on: –Graphical representation
–Goodness of fit on logistic and spline regression
–Inter and intra-observer reliability
–Interpretability by participants and researchers
-
Charts of ‘norms’ – main study
-
Scoring ‘Fail’ if failed 2 items before line at 90th percentile ‘norm’
-
Validity
-
5. Conclusion •Malawi Developmental Assessment Tool
(MDAT) with normal reference values to
assess child development up to 6 years of
age in a rural Africa setting
•Useful as: – clinical tool for the early identification
of neuro-disabilities
– school assessments by teachers
– outcome measure in international intervention
programmes designed to improve child
development
-
6. Future Work – scoring methods
• PhD student (Phillip Gichuru)
• Item by item scoring – Estimate the age of a ‘normal’ child passing an item
at a specified probability using: GLM - Logistic Regression
GAM - Generalized Additive Models Unconstrained Generalized Additive Models
SCAM - Shape Constrained Additive Models
• Overall scoring – Using all the responses of a child to obtain an overall score to characterise a child’s developmental status given their age using: Simple score versus model based methods (IRT)
Z-score versus smoothed Z-score
-
GAMLSS Smoothed Z-Scores
Figure a) : Empirical Z-score means and standard
deviations Figure b): GAMLSS Smoothed Z-means and standard
deviations
-
Density Comparison – Smoothed Z-Scores
Fig. a): Scatter plot of smoothed scores versus age in normal, disabled
and malnourished samples in Gross Motor domain.
Fig. b): Density plot of the smoothed Z-scores in Gross Motor
domain for Normal, disabled and malnourished samples.
-
GENERAL PRINCIPLES AND
STATISTICAL ISSUES
-
Construct a Research Plan Item selection
xamine existing evidence & tools in literature,
olicy documents, guidelines; focus groups,
terviews – develop themes; systematic review.
E
p
in
Item review and reduction Group items into hypothesised domains; PPI
involvement - young people/carers on steering group;
Delphi study to review/rate items (esp.if newly created)
Pre-testing (n=10-20) Cognitive interviews; Face and content
validity and comprehensiveness of
items; Item modification/elimination.
Piloting (n=50) Test procedures; mean, range of responses,
% missing data, floor and ceiling effects; item
by item plots; initial reliability assessment.
Quantitative Item Reduction and Validation (n=300) Distribute questionnaire; item performance, item-total correlations,
item reduction, item invariance; consensus meeting to finalise tool;
devise scoring method; internal consistency, reliability, validity.
-
Scoring of items
•Theoretically driven – domain scores and/or overall scores, relationship with age
eg. MDAT (gross motor, fine motor, language and
social domains)
•Logistic regression – weighting of items, predictive score eg. EARLI
•Item-response analysis (eg. Rasch model, graded response model)
eg. ADNAT, new work on MDAT (GAMLSS, IRT)
-
Reliability Concerned with the consistency of measurement
•Test-retest reliability –assesses the repeatability of the method – are assessments made by the same person on two
separate occasions under the same conditions reliable?
• Inter-rater reliability –assesses the reproducibility of the method – do two different raters agree in their assessments using
the same method under the same conditions?
• Internal reliability (eg. item total correlations, Cronbach’s alpha)
-
Validity Concerned with the accuracy of measurement
• Construct validity
– Convergent validity - should perform similarly to other
known constructs eg. ADNAT to HbA1c levels
– Divergent validity eg. should differentiate between
normal/abnormal people (extreme groups)
–Criterion validity •How well does the method compare to an established ‘gold standard’ accurate method in terms of sensitivity/specificity
•Concurrent or predictive
• Respondent validity
-
PROM Design - a wider perspective Stage Issues Study design Methods of analysis
1. Developing the
instrument
(a) Physiological Device Does this device agree
sufficiently well with
one in current use?
How do I handle
repeated measures
and duplicates?
How do I compare this
device to a known gold
standard?
Method comparison
study
Diagnostic test study
1Mean difference, 95% limits of
agreement 2Kappa with 95% CI
1ANOVA or confirmatory factor
analysis 1,2Multi-level modelling 1ROC curve, Area under Curve
with 95% CI 2Sensitivity, specificity, positive
predicted value, 95% CI
(b) Questionnaire Creation of items
Do the items (eg.
measured on Likert
scale) work reasonably
well?
Focus groups,
interviews, expert
review
Pilot study to evaluate
item performance on a
small sample
Qualitative analysis to identify
themes and assess face, content
validity
Mean, minimum, maximum
responses; % missing; floor and
ceiling effects; item-total
correlations
1measurements taken on a continuous scale (can include ordinal data with many categories) satisfying the assumptions of the method, 2binary and/or ordered categorical measurements
-
Stage Issues Study design Methods of analysis
2. Establishing
reliability and validity
a) Reliability Does the instrument
give stability and
consistency of
measurement?
Does the instrument
have good internal
properties
(reliability)?
(for questionnaire
type instruments
typically using
dichotomous items
or Likert scales)
Intra-rater/test-retest
(using the same
assessor on two
different occasions)
Inter-rater (using 2
different assessors on
the same occasion)
Item performance
evaluation on larger
sample
Examine internal
structure and number
of domains
1ICC (also mean difference, 95%
limits of agreement may be
useful) 2Kappa, weighted kappa with
95% CI, latent class analysis
As in Stage 1 above for pilot
study.
Cronbach’s alpha, split-half
coefficient, Kuder-Richardson 20
method 1,2Factor analysis, 2item-response
analysis
Reference: Lancaster GA, Statistical issues in the assessment of health outcomes in children: a methodological review. JRSS A 2009, vol 172.
-
Stage Issues Study design Methods of analysis
b) Validity Does the instrument
measure what it is
supposed to be
measuring?
Convergent/divergent
validity
Extreme groups;
known-groups;
concurrent validity
using external criterion;
predictive validity
Criterion validity (with
gold standard)
1Correlation (Pearson or
Spearman rank)
1T-test, Mann-Whitney U, Cuzick
test for trend, ANOVA, Kruskal-
Wallis etc. 2Chi-squared test
As in Stage 1 above for diagnostic
test study
-
Stage Issues Study design Methods of analysis
3. Measuring change over
time and discriminating
between subjects
Magnitude of change
Measurement error
Multiple measurements
How do I assess the
magnitude of change
from baseline to
endpoint?
Is there a better way of
taking into account pre-
test/baseline measures
(eg. to avoid regression
to mean)?
How do I handle
measurement error
(particularly in dietary
data)?
How do I account for
multiple measurements
taken over time?
Pilot study and/or
main intervention
study
Randomised or non-
randomised two group
comparison on large
sample
Take additional
measurements using a
reference instrument
1Cohen’s effect size, Guyatt’s
responsiveness statistic, standard
error of measurement
1ANCOVA to compare groups 2Logistic/ordinal regression
Use reference instrument (eg. diet
diary, biomarker, till receipts) to
calibrate/adjust results
Select the baseline and most
important follow up time point only
and analyse as above;
Use summary measures (eg. mean,
peak, area under curve, gradient)
to obtain one measure for each
child; 1,2Longitudinal data analysis,
growth curve modelling
-
Stage Issues Study design Methods of analysis
4. Reference values for
a healthy population
Reference range
Gender-specific
Age-specific
Z-scores
Anthropometric
measurements
How do I create reference
values for a normal,
healthy population?
What if the
measurements vary by
gender (usually bimodal
distribution)?
What if the
measurements vary for
children of different
ages?
How do I measure and
compare deviations from
the average across
different groups of
children?
Is there a way of dealing
with variability in growth
and non-linear change
eg. in weight for age,
weight for height, BMI?
Take measurements from
a large consecutive or
random sample of
children from school or
community
Separate the sample
measurements into those
for boys and girls
Separate sample
measurements into age
groups (could take quota
sample);
Use whole sample
1Mean, 95% reference range
or 1Median, 2.5th and 97.5th centiles
Calculate reference range for each
gender group separately
Calculate age-specific reference ranges
for each age group separately 1Linear regression, fractional
polynomials 2Logistic regression, regression splines
Express measurements in terms of z-
score units from the mean (ie.
reference mean for that age and
gender) and compare summary
statistics for z-scores across groups
Z-scores, LMS (Lambda, Mu, Sigma)
method (skewed data), GAMLSS
(Generalised Additive Models for
Location, Shape and Scale)
-
References • Gladstone M., Lancaster G.A., Umar E., Nyirenda M., Kayira E., Van Den
Broek N. and Smyth R.L. (2010). The Malawi Developmental Assessment
Tool (MDAT): creation, validation, and reliability of a tool to assess child
development in rural African settings. Public Library of Science (PLoS)
Medicine, 7(5): e1000273.
• Gladstone M., Lancaster G.A., Umar E., Nyirenda M., Kayira E., Van Den Broek N. and Smyth R.L. (2010). Perspectives of normal child development
in rural Malawi - a qualitative analysis to create a more culturally appropriate
developmental assessment tool. Child: care, health and development 36(3):
346-353.
• Gladstone M., Lancaster G.A., Jones A.P., Maleta K., Mtitimila E., Ashorn P. and Smyth R.L. (2008). Can Western developmental screening tools be
modified for use in a rural Malawian setting? Archives of Disease in
Childhood 93: 23 – 29.
• Lancaster G.A. (2009) Statistical issues in the assessment of health outcomes in children: a methodological review. JRSS A 172(4): 707-727.
Patient Reported Outcome Measures (PROMs) for use with children and adolescents: a view across disclipinesPatient Reported Outcome Measures (PROMAcknowledgments. Statisticians: Gill Lancaster, Andrew TiIT Specialist: Mark Johnson DIFFERENT. Adolescent Diabetes Management: Complex Fitness Landscape. Study Aims. Paediatric Diabetes Education*. Study participants .Data Analysis .20 Self-care score items. ADNAT SCORING ALGORITHM FOR 20 GENERIC QProposed Path of learning. 6. CREATING AN APP Version 1 Website: www.adnat.co.uk Version 2 ADNAT App Version 3. Future development. 7. FUTURE PLANS References. Malawi Developmental .Assessment Tool (MAcknowledgments. Ashley Jones Ed Umar, Maggie Nyirenda, Edith Kayira, 1. Background. Developmental assessment. Example: Denver II tool. Setting milestones in rural Africa. Aims. Research Teams. 2. Methodology. Main stages of development. Preliminary study. Qualitative study -focus groups. Main study participants. Data Analysis .3. Outcomes and results. Gross motor skills (draft MDAT III). Assessment. Item by Item Analysis. Example: Fine Motor Question. ‘TransfersAssessing the fit of the models. Example: Gross Motor Question .‘Walks baReliability. Consensus Meeting. Charts of ‘norms’ – main study. Scoring Validity. 5. Conclusion. 6. Future Work – scoring methods. GAMLSS Smoothed Z-Scores. Density Comparison – Smoothed Z-Scores GENERAL PRINCIPLES. AND .STATISTICAL ISSConstruct a Research Plan. Scoring of items. Reliability. Validity. PROM Design -a wider perspective. References.