ecnis web-based course in molecular epidemiology in cancer biomarker validation
DESCRIPTION
ECNIS Web-based course in Molecular Epidemiology in Cancer Biomarker Validation. Slide B1. 1. - PowerPoint PPT PresentationTRANSCRIPT
ECNIS Web-based course inMolecular Epidemiology in Cancer
Biomarker Validation
Slide B1.1
Commenti:intraindividual variation si riduce con biomarkers per
esempio con la misurazione di POPs! (PCB ecc) perchè sono long-term e stabili (lipofilici)
- adducts in cord blood? (Perera)
http://www.aacrmeetingabstracts.org/cgi/content/abstract/2005/1/512-b
Characteristics of ideal biomarkers
1. Sensitive and specific2. Relation with exposure3. Standardized and validated4. Relatively easy to perform5. Non-invasive6. High throughput7. Inexpensive
Slide B1.3
Levels of validation:
Intra-individual variationInter-individual variation
and confounding
Intra-laboratory variationInter-laboratory variation
Validity (vs a standard) and predictive value
Time relationships
Dose-response
Ability to predict outcome
Slide B1.4
QUALITY CONTROL OF A BIOMARKER: MEASUREMENT ERROR
Measurement error is classified as:
preanalytical (biological and sampling error) Oranalytical (laboratory) error
Laboratory error focuses on method, instrument, reagent or matrix effects.
Preanalytical Error
- individual genetic, environmental, behavioural and health status-related variability (including smoking status, weight and weight loss, physical exercise)
Example of genetic source of variation: FOLATE AND MTHFR
Health status-related: retinol or ascorbate and trauma, several biomarkers and inflammation
- sampling error: within subject variation due to hourly, daily, weekly, monthly … changes
To reduce Analytical Errors – Quality Control measures
Example of Quality Control Program: national cholesterol Education Program (US)
Goals:
1. Attain analytical accuracy and precision (<3% cv)2. Identify individual determinants of cholesterol variation (lifestyle factors)3. Identify clinical determinants of variation (metabolic states, illness)4. Other sampling sources (fasting status, posture, serum vs. plasma)
The NCEP guidelines have proven adequate to ensure 90% correct classification
Overall measure of error is the
COEFFICIENT OF VARIATION = SD/MEAN x 100%(SD in repeated measurements)
CV is ideally calculated for samples at the bottom, middle and top of the reference concenttration range determined in healthy subjects
Log transformation is even better (Rappaport book)
OTHER EXAMPLES OF QUALITY CONTROL:
- Gunter et al (1996), international round-robin for folate involving 20 labs: CV of 27% for serum folate, and 36% for whole blood folate, with substantial intermethod variation
- Pfeiffer et al (1999), interlaboratory comparison of homocysteine in plasma samples (14 labs): CV=9% among labs, and 6% within labs
TWO MAJOR APPROACHES TO REDUCING MEASUREMENT ERROR ARE:
1. TO BLIND THE ANALYST TO THE CASE-CONTROL STATUS OF SPECIMENS
2. TO ELIMINATE SYSTEMATIC DIFFERENCES IN THE WAY CASE AND CONTROL SPECIMENS ARE HANDLED
Validation and relevance: some examples
Inter-centre variation (and potential confounding) for an intermediate marker (plasma DNA amount in EPIC)
Slide B1.11
Genetic alterations in plasma DNA
* Useful when tumours not available
* Good concordance between tumour and plasma mutations
* When does tumour DNA appear in the blood?
* Can plasma DNA be used as a biomarker for genotoxic exposure?
Slide B1.12
GENAIR DNA concentration
0200
400600
8001000
12001400
16003
96
7
73
13
39
39
51
71
23
57
26
37
28
75
32
39
35
05
45
55
48
21
55
21
36
87
52
97
74
13
59
74
59
60
68
41
64
78
67
02
MOC number
DN
A c
on
ce
ntr
ati
on
(n
g/m
l)
DNA concentration sorted by EPIC number (origin)
Utrecht
Slide B1.13
Univariate and multivariate analysis: plasma DNA amount (logarithm transformation, dependent variable), by center, age, gender and time between blood drawing and diagnosis (for cases only).
Univariate analysis:Variable F-value DF p-value
Controls only (N=778)Center 11.23 22 <0.0001Age 5.21 1 (a) 0.023Gender 0.52 1 0.47
Cases and controls (N=1185):F-value DF p-value
Center 16.6 23 <0.0001Age 1.56 1 (a) 0.21All deathsand tumours 2.3 6 0.03
Slide B1.14
Biomarkers vs external/other measurements
Cotinine measurements. Means, SD and distribution by detectable (greater than 0.05 ng/ml) and undetectable levels, by ETS status. Only subjects with ETS information (N=374). Values of cotinine greater than 10 ng/ml (N=11) excluded (Vineis et l, BMJ 2004).
ETS status Mean cotinine (N, SD) Yes 0.55 (189, 0.96)No 0.17 (174, 0.49)p-value<0.0001 (Wilcoxon Rank-sum test)
ETS status Cotinine Detectable Undetectable
Yes 89 100No 37 137
OR=3.30 (95% CI 2.07, 5.23) p<0.0001
Cotinine and ETS (environmental tobacco smoke) from questionnaires
Slide B1.16
Bulky DNA adducts and dose-response relationship (Peluso et al, AJE 2001
Slide B1.17
Comments
- The fact that adducts and other markers are related to exposure does not imply that they are a better measure
- Biomarkers can increase biological plausibility of associations
- They can be useful for example if it is possible to show that intra-individual variability is lower with the marker than with external exposure measurements
- They can address issues such as saturation of enzymes at high levels of exposure (dose-response, risk assessment)
- DNA adducts are an integrated marker (over several sources of exposure) that expresses also individual susceptibility (eg for DNA repair), and can be predictive of cancer onset
Slide B1.18
Genotypes vs exposures
Comparison of four genotyping methods at the Cambridge laboratory. The standard is represented by a panel evaluation of all results (courtesy of A Dunning).
Method Sensitivity % Specificity %
ASO 836/864 97 753/836 90
Taqman 826/864 96 812/826 98
RsaI digest 125/173 72 103/125 82
Invader 62/92 67 45/62 73
Genotyping
Slide B1.20
Problems with studies on gene-environment interactions:
- low study power- frequent false positives due to multiple testing
- functional data often missing- early studies not confirmed by subsequent larger or better
conducted studies- publication bias
Slide B1.21
Effects of random classification error on relative risk estimates
R=correlation coefficients between the measurement of exposure/genotype by different assessors and a reference standard, and the resulting observed relative risks (modified from Hankinson et al, 1994, ref. 3).
True relative risks (RRt)Assessor R 1.5 2.0 2.5
Observed relative risks
1 0.10 1.1 1.1 1.12 0.60 1.3 1.5 1.73 0.80 1.4 1.8 2.14 0.90 1.4 1.9 2.3
Observed RR=exp (ln RRt*R)
Slide B1.22
According to estimates, the common genotyping method Taqman has 96% sensitivity and 98% specificity, thus allowing little error in classification.
On the contrary, sensitivity in environmental exposure assessment is quite often lower than 70% and specificity even lower.
Slide B1.23
Genotype is stable, measured accurately (sens, spec=90-100%), frequency of alleles is high
Environmental exposures are changing (life-course events), often measured inaccurately, frequency may be
too low
Slide B1.24
In addition, genetic polymorphisms are investigated with high-throughput technologies that allow researchers to investigate hundreds of thousands of SNP at a time:
with the usual p-values this originates a large number of false positives (see Bayesian strategy proposed by
Colhoun et al, Lancet 2003 361: 865-872)
In environmental research false negatives are an important problem
Slide B1.25
Limitations of current biomarker studies
Some markers are not very reliable (e.g. interlaboratory variation for adducts)
Biological meaning not always clear (e.g. mutations in plasma DNA)
Long gap between marker development and its validation Unknown or unsatisfactory time relationships between
exposure, marker measurement, disease Usually only one spot biosample available (little known on
intra-individual variation) Little known on potential confounders
Slide B1.26
1. Knowledge of pharmacokinetics and relevance of measurements in time
2. Insights in reasons for inter- and intra-individual variationMeasured variation = Inter + Intra + variation in
assay
3. Surrogate vs. Target tissueWBC vs. lung tissue in smokers
4. Comparison with other Biomarkers (“gold standard”)for example 1-OH-Pyrene in urine
Further issues in validation of biomarkers and examples (from R Godschalk)
Slide B1.27
Exposure to cig smoke lung DNA damage
DNA damage/mutagenesis
Cancer
Cig. smoke (PAH)
ActivationCYP450s MPO
PAH-DNA adducts
DNA repairNER BER
DetoxicationGSTsNATs
Neutrophils
Activation
MPO
H2O2
HOCl
+Cl
ROS
Cig. smoke (Particles)
and inflammationand inflammation
Slide B1.28
Characteristics of ideal biomarkers
1. Sensitive and specific -2. Relation with exposure +3. Standardized and validated -4. Relatively easy to perform +/-5. Non-invasive +6. High throughput -7. Inexpensive +
Immunocytochemical staining of PAH-DNA adducts in Mouth BrushesImmunocytochemical staining of PAH-DNA adducts in Mouth Brushes
Slide B1.29
The use of Induced Sputum (IS) in smoking-related DNA adducts analyses
Department of Health Risk Analysis and Toxicology, Maastricht University, Maastricht,
The Netherlands
Objectives • To study the applicability of Induced Sputum (IS) as source of lung derived cells
• To establish correlation between DNA adduct levels in IS derived cells and smoking intensity
• To compare DNA adduct levels in IS with PBL
Slide B1.30
Intra-individual DNA adduct analysis
Intra-individual variation in IS is higher than in PBL
Slide B1.31
When quantitating the adduct levels in Induced Sputum of certain individuals considerable variation could be observed. We could not find a reason for that since smoking habits and dietary conditions were kept similar over time as much as possible. There is also some variation in PBL but to a lower extent.
When quantitating the adduct levels in Induced Sputum of certain individuals considerable variation could be observed. We could not find a reason for that since smoking habits and dietary conditions were kept similar over time as much as possible. There is also some variation in PBL but to a lower extent.
Characteristics of ideal biomarkers
1. Sensitive and specific +/-
2. Relation with exposure +
3. Standardized and validated +/-
4. Relatively easy to perform -
5. Non-invasive +/-
6. High throughput -
7. Inexpensive +/-
Postlabeling of DNA adducts in Induced SputumPostlabeling of DNA adducts in Induced Sputum
Slide B1.32
Stability of DNA adducts in PBL of smokers
Godschalk et al. CEBP 1998 Godschalk et al. CEBP 1998 Slide B1.33
21 weeks stopped smoking. Decay curve exponentially. BG in nonsmokers.21 weeks stopped smoking. Decay curve exponentially. BG in nonsmokers.
Saturation in DNA adduct levels at high exposure levels
A) DNA adduct levels B) efficiency of DNA adduct formation
in (○) smoking and (•) non-smoking aluminium workers exposed to PAH
1 foundry2 electrolysis3 bake oven4 anode factory5 pot-relining department
A) DNA adduct levels B) efficiency of DNA adduct formation
in (○) smoking and (•) non-smoking aluminium workers exposed to PAH
1 foundry2 electrolysis3 bake oven4 anode factory5 pot-relining department
Van Schooten et al. Mut Res 1997Van Schooten et al. Mut Res 1997Slide B1.34
Correlation between surrogate and target tissue
Wiencke et al (1999) J Natl Cancer Inst;91(7):614-9.
HOWEVER,
* Not all studies find such a relationship.
* DNA adduct levels in PBL/ WBC of low exposed subjects are often below or near the limit of detection
Slide B1.35
Differences between tissues
The example of cigarette smoke exposure
0
2
4
6
8
10
12
14
16
DL = detection limit
Add
ucts
per
108
nuc
leot
ides
Lung BAL IS Mono- Lympho- Granulo- cytes cytes cytes
WBC
Slide B1.36
Overall Conclusions
Type of tissue and choice of biomarker depends on research goal
Most biomarkers AND surrogate tissues still need further validation.
Laboratory validation‘Field’ validation
Non-invasive and high throughput methodologies are required for Molecular Epidemiology studies
Slide B1.37
The end