b18 combining machine learning with flow cytometry ... › content.stockpr.com › anixa › ... ·...

1
RESULTS METHODS An overview of the methods are described in Figure 1. With IRB approval, peripheral blood was collected from 188 subjects prior to a transrectal ultrasound guided prostate (TRUSP) biopsy and 91 tumor free control donors (HD). Subjects were excluded from the study if they had a previous history of cancer (excluding active surveillance), had a medical intervention for prostate cancer, or were receiving a dihydrotestosterone DHT/alpha-1 blocker for active treatment of BPH. A total of 107 subjects with biopsy confirmed prostate cancer (PCa) and 81 subjects with biopsy confirmed BPH were included (Table 1). Peripheral blood mononuclear cells (PBMCs) were isolated 20 to 30 hours post-collection using standard methods. Myeloid and lymphocyte cell populations were analyzed on a BD LSRII flow cytometer (Table 2). Compensated channel values for each event were exported from FCS format to CSV format using FlowJo software. To prepare the data for network input, we used a procedure wherein each channel of flow cytometry data was used as an axis in a multidimensional space (Figure 2). Each axis was then divided into four segments and each event was defined by its segment location on each axis. For example, if the flow cytometry data has 2 channels, each channel is divided into four segments thus resulting in 4 2 or 16 volumetric regions in the multidimensional space; these regions are referred to as hypervoxels (Figure 3). For our study, two separate panels were used with one containing seven channels (lymphoid) and one containing ten channels (myeloid) resulting in each having 4 7 , or 16,384, hypervoxels and 4 10 , or 1,048,576, hypervoxels, respectively. A count was then made of the number of events falling within each hypervoxel resulting in a common feature for all samples. Each hyperspace was then converted to a one-dimensional vector and used as input for a separate NN. The outputs of the two networks were then used as an input to a third NN which integrates the two inputs generating the final classification. After data preparation, a series of feedforward ANNs were created with network inputs consisting of the numerical event counts in each hypervoxel; three datasets were then constructed: the training dataset ‘teaches’ the two output categories through backpropagation and fitting; the validation dataset evaluates the fit to minimize overfitting; and the test dataset ranks the trained networks against each other and estimates performance. Finally, a naïve testing set (never seen by the network) was used to evaluate overall performance of the top- ranking networks after voting. Our two-network approach was developed to predict whether a subject was at higher risk for PCa (HR-PCa) and should be recommended for biopsy (G≥4+3) or be actively monitored (BPH or G≤3+4). Neural Network 1 predicts whether the sample looks like HD or HR-PCa. If the sample is predicted to be a possible HR-PCa, then it is tested by Neural Network 2 and predicted as either benign/at lower risk (Benign/LR-PCa) for PCa or HR-PCa (Figure 1). Sensitivity and Specificity of 89% and 100%, respectively, were achieved by the two- network test distinguishing between naïve HD and HR-PCa samples (Table 3). For all samples, a sensitivity and specificity of 84% and 96%, respectively, were achieved. For samples that are either biopsy confirmed BPH or PCa (G≥6), the networks identified 8 out of 9 naïve samples correctly as needing a biopsy (Biopsy; G>3+4) and 20 out 36 naïve samples correctly as not needing a biopsy (No Biopsy; BPH/LR-PCa) resulting in a sensitivity and specificity of 89% and 56%, respectively (Table 4). For all subjects who underwent a TRUSP biopsy, 36 out of 43 were correctly identified as Biopsy and 102 out of 145 samples were correctly identified as No Biopsy. BACKGROUND AND PURPOSE We immunophenotyped various myeloid and lymphoid cell populations using standard multiparametric flow cytometry from healthy individuals along with subjects with biopsy-confirmed prostate cancer (PCa) or benign prostatic hyperplasia (BPH). Cell surface markers were included to identify myeloid-derived suppressor cell (MDSC) subsets a heterogeneous group of immature myeloid cells that suppress T-cell responses and support tumor progression. We created a technique for processing flow cytometry data to be used as inputs for feedforward artificial neural networks (ANNs), or multilayer perceptrons (MLPs), in order to categorize each sample. The purpose of this study was to use supervised machine learning to create a classifier that analyzes immunophenotyping flow cytometry data to distinguish individuals with a prostate malignancy. Combining machine learning with flow cytometry immunophenotyping of myeloid and lymphoid cell populations to distinguish subjects with prostate cancer (PCa) from benign prostatic hyperplasia (BPH) Presented at the 30 th Anniversary AACR Special Conference Convergence: Artificial Intelligence, Big Data and Prediction in Cancer; October, 14-17, 2018; Newport, Rhode Island, USA B18 Population Surface Markers eMDSC Lineage - HLA-DR - CD33 + PMN-MDSC CD14 - CD11b + CD33 + CD15 + SSC Hi M-MDSC CD14 + HLA-DR -/lo CD4 + T Cell CD3 + CD4 + CD8 + T Cell CD3 + CD8 + B Cell CD3 - SSC lo CD19 + NK Cell CD3 - SSC lo CD56 + PMN CD14 - CD15 + Monocyte CD14 + CD15 - LOX-1 + PMN 2 CD14 - CD15 + LOX-1 + George A. Dominguez 1* , John Roop 1 , Alexander Polo 1 , Anthony Campisi 1 , Dmitry I. Gabrilovich 2 , Amit Kumar 1 1 Anixa Biosciences, Inc., San Jose, CA; 2 The Wistar Institute, Philadelphia, PA HD BPH PCa Number 91 81 107 Median Age 53 62 67 Age Range 22 79 40 81 42 86 Gleason 6 41 Gleason 7(3+4) 23 Gleason 7(4+3) 22 Gleason 8 or higher 21 Stage T1c 76 Stage T2a 6 Stage T2c 1 Stage Unknown 24 CONCLUSIONS Based on this preliminary data, our technology would eliminate unnecessary biopsies for 56-70% of men who would ordinarily get a biopsy. By segmenting the data into hypervoxels, we are able to evaluate numerous complex relationships in multidimensional flow cytometer data simultaneously which cannot be done using traditional manual gating. Using this technique, we are able to predict which subjects should undergo a biopsy. To our knowledge, this is the first application of flow cytometry event data used to directly train artificial neural networks to detect the presence of a malignancy. We expect network performance to improve as the sample number increases. This technique may prove beneficial in a clinical setting by reducing the number of unnecessary TRUSP biopsies performed each year on patients suspected of prostate cancer (elevated PSA, abnormal DRE, etc.). We believe that this technology could be used for other types of binary classifications, such as predicting patient responses to immunotherapies and/or monitoring for the recurrence of tumors. Table 1. Demographics of healthy, benign, and prostate cancer participants References Gnjatic et al. J for Immuno Cancer. 5:44 2017. 2. Condamine et al. Science Immuno. 1:2 2016. Acknowledgements The authors would like to thank the patients and their families, all the clinical investigators, and the study staff from each participating sites. Contact information: [email protected] Table 2. Cell surface markers used for immunophenotyping Figure 1. Illustrated overview of methods in this study from sample collection to data processing and analysis. METHODS Continued 1 5 9 13 2 6 10 14 3 7 11 15 4 8 12 16 Channel 1 Channel 2 Hypervoxel Count 1 120 2 952 3 87 4 15 5 42 6 1214 7 602 8 15 9 62 10 1841 11 1642 12 1752 13 210 14 284 15 1211 16 5126 Figure 2. Hypothetical visualization of 10 channel multidimensional space Figure 3. Example of hypervoxel generation for two channels and event count. Table 3. Sensitivity and Specificity analysis for two-network test to predict HD vs HR-PCa neural network testing (NN1) Table 4. Sens. and Spec. for two-network test to distinguish between benign conditions and HR-PCa CD14 CD15 HLA-DR Lin(CD3/CD19/CD56 ) FL1 FL2 SSC FL1 FL2 FL3 1 954 635 845 693 2 548 632 745 985 3 256 354 966 325 . . . . . . . . . . . . N 456 789 654 987 Sensitivity & Specificity Analysis Neural Network 2 BPH/LR-PCa Monitor HR-PCa Biopsy Neural Network 1 HD Monitor HR-PCa NN2 HD BPH/ LR-PCa HR-PCa HR-PCa TRUSP Biopsy Import data into Artificial Intelligence (AI) Software Blood sample from subject identified as high risk for PCa Isolate PBMCs using density gradient Label PBMCs with fluorescent antibody conjugates Acquire events on flow cytometer Export channel values into CSV format for all events Naïve All HR- PCa HD HR- PCa HD Samples Tested 9 25 43 91 Predicted HR-PCa 8 0 36 4 HD 1 25 7 87 Sensitivity 89% 84% Specificity 100% 96% Accuracy 97% 92% Naïve All HR-PCa Benign/ LR-PCa HR-PCa Benign/ LR-PCa Samples Tested 9 36 43 145 Recommended Biopsy 8 16 36 43 No Biopsy (Monitor) 1 20 7 102 Sensitivity 89% 84% Specificity 56% 70% Accuracy 62% 73%

Upload: others

Post on 07-Jul-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: B18 Combining machine learning with flow cytometry ... › content.stockpr.com › anixa › ... · An overview of the methods are described in Figure 1. With IRB approval, peripheral

RESULTS

METHODSAn overview of the methods are described in Figure 1. With IRB approval, peripheral blood was

collected from 188 subjects prior to a transrectal ultrasound guided prostate (TRUSP) biopsy and 91

tumor free control donors (HD). Subjects were excluded from the study if they had a previous history

of cancer (excluding active surveillance), had a medical intervention for prostate cancer, or were

receiving a dihydrotestosterone DHT/alpha-1 blocker for active treatment of BPH. A total of 107

subjects with biopsy confirmed prostate cancer (PCa) and 81 subjects with biopsy confirmed BPH

were included (Table 1). Peripheral blood mononuclear cells (PBMCs) were isolated 20 to 30 hours

post-collection using standard methods. Myeloid and lymphocyte cell populations were analyzed on a

BD LSRII flow cytometer (Table 2). Compensated channel values for each event were exported from

FCS format to CSV format using FlowJo software.

To prepare the data for network input, we used a procedure wherein each channel of flow cytometry

data was used as an axis in a multidimensional space (Figure 2). Each axis was then divided into

four segments and each event was defined by its segment location on each axis. For example, if the

flow cytometry data has 2 channels, each channel is divided into four segments thus resulting in 42 or

16 volumetric regions in the multidimensional space; these regions are referred to as hypervoxels

(Figure 3). For our study, two separate panels were used with one containing seven channels

(lymphoid) and one containing ten channels (myeloid) resulting in each having 47, or 16,384,

hypervoxels and 410, or 1,048,576, hypervoxels, respectively. A count was then made of the number

of events falling within each hypervoxel resulting in a common feature for all samples. Each

hyperspace was then converted to a one-dimensional vector and used as input for a separate NN.

The outputs of the two networks were then used as an input to a third NN which integrates the two

inputs generating the final classification. After data preparation, a series of feedforward ANNs were

created with network inputs consisting of the numerical event counts in each hypervoxel; three

datasets were then constructed: the training dataset ‘teaches’ the two output categories through

backpropagation and fitting; the validation dataset evaluates the fit to minimize overfitting; and the

test dataset ranks the trained networks against each other and estimates performance. Finally, a

naïve testing set (never seen by the network) was used to evaluate overall performance of the top-

ranking networks after voting. Our two-network approach was developed to predict whether a subject

was at higher risk for PCa (HR-PCa) and should be recommended for biopsy (G≥4+3) or be actively

monitored (BPH or G≤3+4). Neural Network 1 predicts whether the sample looks like HD or HR-PCa.

If the sample is predicted to be a possible HR-PCa, then it is tested by Neural Network 2 and

predicted as either benign/at lower risk (Benign/LR-PCa) for PCa or HR-PCa (Figure 1).

✓Sensitivity and Specificity of 89% and 100%, respectively, were achieved by the two-

network test distinguishing between naïve HD and HR-PCa samples (Table 3). For

all samples, a sensitivity and specificity of 84% and 96%, respectively, were achieved.

✓For samples that are either biopsy confirmed BPH or PCa (G≥6), the networks

identified 8 out of 9 naïve samples correctly as needing a biopsy (Biopsy; G>3+4) and

20 out 36 naïve samples correctly as not needing a biopsy (No Biopsy; BPH/LR-PCa)

resulting in a sensitivity and specificity of 89% and 56%, respectively (Table 4).

✓For all subjects who underwent a TRUSP biopsy, 36 out of 43 were correctly identified

as Biopsy and 102 out of 145 samples were correctly identified as No Biopsy.

BACKGROUND AND PURPOSE❖ We immunophenotyped various myeloid and lymphoid cell populations using standard

multiparametric flow cytometry from healthy individuals along with subjects with biopsy-confirmed

prostate cancer (PCa) or benign prostatic hyperplasia (BPH).

❖ Cell surface markers were included to identify myeloid-derived suppressor cell (MDSC) subsets – a

heterogeneous group of immature myeloid cells that suppress T-cell responses and support tumor

progression.

❖ We created a technique for processing flow cytometry data to be used as inputs for feedforward

artificial neural networks (ANNs), or multilayer perceptrons (MLPs), in order to categorize each

sample.

❖ The purpose of this study was to use supervised machine learning to create a classifier that analyzes

immunophenotyping flow cytometry data to distinguish individuals with a prostate malignancy.

Combining machine learning with flow cytometry immunophenotyping of myeloid and lymphoid cell

populations to distinguish subjects with prostate cancer (PCa) from benign prostatic hyperplasia (BPH)

Presented at the 30th Anniversary AACR Special Conference – Convergence: Artificial Intelligence, Big Data and Prediction in Cancer; October, 14-17, 2018; Newport, Rhode Island, USA

B18

Population Surface Markers

eMDSC Lineage-HLA-DR-CD33+

PMN-MDSC CD14-CD11b+CD33+CD15+SSCHi

M-MDSC CD14+HLA-DR-/lo

CD4+ T Cell CD3+CD4+

CD8+ T Cell CD3+CD8+

B Cell CD3-SSCloCD19+

NK Cell CD3-SSCloCD56+

PMN CD14-CD15+

Monocyte CD14+CD15-

LOX-1+PMN2 CD14-CD15+LOX-1+

George A. Dominguez1*, John Roop1, Alexander Polo1, Anthony Campisi1, Dmitry I. Gabrilovich2, Amit Kumar1

1Anixa Biosciences, Inc., San Jose, CA; 2The Wistar Institute, Philadelphia, PA

HD BPH PCa

Number 91 81 107

Median Age 53 62 67

Age Range 22 – 79 40 – 81 42 – 86

Gleason 6 41

Gleason 7(3+4) 23

Gleason 7(4+3) 22

Gleason 8 or higher 21

Stage T1c 76

Stage T2a 6

Stage T2c 1

Stage Unknown 24

CONCLUSIONS

➢ Based on this preliminary data, our technology

would eliminate unnecessary biopsies for 56-70% of

men who would ordinarily get a biopsy.

➢ By segmenting the data into hypervoxels, we are able to

evaluate numerous complex relationships in

multidimensional flow cytometer data simultaneously

which cannot be done using traditional manual gating.

➢ Using this technique, we are able to predict which

subjects should undergo a biopsy.

➢ To our knowledge, this is the first application of flow

cytometry event data used to directly train artificial

neural networks to detect the presence of a malignancy.

➢ We expect network performance to improve as the

sample number increases.

➢ This technique may prove beneficial in a clinical setting

by reducing the number of unnecessary TRUSP

biopsies performed each year on patients suspected of

prostate cancer (elevated PSA, abnormal DRE, etc.).

➢ We believe that this technology could be used for other

types of binary classifications, such as predicting patient

responses to immunotherapies and/or monitoring for the

recurrence of tumors.

Table 1. Demographics of healthy, benign,

and prostate cancer participants

References Gnjatic et al. J for Immuno Cancer. 5:44 2017. 2. Condamine et al. Science Immuno. 1:2 2016. Acknowledgements The authors would like to thank the patients and their families, all the clinical investigators, and the study staff from each participating sites. Contact information: [email protected]

Table 2. Cell surface markers used for

immunophenotyping

Figure 1. Illustrated overview of methods in this study from sample collection to data processing and analysis.

METHODS Continued

1

5

9

13

2

6

10

14

3

7

11

15

4

8

12

16

Channel 1

Ch

an

ne

l 2

Hypervoxel Count

1 120

2 952

3 87

4 15

5 42

6 1214

7 602

8 15

9 62

10 1841

11 1642

12 1752

13 210

14 284

15 1211

16 5126

Figure 2. Hypothetical visualization of 10

channel multidimensional space

Figure 3. Example of hypervoxel generation

for two channels and event count.

Table 3. Sensitivity and Specificity analysis for

two-network test to predict HD vs HR-PCa

neural network testing (NN1)

Table 4. Sens. and Spec. for two-network test to

distinguish between benign conditions and HR-PCa

CD14

CD15HLA-DR

Lin(CD3/CD19/CD56

)

FL

1

FL2

SSC FL1 FL2 FL3

1 954 635 845 693

2 548 632 745 985

3 256 354 966 325

. . . .

. . . .

. . . .

N 456 789 654 987

Sensitivity

&

Specificity

Analysis

Neural Network 2

BPH/LR-PCa Monitor

HR-PCa Biopsy

Neural Network 1

HD Monitor

HR-PCa NN2

HDBPH/

LR-PCa

HR-PCa HR-PCa

TRUSP Biopsy

Import data into Artificial

Intelligence (AI) Software

Blood sample from subject

identified as high risk for PCa

Isolate PBMCs using

density gradient

Label PBMCs with fluorescent

antibody conjugates

Acquire events on flow

cytometer

Export channel values into

CSV format for all events

Naïve All

HR-

PCaHD

HR-

PCaHD

Samples Tested 9 25 43 91

Pre

dic

ted

HR-PCa 8 0 36 4

HD 1 25 7 87

Sensitivity 89% 84%

Specificity 100% 96%

Accuracy 97% 92%

Naïve All

HR-PCaBenign/

LR-PCaHR-PCa

Benign/

LR-PCa

Samples Tested 9 36 43 145

Re

co

mm

en

de

d

Biopsy 8 16 36 43

No Biopsy

(Monitor)1 20 7 102

Sensitivity 89% 84%

Specificity 56% 70%

Accuracy 62% 73%