vesicle: volumetric evaluation of synaptic interfaces using … · 2015. 12. 15. · vesicle:...

1
VESICLE: Volumetric Evaluation of Synaptic Interfaces using Computer Vision at Large Scale William Gray Roncal 1,2 · [email protected] Michael Pekala 2 · [email protected] Verena Kaynig-Fittkau 3 · [email protected] Dean M Kleissas 2 · [email protected] Joshua T Vogelstein 4,5 · [email protected] Hanspeter Pfister 3 · pfi[email protected] Randal Burns 1 · [email protected] R Jacob Vogelstein 2 · [email protected] Mark A Chevillet 2 · [email protected] Gregory D Hager 1 · [email protected] 1 Johns Hopkins University, Baltimore, Maryland Department of Computer Science 2 JHU Applied Physics Laboratory, Laurel, Maryland Research and Exploratory Development 3 Harvard University, Cambridge, Massachusetts School of Engineering and Applied Sciences 4 Johns Hopkins University, Baltimore, Maryland Department of Biomedical Engineering, 5 Johns Hopkins University, Baltimore, Maryland Institute for Computational Medicine An open challenge at the forefront of modern neuroscience is to obtain a comprehensive mapping of the neural pathways that underlie human brain function; an enhanced understanding of the wiring diagram of the brain promises to lead to new breakthroughs in diagnosing and treating neurological disorders. Inferring brain structure from image data, such as that obtained via electron microscopy (EM), entails solving the prob- lem of identifying biological structures in large data volumes. Synapses, which are a key communication structure in the brain, are particularly dif- ficult to detect due to their small size and limited contrast. Prior work in automated synapse detection has relied upon time-intensive, error-prone biological preparations (isotropic slicing, post-staining) in order to sim- plify the problem. This paper presents VESICLE, the first known approach designed for mammalian synapse detection in anisotropic, non-poststained data. Our methods explicitly leverage biological context, and the results exceed ex- isting synapse detection methods in terms of accuracy and scalability. We provide two different approaches - a deep learning classifier (VESICLE- CNN) and a lightweight Random Forest approach (VESICLE-RF), to offer alternatives in the performance-scalability space. Addressing this synapse detection challenge enables the analysis of high-throughput imag- ing that is soon expected to produce petabytes of data, and provides tools for more rapid estimation of brain-graphs. Finally, to facilitate com- munity efforts, we developed tools for large-scale object detection, and demonstrated this framework to find 50,000 synapses in 60,000 μ m 3 (220 GB on disk) of electron microscopy data. Figure 1: Previous work on synapse detection has focused on isotropic post-stained data (left), which shows crisp membranes and dark fuzzy post synaptic densities (arrows) from all orientations. The alternative imaging technique of non post-stained, anisotropic data (middle, right) promises higher throughput, lack of staining artifacts, reduction in lost slices, and less demanding data storage requirements - all critically im- portant for high-throughput connectomics. The XZ plane of a synapse in anisotropic data is shown (right), illustrating the effect of lower resolu- tion. We address this more challenging environment, in which membranes appear fuzzier and are harder to distinguish from synaptic contacts. Data courtesy of Graham Knott (left) and Jeff Lichtman (middle, right). In Figure 1, we provide examples demonstrating the challenging set- ting of our detection task. Figure 2 illustrates the importance of leverag- ing contextual information, especially neurotransmitter-containing vesi- cles and cell membranes. Finally, Figure 3 presents a summary of classi- fier performance; visualizations of the results are not shown here, but are available in the full paper and on our website. Our code and data are open source and available at: openconnecto.me/vesicle. Figure 2: A single cross-section of EM data is shown (upper left). The detection task is to identify synapses shown in green (upper right). These synapses are known to exist at the interface of two neurons; these bound- aries can be approximated by previously computed membranes, allowing us to restrict the evaluation regions to the green pixels (lower left). Clus- ters of vesicles are a good indicator of an axonal bouton, suggesting that one or more synaptic sites is likely nearby. Vesicles found by our auto- mated detection step are highlighted in green (lower right). Figure 3: VESICLE-RF and VESICLE-CNN significantly outperform prior state-of-the art, particularly at high recall rates. The relatively abrupt endpoint of the Becker2013 method occurs because beyond this point, thresholded probabilities group into large detected regions rather than in- dividual synapses, which are disallowed.

Upload: others

Post on 19-Aug-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: VESICLE: Volumetric Evaluation of Synaptic Interfaces using … · 2015. 12. 15. · VESICLE: Volumetric Evaluation of Synaptic Interfaces using Computer Vision at Large Scale William

VESICLE: Volumetric Evaluation of Synaptic Interfaces using Computer Vision at Large Scale

William Gray Roncal1,2 · [email protected]

Michael Pekala2 · [email protected]

Verena Kaynig-Fittkau3 · [email protected]

Dean M Kleissas2 · [email protected]

Joshua T Vogelstein4,5 · [email protected]

Hanspeter Pfister3 · [email protected]

Randal Burns1 · [email protected]

R Jacob Vogelstein2 · [email protected]

Mark A Chevillet2 · [email protected]

Gregory D Hager1 · [email protected]

1 Johns Hopkins University, Baltimore, MarylandDepartment of Computer Science

2 JHU Applied Physics Laboratory, Laurel, MarylandResearch and Exploratory Development

3 Harvard University, Cambridge, MassachusettsSchool of Engineering and Applied Sciences

4 Johns Hopkins University, Baltimore, MarylandDepartment of Biomedical Engineering,

5 Johns Hopkins University, Baltimore, MarylandInstitute for Computational Medicine

An open challenge at the forefront of modern neuroscience is to obtaina comprehensive mapping of the neural pathways that underlie humanbrain function; an enhanced understanding of the wiring diagram of thebrain promises to lead to new breakthroughs in diagnosing and treatingneurological disorders. Inferring brain structure from image data, suchas that obtained via electron microscopy (EM), entails solving the prob-lem of identifying biological structures in large data volumes. Synapses,which are a key communication structure in the brain, are particularly dif-ficult to detect due to their small size and limited contrast. Prior work inautomated synapse detection has relied upon time-intensive, error-pronebiological preparations (isotropic slicing, post-staining) in order to sim-plify the problem.

This paper presents VESICLE, the first known approach designed formammalian synapse detection in anisotropic, non-poststained data. Ourmethods explicitly leverage biological context, and the results exceed ex-isting synapse detection methods in terms of accuracy and scalability. Weprovide two different approaches - a deep learning classifier (VESICLE-CNN) and a lightweight Random Forest approach (VESICLE-RF), tooffer alternatives in the performance-scalability space. Addressing thissynapse detection challenge enables the analysis of high-throughput imag-ing that is soon expected to produce petabytes of data, and provides toolsfor more rapid estimation of brain-graphs. Finally, to facilitate com-munity efforts, we developed tools for large-scale object detection, anddemonstrated this framework to find ⇡ 50,000 synapses in 60,000 µm3

(220 GB on disk) of electron microscopy data.

Figure 1: Previous work on synapse detection has focused on isotropicpost-stained data (left), which shows crisp membranes and dark fuzzypost synaptic densities (arrows) from all orientations. The alternativeimaging technique of non post-stained, anisotropic data (middle, right)promises higher throughput, lack of staining artifacts, reduction in lostslices, and less demanding data storage requirements - all critically im-portant for high-throughput connectomics. The XZ plane of a synapsein anisotropic data is shown (right), illustrating the effect of lower resolu-tion. We address this more challenging environment, in which membranesappear fuzzier and are harder to distinguish from synaptic contacts. Datacourtesy of Graham Knott (left) and Jeff Lichtman (middle, right).

In Figure 1, we provide examples demonstrating the challenging set-ting of our detection task. Figure 2 illustrates the importance of leverag-ing contextual information, especially neurotransmitter-containing vesi-cles and cell membranes. Finally, Figure 3 presents a summary of classi-fier performance; visualizations of the results are not shown here, but areavailable in the full paper and on our website. Our code and data are opensource and available at: openconnecto.me/vesicle.

Figure 2: A single cross-section of EM data is shown (upper left). Thedetection task is to identify synapses shown in green (upper right). Thesesynapses are known to exist at the interface of two neurons; these bound-aries can be approximated by previously computed membranes, allowingus to restrict the evaluation regions to the green pixels (lower left). Clus-ters of vesicles are a good indicator of an axonal bouton, suggesting thatone or more synaptic sites is likely nearby. Vesicles found by our auto-mated detection step are highlighted in green (lower right).

Figure 3: VESICLE-RF and VESICLE-CNN significantly outperformprior state-of-the art, particularly at high recall rates. The relatively abruptendpoint of the Becker2013 method occurs because beyond this point,thresholded probabilities group into large detected regions rather than in-dividual synapses, which are disallowed.