CAVE: Connectome Annotation Versioning Engine

Dorkenwald, Sven; Schneider-Mizell, Casey M; Brittain, Derrick; Halageri, Akhilesh; Jordan, Chris; Kemnitz, Nico; Castro, Manual A; Silversmith, William; Maitin-Shephard, Jeremy; Troidl, Jakob; Pfister, Hanspeter; Gillet, Valentin; Xenes, Daniel; Bae, J Alexander; Bodor, Agnes L; Buchanan, JoAnn; Bumbarger, Daniel J; Elabbady, Leila; Jia, Zhen; Kapner, Daniel; Kinn, Sam; Lee, Kisuk; Li, Kai; Lu, Ran; Macrina, Thomas; Mahalingam, Gayathri; Mitchell, Eric; Mondal, Shanka Subhra; Mu, Shang; Nehoran, Barak; Popovych, Sergiy; Takeno, Marc; Torres, Russel; Turner, Nicholas L; Wong, William; Wu, Jingpeng; Yin, Wenjing; Yu, Szi-chieh; Reid, R Clay; da_Costa, Nuno Maçarico; Seung, H Sebastian; Collman, Forrest

doi:10.1101/2023.07.26.550598

Abstract Advances in Electron Microscopy, image segmentation and computational infrastructure have given rise to large-scale and richly annotated connectomic datasets which are increasingly shared across communities. To enable collaboration, users need to be able to concurrently create new annotations and correct errors in the automated segmentation by proofreading. In large datasets, every proofreading edit relabels cell identities of millions of voxels and thousands of annotations like synapses. For analysis, users require immediate and reproducible access to this constantly changing and expanding data landscape. Here, we present the Connectome Annotation Versioning Engine (CAVE), a computational infrastructure for immediate and reproducible connectome analysis in up-to petascale datasets (∼1mm³) while proofreading and annotating is ongoing. For segmentation, CAVE provides a distributed proofreading infrastructure for continuous versioning of large reconstructions. Annotations in CAVE are defined by locations such that they can be quickly assigned to the underlying segment which enables fast analysis queries of CAVE’s data for arbitrary time points. CAVE supports schematized, extensible annotations, so that researchers can readily design novel annotation types. CAVE is already used for many connectomics datasets, including the largest datasets available to date.

More Like this