The Temple University Hospital Digital Pathology Corpus

Houser, D.; Shadhin, G.; Anstotz, R.; Campbell, C.; Obeid, I.; Picone, J.; Farkas, T.; Persidsky, Y.; Jhala, N.

doi:10.1109/SPMB.2018.8615619

Citation Details

The Temple University Hospital Digital Pathology Corpus

Digital pathology is a relatively new field that stands to gain from modern big data and machine learning techniques. In the United States alone, millions of pathology slides are created and interpreted by a human expert each year, suggesting that there is ample data available to support machine learning research. However, the relevant corpora that currently exist contain only hundreds of images, not enough to develop sophisticated deep learning models. This lack of publicly accessible data also hinders the advancement of clinical science. Our digital pathology corpus is an effort to place a large amount of clinical pathology images collected at Temple University Hospital into the public domain to support the development of automatic interpretation technology. The goal of this ambitious project is to create a corpus of 1M images. We have already released 10,000 images from 600 clinical cases. In this paper, we describe the corpus under development and discuss some of the underlying technology that was developed to support this project. more »

Award ID(s):: 1726188

PAR ID:: 10122974

Author(s) / Creator(s):: Houser, D.; Shadhin, G.; Anstotz, R.; Campbell, C.; Obeid, I.; Picone, J.; Farkas, T.; Persidsky, Y.; Jhala, N.

Date Published:: 2018-12-01

Journal Name:: Proceedings of the IEEE Signal Processing in Medicine and Biology Symposium

Page Range / eLocation ID:: 1 to 7

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/SPMB.2018.8615619

More Like this