Scaling multi-instance support vector machine to breast cancer detection on the BreaKHis dataset

Seo, Hoon; Brand, Lodewijk; Barco, Lucia Saldana; Wang, Hua

doi:10.1093/bioinformatics/btac267

Citation Details

Scaling multi-instance support vector machine to breast cancer detection on the BreaKHis dataset

Abstract MotivationBreast cancer is a type of cancer that develops in breast tissues, and, after skin cancer, it is the most commonly diagnosed cancer in women in the United States. Given that an early diagnosis is imperative to prevent breast cancer progression, many machine learning models have been developed in recent years to automate the histopathological classification of the different types of carcinomas. However, many of them are not scalable to large-scale datasets. ResultsIn this study, we propose the novel Primal-Dual Multi-Instance Support Vector Machine to determine which tissue segments in an image exhibit an indication of an abnormality. We derive an efficient optimization algorithm for the proposed objective by bypassing the quadratic programming and least-squares problems, which are commonly employed to optimize Support Vector Machine models. The proposed method is computationally efficient, thereby it is scalable to large-scale datasets. We applied our method to the public BreaKHis dataset and achieved promising prediction performance and scalability for histopathological classification. Availability and implementationSoftware is publicly available at: https://1drv.ms/u/s!AiFpD21bgf2wgRLbQq08ixD0SgRD?e=OpqEmY. Supplementary informationSupplementary data are available at Bioinformatics online. more »

Award ID(s):: 1652943 1849359 1932482

PAR ID:: 10368492

Author(s) / Creator(s):: Seo, Hoon; Brand, Lodewijk; Barco, Lucia Saldana; Wang, Hua

Publisher / Repository:: Oxford University Press

Date Published:: 2022-06-27

Journal Name:: Bioinformatics

Volume:: 38

Issue:: Supplement_1

ISSN:: 1367-4803

Format(s):: Medium: X Size: p. i92-i100

Size(s):: p. i92-i100

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1093/bioinformatics/btac267

More Like this