<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Colorectal Polyps Detection in Virtual Colonoscopy Using 3D Geometric Features and Deep Learning</title></titleStmt>
			<publicationStmt>
				<publisher>IEEE</publisher>
				<date>05/27/2024</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10643975</idno>
					<idno type="doi">10.1109/ISBI56570.2024.10635785</idno>
					
					<author>Mohamed Yousuf</author><author>Samir Harb</author><author>Islam Alkabbany</author><author>Asem Ali</author><author>Salwa Elshazley</author><author>Aly Farag</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Not Available]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">INTRODUCTION</head><p>Colorectal cancer (CRC) begins as small growths (polyps) that attach to the luminal wall of the colon or rectum. These growths must be diagnosed and treated promptly. If colon polyps are not diagnosed and treated at the right time, they may grow in size and become cancerous. That is why the American Cancer Society (ACS) recommends that people at average risk of colorectal cancer should start regular screening starting at age 45 <ref type="bibr">[1]</ref>. The two screening procedures, which have the highest performance among other procedures, are the optical colonoscopy (OC) and Computed Tomography Colonography (CTC). However, one fourth of colorectal polyps are missed with performing screening <ref type="bibr">[2]</ref>. Consequently, an automated system for the detection of polyps could be essential in reducing the number of missed polyps by providing the location and size of any potential polyps during the screening process. The Optical colonoscopy (OC) procedure, in which the colon and rectum are viewed using an endscope, is the gold standard screening approach. The use of colonoscopy has been significant in terms of diagnosis, treatment, and screening. In addition, captured videos during OC can be analyzed to diagnose and detect colon polyps. Many studies have been introduced to analyze OC videos. Many of these algorithms have been discussed in different reviews, e.g., Hassan et al. <ref type="bibr">[2]</ref> and Barua et al. <ref type="bibr">[3]</ref>. Although, colonoscopy is a secure process, yet complications can arise, such as perforatio <ref type="bibr">[4]</ref>, bleeding <ref type="bibr">[5]</ref> among other complications <ref type="bibr">[6]</ref>.</p><p>The second screening procedure of CRC is Computed Tomography Colonography (CTC), which can detect polyps with a similar THIS WORK HAS BEEN FUNDED BY NSF GRANT 2124316. accuracy of OC but with lower cost. The American College of Radiology Imaging Network (ACRIN) <ref type="bibr">[7]</ref> performed standard CTC and OC on 2531 patients at 15 study centers in the US. The study showed that per-patient sensitivity and specificity for CTC were: 0.9 &#177; 0.03 and 0.86&#177;0.02, respectively, which are very close to OC. So, CTC is considered as a reliable and robust procedure to detect colon polyps. Therefore, there are several algorithms have been introduced to detect colon polyps within CT scans. Godkhindi and Gowda <ref type="bibr">[8]</ref> trained a CNN classifier to identify the CT slices, which have polyps. However, their approach does not localize polyps in the CT slices. Uemura and N&#228;ppi <ref type="bibr">[9]</ref> proposed a method to classify polyps using 3D CNN, however, the network needs high computational power in order to be trained on 3D data as shown in the experiments section. Furthermore, Tan's approach <ref type="bibr">[10]</ref> classifies the polyp type if it was benign or malignant using a geometric feature called Gray-Level Co-Occurrence Matrix (GLCM) with AUC = 0.91. Zhang's work <ref type="bibr">[11]</ref> was built on Tan's work by making a multi-model to extract the GLCM features with different sizes, which raised the AUC = 0.94 and raised the accuracy by 0.05 to be 0.964. However, these two methods require a polyp region as an inputs and they can't detect such regions. On the other hand, Virtual Colonoscopy (VC), which is a part of the Computed Tomography Colonography (CTC) procedure, is considered cheaper and safer alternative way for screening <ref type="bibr">[12,</ref><ref type="bibr">13]</ref>. Moreover, VC can be used to perform massive scale screening for polyp early detection and it also can help in performing the early detection in economic depressed regions and rural areas where there are limited availability of OC. Virtual colonoscopy mimics OC by visualizing the luminal surface of the colon using different visualization approaches. So many of the video analysis approaches that have been used in OC can be used with VC.</p><p>In this paper, we propose an approach, shown in Fig. <ref type="figure">1</ref>, for colonic polyps detection in virtual colonoscopy. To develop an efficient approach that overcomes polyps detection and localization challenges, we exploit the 3D surface information in VC to generate different geometric features that discriminate between polyp and non-polyp regions. These features are used to train a CNN-based model for polyps detection. The main contributions of this work include developing: 1. a colon segmentation approach to accurately reconstruct 3D colon surface, 2. multiple approaches to extract 3D surface geometric features that increase detection performance, and 3. a CNN-based model, which is trained and validated using generated features that encode 3D surface geometry.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">PROPOSED APPROACH</head><p>The Computed tomography colonography (CTC) framework consists of four-steps, as shown in Fig. <ref type="figure">2-(a)</ref>, starting from an abdominal  CT scan of a prepped patient. These steps are: 1) Image segmentation to isolate the lumen from the rest of tissues in the abdomen (e.g., the liver, lungs and small intestines, in addition to addressing uncertainties of CT acquisition); 2) 3D model building to construct the colon, which also involves extracting the centerline as a datum for visualization, and the registration of supine and prone CTC scans; 3) Visualization to present the lumen on radiology stations with details in 3D and corresponding 2D CT, in addition to functionalities for polyp editing; and 4) Analysis step which performs polyp detection, classification and archiving, and preparation of a full patient record.</p><p>In this work, we focus on the first and last steps.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Images from Virtual Colonoscopy</head><p>To generate a sequence of images that visualize the luminal surface of a colon, as shown in Fig. <ref type="figure">2-(b)</ref>, the colon should be segmented from a CT scan then a 3D surface should be reconstructed. Finally, virtual camera(s) can be used for visualization, e.g. 1) Fly-Through <ref type="bibr">[14]</ref> uses a virtual camera on the centerline to mimic OC, and 2) Fly-IN <ref type="bibr">[15]</ref> uses a ring of virtual cameras around the centerline showing 360 &#8226; field of view (FOV) of colon segments internally. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>. Colon Segmentation and 3-D reconstruction</head><p>Colon segmentation is a challenging problem due to the colon's asymmetric topology. Also, uncertainties appear due to the presence of Hounsfield (HU) intensity regions consisting of air, soft tissue, and high-attenuation structures like the bones. Also, there are other complications due to the presence of residual stool, parts of the diaphragm, lungs, and disconnected colon segments <ref type="bibr">[16]</ref>. In this paper, we propose a segmentation approach that involves multiple steps as shown in Fig. <ref type="figure">3</ref>. The first step is to calculate the empirical distributions of Hounsfield intensities in a DICOM volume, as shown in Fig. <ref type="figure">3-b</ref>. The main components of a colon are the air, for which the characteristic peaks are almost at -1000 HU <ref type="bibr">[17]</ref>, and the fluid whose Hounsfield intensity is greater than 300 HU.</p><p>To extract the colon components, first, we estimate the marginal densities of air, fat, muscle, and fluid by fitting four Gaussian components using the Expectation Maximization (EM) algorithm, as shown in Fig. <ref type="figure">3-b</ref>. Then, we identify colon regions using two thresholds. The first threshold, &lt; t1, is between air and fat (e.g., &#8764; -577HU), and the second threshold, &lt; t2, is between muscle and fluid (e.g., &#8764; 305HU). The HU values of colon regions should be &lt; t1 and &gt; t2.</p><p>An initial segmentation of both air and fluid could be generated using these two thresholds. However, this simple thresholding technique cannot isolate colon regions from non-colon regions because other tissues, e.g., lungs, have the same low HU as air. Also, there are non-colon regions, e.g., bones, that have a high HU as fluid, as shown in Fig. <ref type="figure">3-c</ref>. Therefore, the initial segmentation is used to generate seeds from colon regions. These seeds are fed to a Graph Cut (GC) algorithm that formulates the segmentation problem as a graph partitioning problem. To extract seeds from the initial segmentation, the rectum region is identified in the first part of the DICOM volume. The rectum can be extracted as a disk-like region that has a low HU, as shown in Fig. <ref type="figure">3-d</ref>. This region is used as a starting seed, from which other colon regions are extracted by region growing. However, since there are non-colon parts in the abdominal CT scan (e.g., small intestine, bones, etc.) that are interwind with colon parts, this yields errors in the region growing step. Therefore, restricted region growing is performed using the morphological operation to guarantee more separation between the two aforementioned classes (i.e., colon and non-colon). The output of the region growing step, shown in Fig. <ref type="figure">3-e</ref>, is used as a seed for GC to generate the final segmentation, shown in Fig. <ref type="figure">3</ref>-f.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.2.">Colon Visualization</head><p>After a colon is segmented, 3D reconstruction to generate a surface representation of the colon is performed and the centerline of the 3D colon is generated <ref type="bibr">[18]</ref>. Then, the virtual colonoscopy Fly-In approach <ref type="bibr">[15]</ref> is developed to provide state of the art accuracy in polyp detection. This method involves a desk-like rig of virtual cameras centered on the centerline of a reconstructed colon, providing 360 &#8226; visualization of a cylindrical region of interest (ROI) which, when projected, provides a "filet"-like display of the internal surface of the ROI, as shown in Fig. <ref type="figure">4</ref>. By moving the ROI along the centerline of the colon, radiologists would be able to examine the luminal surface and detect colonic polyps.</p><p>An optimal visualization approach projects most of the surface cells on the image plane without local deformations or loss. The visualization loss can be defined as a function of three factors: The angle (&#945;) between the projection direction (p) and the camera's principal axis ( --&#8594; look); the angle (&#934;) between the projection direction (p) and the cell's normal vector (n); and finally, the ratio between the camera focal length (f ) and the cell's distance (d) to the projection center on the direction of ( --&#8594; look). Figure <ref type="figure">4</ref> demonstrates the Fly-In on a rendered segment of the colon and shows a rig of 8 cameras over a ring. A near distortionless filet of the colon segment is shown as well. By adjusting the visualization frustum for the cameras, we can control the size of displayed ring and resolution.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Polyps Detection based on Modified Fly-In Approach</head><p>In the proposed polyp detection framework, Fly-In is used to generate 2D images of the internal surface of the colon for better visualization of the colon wall. In addition to the conventional images formed by Fly-In (i.e., albedo and lightning), Fig. <ref type="figure">1-c</ref>(III), we generate sets of images coding the geometric surface features: 1. The surface curvature map, as shown in Fig. <ref type="figure">1-c(I)</ref>, in which the curvature is calculated using the algebraic point set surface <ref type="bibr">[19]</ref>.</p><p>The curvature is based on moving least squares (MLS) fitting algebraic spheres to the surface. 2. Normal map which could be computed for each surface by getting two vectors on the surface. Then cross product these two vectors, for each vertex on the surface we can get two vectors by subtracting the 3d coordinates (x, y, z) of the vertex from two neighbors 3d coordinates as shown in Fig. <ref type="figure">1-c</ref>(II). The normal map is represented as a 3-channel (RGB) image to represent the normal vector (Nx, Ny, Nz). 3. The depth map that reflects the smallest distance between each surface point and the centerline, as shown in Fig. <ref type="figure">1</ref>-c(IV). Our hypothesis is that fusing the 2D projections and the 3D colon representation in virtual colonoscopy can enhance polyp detection accuracy. The reason of this expected enhancement is that images formed by Fly-In can't accurately encode convex (e.g., polyps) and concave (e.g., folds) surface regions if improper lightning is used, see Fig. <ref type="figure">5</ref>. Therefore, the generated feature maps are combined as multi-channel image and fed into a deep learning model, which can be trained to extract polyps candidates in these images.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">EXPERIMENTAL RESULTS</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Dataset</head><p>Two sets of data are used to validate our proposed framework. For segmentation evaluation, we use a dataset provided by the American College of Radiology Imaging Network (ACRIN) <ref type="bibr">[7]</ref> and Walter-Reed medical center. The polyps detection modules were trained and validated on a dataset of supine and prone scans of 49 patients, provided by CTC experts from the University of Wisconsin. The dataset contains 59 annotated polyps larger than 6 mm, detected and annotated by one of the three experienced radiologists.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Training and testing procedure</head><p>In order to assess the performance of the proposed automatic segmentation approach, the intersection over union metric, IoU = G&#8745;S G&#8746;S , was used to compare the automatic segmentation S with the ground truth G. The results demonstrate that the proposed automatic segmentation approach was successful in segmenting 74% of the cases with an IoU of more than 90%.</p><p>For the polyp detection approach, we test each geometric feature separately, to know which one is the best to be used. Table <ref type="table">1</ref> shows the comparison for each proposed geometric feature as an input for  <ref type="bibr">[23]</ref>, Sparse RCNN <ref type="bibr">[24]</ref>, and Swin Transformer <ref type="bibr">[25]</ref>.</p><p>For the training procedure, all the detector's training batch size = 16 except the Swin transformer which has batch size = 8 due to the size of the model, but all other hyperparameters were the original hyperparameters from each model's original papers. All the training was performed on Nvidia TITAN RTX 24 Gb. Moreover, all models were trained to a high number of epochs (2000 epochs), and the best validation score for each model was chosen to be the tested model.</p><p>Since the main goal is to detect all polyps, Mean Average Precision (mAP) is the best metric to report in our application. As shown in Table <ref type="table">1</ref>, the curvature feature shows the highest mAP score with the use of YOLOv7, which is the most appearance feature that can distinguish between the curvature type if it is convex curvature or concave curvature. The colon contains a lot of concave curvatures which can deceive the detector, so a feature that can distinguish concave and convex surfaces is needed to make the detector work best since polyps have mostly a convex curvature shape. That is why the curvature feature is the most important. The second best feature is the depth feature with Dynamic RCNN network. Since depth can distinguish between the near and far tissue surfaces, it can give the model the needed feature for distinguishing between polyp and nonpolyp parts. Then comes the normal feature with Dynamic RCNN network. We believe that Swin transformer has a big potential to achieve better results but it needs more polyps data than we have to train.</p><p>4. CONCLUSION In this work, we proposed an approach to identify the locations and sizes of the polyps in virtual colonoscopy. The modified Fly-In method was developed to generate images encoding 3D surface geometry features. Then a CNN-based detector was trained using these images to extract potential polyp regions. High performance, i.e. mAP &#8764; 97.1%, encourages radiologists to use the proposed approach in identifying polyp candidates using virtual colonoscopy.</p></div></body>
		</text>
</TEI>
