CT-based deep learning model for the prediction of DNA mismatch repair deficient colorectal cancer: a diagnostic study

Cao, Wuteng; Hu, Huabin; Guo, Jirui; Qin, Qiyuan; Lian, Yanbang; Li, Jiao; Wu, Qianyu; Chen, Junhong; Wang, Xinhua; Deng, Yanhong

doi:10.1186/s12967-023-04023-8

Research
Open access
Published: 22 March 2023

CT-based deep learning model for the prediction of DNA mismatch repair deficient colorectal cancer: a diagnostic study

Wuteng Cao^1,2^na1,
Huabin Hu^2,3^na1,
Jirui Guo^2,4^na1,
Qiyuan Qin^2,4^na1,
Yanbang Lian⁵^na1,
Jiao Li^1,2,
Qianyu Wu^1,2,
Junhong Chen⁶,
Xinhua Wang^1,2 &
…
Yanhong Deng^2,3

Journal of Translational Medicine volume 21, Article number: 214 (2023) Cite this article

2025 Accesses
3 Citations
1 Altmetric
Metrics details

Abstract

Background

Stratification of DNA mismatch repair (MMR) status in patients with colorectal cancer (CRC) enables individual clinical treatment decision making. The present study aimed to develop and validate a deep learning (DL) model based on the pre-treatment CT images for predicting MMR status in CRC.

Methods

1812 eligible participants (training cohort: n = 1124; internal validation cohort: n = 482; external validation cohort: n = 206) with CRC were enrolled from two institutions. All pretherapeutic CT images from three dimensions were trained by the ResNet101, then integrated by Gaussian process regression (GPR) to develop a full-automatic DL model for MMR status prediction. The predictive performance of the DL model was evaluated using the area under the receiver operating characteristic curve (AUC) and then tested in the internal and external validation cohorts. Additionally, the participants from institution 1 were sub-grouped by various clinical factors for subgroup analysis, then the predictive performance of the DL model for identifying MMR status between participants in different groups were compared.

Results

The full-automatic DL model was established in the training cohort to stratify the MMR status, which presented promising discriminative ability with the AUCs of 0.986 (95% CI 0.971–1.000) in the internal validation cohort and 0.915 (95% CI 0.870–0.960) in the external validation cohort. In addition, the subgroup analysis based on the thickness of CT images, clinical T and N stages, gender, the longest diameter, and the location of tumors revealed that the DL model showed similar satisfying prediction performance.

Conclusions

The DL model may potentially serve as a noninvasive tool to facilitate the pre-treatment individualized prediction of MMR status in patients with CRC, which could promote the personalized clinical-making decision.

Background

Colorectal cancer (CRC) is the third most commonly diagnosed malignancy in the world and the second highest rate of increasing incidence among all gastrointestinal tumors [1, 2]. Patients with DNA mismatch repair deficient (dMMR)/microsatellite instability-high (MSI-H) CRC have a short overall survival (OS) and obtain no benefit from adjuvant chemotherapy [3,4,5]. More importantly, recent studies have demonstrated dMMR/MSI-H is a predictive biomarker for immunotherapy, because dMMR/MSI-H CRCs are associated with a higher mutational burden tumor neoantigen load, and dense immune cell infiltration [6, 7]. In addition, immunotherapy in patients with advanced CRC harboring dMMR/MSI has been approved by the United State Food and Drug Administration (FDA).

The National Comprehensive Cancer Network (NCCN) and the European Society for Medical Oncology (ESMO) guidelines both recommended that all patients with CRC be tested for microsatellite instability, a hypermutable phenotype caused by defects in DNA mismatch repair, which facilitated individualized clinical making-decisions, then maximized the benefits for CRC patients [8,9,10]. Current testing for dMMR/MSI include PCR-based assay for microsatellite markers and immunohistochemical analysis for MMR protein expression [11]. While these existed approaches face distinct drawbacks. First, routine MSI testing using IHC or PCR is not commonly performed on account of tedious procedures and the heavy financial burden [12, 13]. In addition, the procedure of sampling is invasive linked to potentially complications, which limits the dynamic monitoring of biological characteristics and histopathological changes of tumors [14]. Furthermore, the accuracy of conventional biopsy specimens will be influenced by sampling errors, such as insufficient or inappropriate tissue sampling because of tumor heterogeneity. Therefore, a noninvasive and accurate method is highly desirable for pretherapeutic prediction of MMR status, to help better stratify CRC patients before individualized clinical making-decision.

Recently, artificial intelligence (AI) algorithms, particularly deep learning, have shown outstanding performance in medical image processing, advancing the field forward at a rapid pace [15]. A typical approach of deep learning termed convolutional neural network (CNN) has been reported to act as an alternative tool to tackle complex medical issues efficiently and effectively and achieve satisfying performance in many diseases such as breast cancer, pulmonary nodules, gastrointestinal cancer and CRC [16,17,18,19]. Recent studies have proposed deep learning approaches for predicting MMR status based on hematoxylin and eosin histological images in CRC patients [20,21,22]. These studies reported a moderate predictive performance, which suggested that it is reasonable to speculate that deep learning approaches can achieve improved performance for pretherapeutic prediction of MMR status in CRC. However, few studies based on deep learning focus on routine and noninvasive computed tomography (CT) images.

In the present study, we aim to develop and validate a deep learning model based on pretherapeutic CT images for predicting MMR status in CRC. We hypothesized that DL model could make it easier and more precisive to stratify MMR status, which could promote the personalized clinical-making decision.

Methods

Study participants

The retrospective study was approved by Ethics Committees of the two participating institutions and the informed consent requirement was waived due to its retrospective nature. Consecutive patients with histologically confirmed primary CRC between March 2012 and March 2020 were retrospectively reviewed from two medical institutions: the Sixth Affiliated Hospital of Sun Yat-sen University (Guangzhou, China, institution 1) and the First Affiliated Hospital of Zhengzhou University (Zhengzhou, China, institution 2). Our inclusion criteria were patients with (i) pathologically confirmed primary CRC, (ii) available test results for MMR status, (iii) pretherapeutic contrast-enhanced abdominopelvic CT images within 2 weeks before surgery, (iv) available complete clinicopathological data. The exclusion criteria were as follows: (i) receiving any therapy before CT examination, (ii) lacking MMR test, (iii) incomplete clinicopathological data, (iv) the interval between CT examinations and surgery over 2 weeks, and (v) inoperability or refusal of operation.

A total of 1812 eligible patients were enrolled in this study. The patients from institution 1 were divided into a training cohort (n = 1124, March 2012 to March 2018) and an internal validation cohort (n = 482, May 2018 to March 2020) by time. The 206 participants from institution 2 were assigned to an external cohort. It’s noted that no data from the same patients in the training and external cohorts. The detail of the recruitment pathway was presented in Additional file 1: Fig. S1.

Clinicopathological characteristics

The baseline clinicopathological parameters of patients from institution 1, including gender, age, the levels of serum tumor markers, such as carcinoembryonic antigen (CEA), cancer antigen 199 (CA199), cancer antigen 125 (CA125) and cancer antigen 153 (CA153), the clinical T and N stage, the longest diameter, and the location of tumors, were retrospectively reviewed and recorded from the medical record archives. Additionally, the thickness of CT images was also recorded. The overall experimental design is shown in Fig. 1.

Identification of MMR status

Immunohistochemistry (IHC) analysis of mismatch repair (MMR) proteins expression–Formalin-fixed paraffin-embedded (FFPE) tumors were examined the loss of MMR proteins (MLH1, MSH2, MSH6, and PMS2) expression. MMR protein loss is defined as the absence of nuclear staining in neoplastic cells but positive nuclear staining in lymphocytes and normal adjacent colonic epithelium. Primary monoclonal antibodies against MLH1, MSH2, MSH6, and PMS2 were applied. MMR status was determined locally by IHC analysis and tumors displaying loss of at least one of four MMR proteins can be considered as deficient mismatch repair (dMMR), whereas those with intact MMR proteins can be classified as proficient mismatch repair (pMMR).

CT image acquisition

All patients underwent contrast-enhanced CT scans covered the abdominal and pelvic region. CT images were obtained using three CT scanners from two institutions. For institution 1, patients were examined using OPTIMA CT660 (GE Medical Systems, Milwaukee, WI, United States) or AQUILION ONE (TOSHIBA Medical Systems, Japan) scanner. For institution 2, 64-row multidetector device (Discovery CT750HD, GE Medical Systems, Waukesha, WI, United States) CT scanner was used to perform abdominopelvic CT scans. The acquisition parameters of the two institutions were as follows: tube voltage of 120 kV; tube current of 150–550 mA; pitch of 0.97 to 0.99; reconstruction section thickness of 1.25 mm and 5 mm. The contrast agents at the dose of 1.2–1.5 mL/kg weight were injected at a speed of 2.5–3 mL/s with a high-pressure pump syringe. Arterial phase was obtained after 25–30 s of delay after intravenous injection of contrast material, and portal venous phase was performed after 55–70 s of delay. The representative CT and immunohistochemistry images of different MMR statuses were shown in Additional file 5: Material S1.

Preliminary experiment

All CT image data were derived from the Picture Archiving and Communication System (PACS) then converted into a unified Communications in Medicine (DICOM) format and stored as Nifti format on a case-by-case basis for further analysis. A preliminary experiment was conducted to determine the performance of region of interest (ROI)-based labeling approaches (method 1) for modeling versus ROI-free analysis (method 2) in prediction of MMR status for CRC patients, then that approach with better predictive performance would be selected as the final data processing method in the present study. The preliminary experiment consisted of 100 participants randomly selected from institution 1, including 50 patients with dMMR and 50 with pMMR.

For the method 1, three-dimensional manual segmentation of the tumor ROI was performed on the portal venous phase CT images by one radiologist with 10 years of experience with CRC diagnosis, using the free open-source software ITK-SNAP software (version 2.2.0, http://www.itksnap.org), with careful exclusion of pericolonic fat and mesentery air. Note that each segmentation was validated by a senior radiologist, who had 20 years of experience. Next, the obtained images were segmented and stored as PNG image files with the same resolution based on the axial direction, then divided into training group and test group. For the method 2, we directly processed the images of the original Nifti format files without delineating the ROIs, and these images also were converted and stored as PNG image files, then processed by the same approach as described in method 1 above. The images processed by the above two methods were saved separately as independent datasets. Whereafter, a classification neural network architecture (ResNet101) was applied to train the two independent datasets within the same development environment, meanwhile, we set the maximum training epochs to 100 epochs in both experiments. Finally, we compared the accuracy of the two models on the validation set, the model with better predictive performance would be identified, the corresponding data processing method in this study was finalized.

The preliminary experiment suggested that the model based on ROI labeling approach (method 1) showed inferior accuracy in predicting MMR status, with the accuracy of 73% while the other model based on fully automatic deep learning analysis (model 2) yielded an accuracy of more than 90%. Therefore, ROI-free analysis based on ResNet101 was identified as the final data processing method in the present study.

MMRnet development

On the basis of the preliminary experiment, we selected the method 2 (ROI-free) to process CT images data and construct the predictive model named “MMRnet”. It’s noted that all 1812 enrolled patients received abdominopelvic contrast-enhanced CT, which covered the whole tumors of colon or rectum. All CT images data were derived from the PACS, then converted into a unified Communications in Medicine (DICOM) format and stored as Nifti format on a case-by-case basis as similar as the preliminary experiment. By introducing the opencv-python package and writing related packages, the Nifti format files were divided into PNG image files with the same resolution in the axial direction and divided into training files and test files based on the axial direction.

The ResNet101 architecture, one of Resnet model, was utilized to train the pre-therapeutic CT imaging data and build the network to identify the MMR status, which contains one 7*7 convolutional layer, one max pooling layer, 33 bottleneck and one fully connected layer. Each bottleneck contains one 1 × 1 convolutional layer, one 3 × 3 convolutional layer and one 1 × 1 convolutional layer. The final fully connected layer output prediction. Images were transformed into matrices and input to the ResNet101 neural networks. During the training process, we used a cross-entropy loss function and Stochastic Gradient Descent (SGD). We set the initial learning rate to 0.001, the weight decay to 0.005, and the batch size to 128. The final epoch of model training was 100. In this study, we analyzed multi-axial CT images to develop a 3D predictive model, all CT images of the training cohort were automatically recognized and interpreted by ResNet101 with Pytorch (version 1.1.0), and the corresponding information of different axial CT images was obtained. Note that the mapping relationship between different axial images is considerably complex, and it is difficult to fuse complex information by using a general linear model. In order to take full advantage of all image information, we applied the Gaussian process regression (GPR) model to fuse the information of different axial images for automatic interpretation results of CT images. The GPR models are constructed from classical statistical models by replacing latent functions of parametric form by random processes with Gaussian prior, which is widely used in regression and classification tasks [23]. Additionally, the model can utilize prior prediction knowledge to provide predictive results. For regression tasks of large dataset, the GPR can reduce computational complexity [24]. The GPR model was used to perform regression analysis on all CT image interpretation results, which improved the accuracy of fusion results to a certain extent.

The training model performance was evaluated using multi-fold cross-validation. The squared exponential function was finally set as the model kernel function:

$$cov\left({x}_{i},{x}_{j}\right)=\mathrm{exp}(-\frac{{({x}_{i}-{x}_{j})}^{2}}{2})$$

x_i and x_j represents different image interpretation results. The GPR model was fused with neural network and finally achieved automatic machine diagnosis of MMR status (dMMR = 1 or pMMR = 0). The deep learning model workflow is presented in Fig. 2.

Predictive performance of the MMRnet

The predictive performance of MMRnet was trained to its optimum in the training cohort and then tested in the two validation cohorts. Receiver operating characteristic curve (ROC) analysis was performed to evaluate the performance of MMRnet. The optimal cutoff threshold was identified by maximizing Youden index (sensitivity + specificity − 1), then the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV) were then calculated with the cutoff of ROC curve identified in the primary cohort, which was also applied to the validation cohorts.

Ablation experiments

To investigate which networks are suitable for accurate MMR status prediction based on ROI-free analysis, we applied the ResNet101 and the VGG-19 network to construct the prediction model respectively, then compared their predictive performance in terms of the AUC value.

Statistical analysis

The clinical parameters were compared and analyzed by Student t test for continual variables and Chi-square, Fisher's exact, or Mann–Whitney U tests for categorical variables. Statistical analysis was conducted with R software (version 3.5.0; http://www.Rproject.org), MATLAB (version 2020a; Mathworks, Natick, MA, USA) and MedCalc Software (version 18.2 Belgium). A result was considered to indicate a significant difference with a p value of less than 0.05.

Results

Patient characteristics

In the present study, 481 patients with dMMR status determined by the IHC analysis, with prevalence of 24.3% (390/1606) in institution 1 and 44.2% (91/206) in institution 2. Patient characteristics and a comparison between patients with dMMR and pMMR in institution 1 are presented in Table 1. In the training cohort and internal validation cohort, no significant difference was found between the dMMR status and pMMR status groups in terms of gender and the levels of serum CEA, CA199, CA153 (p > 0.05). However, regarding the age, N stage and tumor location, significant differences were observed between the two groups in both cohorts from institution 1.

Table 1 Characteristics of patients in institution 1

Full size table

Predictive performance of the ResNet classification model

In comparison of the predictive performance of ResNet101 and VGG-19 in predicting the MMR status, the ResNet101 showed the superior prediction performance, with a higher AUC of 0.997 (95% CI 0.995–1.000, p < 0.001), the corresponding ROC curves were present in Additional file 2: Fig. S2. As demonstrated in the Fig. 3, the heatmap demonstrated that the region of red color means the higher possibility of the presence of predictive features, and these regions were important in making the diagnosis for the neural network, which visualizes the image features corresponding to different convolution layers.

Based on the uniaxial information of CT images, the DL model achieve the AUCs of 0.944, 0.762, 0.984 in the X, Y, Z axis respectively (Additional file 3: Fig. S3). Additionally, in the process of GPR analysis to fuse multi-axial information, we found that the MMRnet model had the optimum prediction performance while the Gaussian process regression squared index was adopted, the details were presented in Additional file 4: Fig. S4. In this condition, for stratifying MMR status, the DL model were developed in the training cohort to automatically classify the MMR status, then tested in two validation cohorts, which achieved promising discriminative ability, with AUCs of 0.986 (95% CI 0.971–1.000) in the internal validation cohort and 0.915 (95% CI 0.870–0.960) in the external validation cohort. The sensitivity, specificity, accuracy, PPV and NPV were also present in Table 2 and Fig. 4.

Table 2 Predictive performance of MMRnet in the internal and external validation cohorts

Full size table

Subgroup analysis

Subgroups analyses also were performed in addition to main analysis in order to assess prediction performance in different subgroups based on the thickness of CT images, clinical T and N stages, gender, the longest diameter and location of tumor in participants enrolled from institution 1. The subgroup analysis revealed that the MMRnet show similar satisfying prediction performance in all groups, as presented in Additional file 6: Material S2.

Discussion

We developed a fully automated classifier for stratifying MMR status in 1812 patients with CRC using clinically acquired pretherapeutic CT images from two institutes. It demonstrated promising performance with an AUC of 0.986 in the internal validation cohort; moreover, when further validated in an external validation cohort, it demonstrated robust performance, with an AUC as high as 0.915. It’s noted that the MMRnet based on fully automatic deep learning successfully triaged MMR status in different groups by subgroups analysis. The outperformance of the MMRnet model indicated that CT-based full-automatic deep learning could serve as a noninvasive tool for the pretreatment prediction of MMR status in CRC, further enabling the clinical implementation of computer-aided personalized management for CRC patients.

An architecture of networks was designed by paralleling a max-pooling layer with a center-cropping layer to extract information from different scales. In our study, the Resnet was used for image processing, one of the most popular deep learning models for image analysis, which ease the training of networks that are deeper than those used previously [25]. In addition, we applied the GPR model to fuse the information of different axial images for automatic interpretation results of CT images, which can reduce computational complexity and improve the accuracy of fusion results to a certain extent compared with the general linear model. Our results displayed excellent performance using the DL model to stratify MMR status. Although the underlying mechanism of using deep learning to predict MMR status remains unclear, we hypothesized that this could be related to tumor heterogeneity. After all, the widespread application of deep learning in the non-invasive analysis of tumor heterogeneity in the field of oncology has been demonstrated in many previous studies [26, 27]. Additionally, it is reported that dMMR/MSI-H tumors tend to have distinct morphological patterns such as poor differentiation, mucinous differentiation, histological heterogeneity, infiltrating lymphocytes, and significant Crohn-like reactions at the tumor frontier [28, 29]. These histopathological features imply that the dMMR tumors may be more heterogeneous than pMMR tumors, which could be captured by the DL model. In our study, the DL model achieved excellent performance in MMR status prediction of CRC patients, which was comparable to and even superior to the results of previous studies. It was indicated that deep learning could gain more high-dimensional image information about tumor heterogeneity that cannot be captured by human eyes [20,21,22]. In addition, CT-based radiomics are predominant in image-based models and have been proposed to preoperative discriminate dMMR and pMMR in colorectal cancer [30,31,32]. However, tumors were usually segmented manually in majority radiomics researches, which was time-consuming and inevitably caused inter-observer variations. These defects could be avoided by the DL method, and it could adaptively extract features according to the data rather than using predefined features.

In the statistical analysis, we found the age and tumor location present statistical significance in the differentiated dMMR group and pMMR group, indicating that the younger patients and the tumor located in the right colon were more likely to show dMMR status, which was consistent with previous studies [33]. Our study identified that dMMR CRC occurs predominantly in the younger population and on the right side, which may be explained by the fact that the proximal and distal colon have different embryonic origins, leading to distinct biological properties [34, 35]. In the present study, the clinical parameters were not incorporated in the final prediction model, mainly based on the following considerations. Firstly, the fully automatic DL model had outperformed discriminative performance in stratifying MMR status, with an AUC of 0.986 in the internal validation cohort. Meanwhile, the predictive performance was robust in the external validation cohort, which achieved an AUC of 0.915. Hence, we have reason to believe that the DL model developed in the present study could be considered an independent tool to predict the MMR status for CRC patients. In addition, due to the excellent predictive performance of this DL model, the actual predictive efficacy of clinical parameters may be obscured when incorporated into the DL model. Therefore, we tend to construct a relatively simple and feasible DL model rather than a combined model incorporating the predictive clinical parameters. For the clinical parameters, subgroups analyses were performed in addition to the main analysis to assess the prediction performance in different subgroups in this study, and the result revealed that the MMRnet model showed similar satisfying prediction performance in all groups, that indicating the pre-therapeutic CT-based DL model could potentially serve as an alternative approach to predict the MMR status, then help in clinical decision-making for CRC patients.

Our DL model has unique advantages. First, the imaging data could be stored and used repeatedly, in addition, the noninvasive deep learning method is more convenient than the pathological approach based on biological specimens. Secondly, we should be cognizant that deep learning models can produce study results more directly and quickly and improve the problem of being time-consuming and burdensome for busy clinicians compared with ROI-based analysis. This may be attributed to the automatic feature extraction based on deep learning models probably have the ability to capture additional differences within tumors [36, 37]. Moreover, we enrolled a higher sample size of over 1800 CRC patients, which met the requirement of millions of weights to train efficiently for CNN, and our model demonstrated superior performance to previous radiological models in terms of AUC values (0.74–0.82 for other models) [31, 38, 39]. In conclusion, it is suggested that the DL model based on CT images has the potential to stratify tumor MMR status with promising effectiveness prior to surgery, chemotherapy, or immunotherapy in our study.

Despite promising findings, our study has some limitations. First, the retrospective nature of this study inevitably leads to selection bias, and a prospective and multicenter study is required to confirm the impact of our model in the future so that it could serve better in clinical application. Second, although we had a favorable predictive performance in participants from an external institution, it did not reach a high level in the internal institution. We considered that no specific schemes were applied to deal with the parameter variations from different scanners. Third, we explored the DL model based on routine CT images rather than other imaging technologies, such as MRI and Dual-Energy computed tomography, which have been reported in previous studies [39, 40]. Actually, CT is the most suitable and routine examination method for colorectal cancer. Finally, since there is no specific definition of these extracted features of deep learning, their interpretability should further explore image encoding processes in the future.

Conclusions

We have constructed and validated the fully automatic DL model derived from pre-therapeutic CT images, which can stratify MMR status in CRC, with superior performance. This method could provide a potential noninvasive tool to triage MMR status in CRC, thus further personalized medicine.

Availability of data and materials

The datasets used during this study are available from the corresponding author on reasonable request.

Abbreviations

CRC:: Colorectal cancer
dMMR:: DNA mismatch repair deficient
MSI-H:: Microsatellite instability-high
FDA:: The United State Food and Drug Administration
NCCN:: The National Comprehensive Cancer Network
CNN:: Convolutional neural network
CEA:: Carcinoembryonic antigen
CA199:: Cancer antigen 199
CA125:: Cancer antigen 125
CA153:: Cancer antigen 153
CT:: Computed Tomography
DL:: Deep learning
PACS:: The Picture Archiving and Communication System
ROI:: Region of interest
GPR:: Gaussian process regression
ROC:: Receiver operating characteristic curve
AUC:: The area under the receiver operating characteristic curve
PPV:: Positive predictive value
NPV:: Negative predictive value

References

Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2021. CA Cancer J Clin. 2021;71(1):7–33.
Article PubMed Google Scholar
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71(3):209–49.
Article PubMed Google Scholar
Goldstein J, Tran B, Ensor J, Gibbs P, Wong HL, Wong SF, et al. Multicenter retrospective analysis of metastatic colorectal cancer (CRC) with high-level microsatellite instability (MSI-H). Ann Oncol. 2014;25(5):1032–8.
Article CAS PubMed PubMed Central Google Scholar
Venderbosch S, Nagtegaal ID, Maughan TS, Smith CG, Cheadle JP, Fisher D, et al. Mismatch repair status and BRAF mutation status in metastatic colorectal cancer patients: a pooled analysis of the CAIRO, CAIRO2, COIN, and FOCUS studies. Clin Cancer Res. 2014;20(20):5322–30.
Article CAS PubMed PubMed Central Google Scholar
Tran B, Kopetz S, Tie J, Gibbs P, Jiang ZQ, Lieu CH, et al. Impact of BRAF mutation and microsatellite instability on the pattern of metastatic spread and prognosis in metastatic colorectal cancer. Cancer. 2011;117(20):4623–32.
Article CAS PubMed Google Scholar
Llosa NJ, Cruise M, Tam A, Wicks EC, Hechenbleikner EM, Taube JM, et al. The vigorous immune microenvironment of microsatellite instable colon cancer is balanced by multiple counter-inhibitory checkpoints. Cancer Discov. 2015;5(1):43–51.
Article CAS PubMed Google Scholar
Giannakis M, Mu XJ, Shukla SA, Qian ZR, Cohen O, Nishihara R, et al. Genomic correlates of immune-cell infiltrates in colorectal carcinoma. Cell Rep. 2016;15(4):857–65.
Article CAS PubMed PubMed Central Google Scholar
Vilar E, Gruber SB. Microsatellite instability in colorectal cancer-the stable evidence. Nat Rev Clin Oncol. 2010;7(3):153–62.
Article CAS PubMed PubMed Central Google Scholar
Coit DG, Thompson JA, Algazi A, Andtbacka R, Bichakjian CK, Carson WE 3rd, et al. Melanoma, Version 2.2016, NCCN clinical practice guidelines in oncology. Jo Natl Compr Cancer Netw. 2016;14(4):450–73.
Article Google Scholar
Luchini C, Bibeau F, Ligtenberg MJL, Singh N, Nottegar A, Bosse T, et al. ESMO recommendations on microsatellite instability testing for immunotherapy in cancer, and its relationship with PD-1/PD-L1 expression and tumour mutational burden: a systematic review-based approach. Ann Oncol. 2019;30(8):1232–43.
Article CAS PubMed Google Scholar
Cerretelli G, Ager A, Arends MJ, Frayling IM. Molecular pathology of Lynch syndrome. J Pathol. 2020;250(5):518–31.
Article PubMed Google Scholar
Boland CR, Goel A. Microsatellite instability in colorectal cancer. Gastroenterology. 2010;138(6):2073-87.e3.
Article CAS PubMed Google Scholar
Kawakami H, Zaanan A, Sinicrope FA. Microsatellite instability testing and its role in the management of colorectal cancer. Curr Treat Options Oncol. 2015;16(7):30.
Article PubMed PubMed Central Google Scholar
Meng X, Xia W, Xie P, Zhang R, Li W, Wang M, et al. Preoperative radiomic signature based on multiparametric magnetic resonance imaging for noninvasive evaluation of biological characteristics in rectal cancer. Eur Radiol. 2019;29(6):3200–9.
Article PubMed Google Scholar
Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts H. Artificial intelligence in radiology. Nat Rev Cancer. 2018;18(8):500–10.
Article CAS PubMed PubMed Central Google Scholar
Jiang Y, Zhang Z, Yuan Q, Wang W, Wang H, Li T, et al. Predicting peritoneal recurrence and disease-free survival from CT images in gastric cancer with multitask deep learning: a retrospective study. Lancet Digital Health. 2022;4(5):e340–50.
Article CAS PubMed Google Scholar
Truhn D, Schrading S, Haarburger C, Schneider H, Merhof D, Kuhl C. Radiomic versus convolutional neural networks analysis for classification of contrast-enhancing lesions at multiparametric breast MRI. Radiology. 2019;290(2):290–7.
Article PubMed Google Scholar
Wang S, Yu H, Gan Y, Wu Z, Li E, Li X, et al. Mining whole-lung information by artificial intelligence for predicting EGFR genotype and targeted therapy response in lung cancer: a multicohort study. Lancet Digital Health. 2022;4(5):e309–19.
Article CAS PubMed Google Scholar
Yuan Z, Xu T, Cai J, Zhao Y, Cao W, Fichera A, et al. Development and validation of an image-based deep learning algorithm for detection of synchronous peritoneal carcinomatosis in colorectal cancer. Ann Surg. 2022;275(4):e645–51.
Article PubMed Google Scholar
Echle A, Grabsch HI, Quirke P, van den Brandt PA, West NP, Hutchins GGA, et al. Clinical-grade detection of microsatellite instability in colorectal tumors by deep learning. Gastroenterology. 2020;159(4):1406–16.
Article CAS PubMed Google Scholar
Jiang W, Mei WJ, Xu SY, Ling YH, Li WR, Kuang JB, et al. Clinical actionability of triaging DNA mismatch repair deficient colorectal cancer from biopsy samples using deep learning. EBioMedicine. 2022;81: 104120.
Article CAS PubMed PubMed Central Google Scholar
Yamashita R, Long J, Longacre T, Peng L, Berry G, Martin B, et al. Deep learning model for the prediction of microsatellite instability in colorectal cancer: a diagnostic study. Lancet Oncol. 2021;22(1):132–41.
Article PubMed Google Scholar
Yu T, Canales-Rodríguez EJ, Pizzolato M, Piredda GF, Hilbert T, Fischi-Gomez E, et al. Model-informed machine learning for multi-component T(2) relaxometry. Med Image Anal. 2021;69: 101940.
Article PubMed Google Scholar
Liu H, Ong YS, Shen X, Cai J. When Gaussian process meets big data: a review of scalable GPs. IEEE Trans Neural Netw Learn Syst. 2020;31(11):4405–23.
Article PubMed Google Scholar
He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. IEEE Confer Comput Vision Pattern Recogn. 2016;2016:770–8.
Google Scholar
Armato SG, Petrick NA, Huynh BQ, Antropova N, Giger ML. Comparison of breast DCE-MRI contrast time points for predicting response to neoadjuvant chemotherapy using deep convolutional neural network features with transfer learning. Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series. 2017;10134:101340U.
Forghani R, Savadjiev P, Chatterjee A, Muthukrishnan N, Reinhold C, Forghani B. Radiomics and artificial intelligence for biomarker and prediction model development in oncology. Comput Struct Biotechnol J. 2019;17:995–1008.
Article PubMed PubMed Central Google Scholar
De Smedt L, Lemahieu J, Palmans S, Govaere O, Tousseyn T, Van Cutsem E, et al. Microsatellite instable vs stable colon carcinomas: analysis of tumour heterogeneity, inflammation and angiogenesis. Br J Cancer. 2015;113(3):500–9.
Article PubMed PubMed Central Google Scholar
Greenson JK, Huang SC, Herron C, Moreno V, Bonner JD, Tomsho LP, et al. Pathologic predictors of microsatellite instability in colorectal cancer. Am J Surg Pathol. 2009;33(1):126–33.
Article PubMed PubMed Central Google Scholar
Cao Y, Zhang G, Zhang J, Yang Y, Ren J, Yan X, et al. Predicting microsatellite instability status in colorectal cancer based on triphasic enhanced computed tomography radiomics signatures: a multicenter study. Front Oncol. 2021;11: 687771.
Article PubMed PubMed Central Google Scholar
Pei Q, Yi X, Chen C, Pang P, Fu Y, Lei G, et al. Pre-treatment CT-based radiomics nomogram for predicting microsatellite instability status in colorectal cancer. Eur Radiol. 2022;32(1):714–24.
Article CAS PubMed Google Scholar
Ying M, Pan J, Lu G, Zhou S, Fu J, Wang Q, et al. Development and validation of a radiomics-based nomogram for the preoperative prediction of microsatellite instability in colorectal cancer. BMC Cancer. 2022;22(1):524.
Article PubMed PubMed Central Google Scholar
Lee MS, Menter DG, Kopetz S. Right versus left colon cancer biology: integrating the consensus molecular subtypes. J Natl Compr Cancer Netw. 2017;15(3):411–9.
Article Google Scholar
De’Angelis GL, Bottarelli L, Azzoni C, De’Angelis N, Leandro G, Di Mario F, et al. Microsatellite instability in colorectal cancer. Acta Biomed. 2018;89(9-S):97–101.
PubMed Google Scholar
Song Y, Wang L, Ran W, Li G, Xiao Y, Wang X, et al. Effect of tumor location on clinicopathological and molecular markers in colorectal cancer in eastern china patients: an analysis of 2,356 cases. Front Genet. 2020;11:96.
Article CAS PubMed PubMed Central Google Scholar
Shi B, Grimm LJ, Mazurowski MA, Baker JA, Marks JR, King LM, et al. Prediction of occult invasive disease in ductal carcinoma in situ using deep learning features. J Am College Radiol. 2018;15(3):527–34.
Article Google Scholar
Zhou J, Zhang Y, Chang KT, Lee KE, Wang O, Li J, et al. Diagnosis of benign and malignant breast lesions on DCE-MRI by using radiomics and deep learning with consideration of peritumor tissue. J Magn Reson Imaging. 2020;51(3):798–809.
Article PubMed Google Scholar
Chen X, He L, Li Q, Liu L, Li S, Zhang Y, et al. Non-invasive prediction of microsatellite instability in colorectal cancer by a genetic algorithm-enhanced artificial neural network-based CT radiomics signature. Eur Radiol. 2022;33(1):11–22.
Article CAS PubMed Google Scholar
Zhang W, Yin H, Huang Z, Zhao J, Zheng H, He D, et al. Development and validation of MRI-based deep learning models for prediction of microsatellite instability in rectal cancer. Cancer Med. 2021;10(12):4164–73.
Article CAS PubMed PubMed Central Google Scholar
Wu J, Zhang Q, Zhao Y, Liu Y, Chen A, Li X, et al. Radiomics analysis of iodine-based material decomposition images with dual-energy computed tomography imaging for preoperatively predicting microsatellite instability status in colorectal cancer. Front Oncol. 2019;9:1250.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Our research supported by National Key Clinical Discipline.

Funding

The work was supported by the National Natural Science Foundation of China (NO. 82001765, NO. 82272800).

Author information

Wuteng Cao, Huabin Hu, Jirui Guo, Qiyuan Qin and Yanbang Lian contributed equally

Authors and Affiliations

Department of Radiology, The Sixth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510655, Guangdong, China
Wuteng Cao, Jiao Li, Qianyu Wu & Xinhua Wang
Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Diseases, Guangdong Research Institute of Gastroenterology, The Sixth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510655, Guangdong, China
Wuteng Cao, Huabin Hu, Jirui Guo, Qiyuan Qin, Jiao Li, Qianyu Wu, Xinhua Wang & Yanhong Deng
Department of Medical Oncology, The Sixth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510655, Guangdong, China
Huabin Hu & Yanhong Deng
Department of Colorectal Surgery, Department of General Surgery, The Sixth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510655, Guangdong, China
Jirui Guo & Qiyuan Qin
Department of Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
Yanbang Lian
School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-Sen University, Shenzhen, 518107, Guangdong, China
Junhong Chen

Authors

Wuteng Cao
View author publications
You can also search for this author in PubMed Google Scholar
Huabin Hu
View author publications
You can also search for this author in PubMed Google Scholar
Jirui Guo
View author publications
You can also search for this author in PubMed Google Scholar
Qiyuan Qin
View author publications
You can also search for this author in PubMed Google Scholar
Yanbang Lian
View author publications
You can also search for this author in PubMed Google Scholar
Jiao Li
View author publications
You can also search for this author in PubMed Google Scholar
Qianyu Wu
View author publications
You can also search for this author in PubMed Google Scholar
Junhong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xinhua Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yanhong Deng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception and design: WC, HH, YD. Administrative support: WC, HH, QQ, YD. Provision of study materials or patients: WC, HH, YD, YL. Collection and assembly of data: JG, JL, QW, YL, XW. Data analysis and interpretation: WC, HH, JG, QQ, JL, QW, JC. Manuscript writing: All authors. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Yanhong Deng.

Ethics declarations

Ethics approval and consent to participate

The retrospective study was approved by Ethics Committees of the two participating institutions, including the Sixth Affiliated Hospital of Sun Yat-sen University (Guangzhou, China) and the First Affiliated Hospital of Zhengzhou University (Zhengzhou, China). The informed consent requirement was waived due to its retrospective nature. In addition, the study was performed in accordance with the Declaration of Helsinki.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

The flowchart of inclusion and exclusion criteria for eligible patients in the study.

Additional file 2: Figure S2.

The ROC curves of Resnet101 and VGG-19.

Additional file 3: Figure S3.

The ROC curves of DL model based on the CT images of X, Y and Z axis respectively and the Gaussian regression fusion model.

Additional file 4: Figure S4.

The ROC curves of different Gaussian regression models.

Additional file 5.

The representative CT and immunohistochemistry images of different MMR statuses.

Additional file 6.

The results of subgroups analyses.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Cao, W., Hu, H., Guo, J. et al. CT-based deep learning model for the prediction of DNA mismatch repair deficient colorectal cancer: a diagnostic study. J Transl Med 21, 214 (2023). https://doi.org/10.1186/s12967-023-04023-8

Download citation

Received: 28 December 2022
Accepted: 27 February 2023
Published: 22 March 2023
DOI: https://doi.org/10.1186/s12967-023-04023-8

CT-based deep learning model for the prediction of DNA mismatch repair deficient colorectal cancer: a diagnostic study

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Study participants

Clinicopathological characteristics

Identification of MMR status

CT image acquisition

Preliminary experiment

MMRnet development

Predictive performance of the MMRnet

Ablation experiments

Statistical analysis

Results

Patient characteristics

Predictive performance of the ResNet classification model

Subgroup analysis

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1: Figure S1.

Additional file 2: Figure S2.

Additional file 3: Figure S3.

Additional file 4: Figure S4.

Additional file 5.

Additional file 6.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Journal of Translational Medicine

Contact us