Skip to main content

A clinical-radiomics nomogram for the preoperative prediction of lymph node metastasis in colorectal cancer



Accurate lymph node metastasis (LNM) prediction in colorectal cancer (CRC) patients is of great significance for treatment decision making and prognostic evaluation. We aimed to develop and validate a clinical-radiomics nomogram for the individual preoperative prediction of LNM in CRC patients.


We enrolled 766 patients (458 in the training set and 308 in the validation set) with clinicopathologically confirmed CRC. We included nine significant clinical risk factors (age, sex, preoperative carbohydrate antigen 19-9 (CA19-9) level, preoperative carcinoembryonic antigen (CEA) level, tumor size, tumor location, histotype, differentiation and M stage) to build the clinical model. We used analysis of variance (ANOVA), relief and recursive feature elimination (RFE) for feature selection (including clinical risk factors and the imaging features of primary lesions and peripheral lymph nodes), established classification models with logistic regression analysis and selected the respective candidate models by fivefold cross-validation. Then, we combined the clinical risk factors, primary lesion radiomics features and peripheral lymph node radiomics features of the candidate models to establish combined predictive models. Model performance was assessed by the area under the receiver operating characteristic (ROC) curve (AUC). Finally, decision curve analysis (DCA) and a nomogram were used to evaluate the clinical usefulness of the model.


The clinical-primary lesion radiomics-peripheral lymph node radiomics model, with the highest AUC value (0.7606), was regarded as the candidate model and had good discrimination and calibration in both the training and validation sets. DCA demonstrated that the clinical-radiomics nomogram was useful for preoperative prediction in the clinical environment.


The present study proposed a clinical-radiomics nomogram with a combination of clinical risk factors and radiomics features that can potentially be applied in the individualized preoperative prediction of LNM in CRC patients.


Colorectal cancer (CRC) is the third most common cancer and the fourth leading cause of cancer death worldwide [1]. Lymph node metastasis (LNM) is the main metastatic mode of CRC and an important cause of postoperative recurrence and death [2]. At the same time, the metastasis of lymph nodes (LNs) determines the surgical range of CRC, the formulation of adjuvant treatment plans, and the postoperative survival rate of patients [3,4,5]. Currently, surgical treatment is the preferred method for the treatment of CRC. However, the operative treatment is relatively invasive, costly, and associated with considerable surgery-related morbidity. The postoperative mortality of colon and rectal cancer surgery has been reported to be approximately 3–6% [6]. For early colorectal cancer (ECC), local treatment such as endoscopic local resection is considered feasible management for patients without LNM because of its low risk of metastasis (3.6–16.2%) [7, 8]. In other words, additional radical resection may be unnecessary for ECC without LNM to avoid overtreatment. It is well known that LN status is a key factor in the TNM staging of CRC. For patients with stage III and IV CRC, modern treatment regimens have a significant impact on long-term survival. Therefore, determining the status of LNs is the main determinant of adjuvant chemotherapy [5, 9]. Multiple large studies have shown that the LNM rate (LNR) is an important predictive factor for estimating the prognosis of CRC [2, 10, 11]. Therefore, adequate and accurate LN status assessment and prediction are important for treatment decision making and prognostic evaluation in CRC.

At present, an increasing number of studies are trying to explore the relevant risk factors for tumorigenesis and metastasis from the perspective of tumor molecular immunology or genetics [12,13,14]. For instance, Zhang et al. [12] found that IBSP, MMP9, TNFAIP6, DHRS3, RIPK4, and CD200 had diagnostic value for patients with breast cancer bone metastasis. Studies on risk factors associated with CRC are also emerging [7, 8, 15, 16]. For example, the group of Jerome Galon has extensively investigated the important impact of immune-related characteristics and the tumor microenvironment on the prognosis and metastasis of CRC [16]. However, the limitation of using those risk indicators to assess LNM in CRC patients is that some of the influencing factors are related to histopathological information, which cannot be obtained before surgery to guide clinical treatment. In clinical practice, computed tomography (CT) is the most commonly used preoperative imaging method for detecting metastatic lesions and performing tumor staging in patients with colorectal cancer. However, the limitation of CT examination is that it cannot accurately identify the benign and malignant lymph nodes [17]. Hence, we need to develop more powerful and sensitive diagnostic tools to improve the diagnostic accuracy of LNM in CRC patients.

In recent years, with the advancement of computer technology and the realization of the storage and sharing of medical image data, radiomics has become an emerging research method for extracting lesion information using artificial intelligence methods to aid clinical decision making. Radiomics is the process of converting medical images into high-dimensional mineable data through the high-throughput extraction of image features [18, 19]. At present, some studies have demonstrated the feasibility of radiomics for the prediction of CRC LNM. Huang et al. [20] established a radiomics model for predicting CRC LNM (C-index: 0.778) that had favorable discrimination and calibration and incorporated clinicopathological risk factors, the CT-reported LN status and a radiomics signature. However, in their study, only the radiomics features of the primary tumor were extracted and analyzed, and the radiomics features of the LNs themselves were not explored.

Therefore, in this study, we sought to analyze and explore the performance of clinical risk factor and radiomics signatures, including imaging features of the primary lesion and peripheral LNs, in predicting LNM. Then we built and validated a combined clinical-radiomics nomogram as a useful clinical tool for the individualized preoperative prediction of LNM in CRC patients.



Our Institutional Review Board (Fudan University Shanghai Cancer Center Medical Ethics Committee) approved this study and waived the requirement for obtain informed consent. A total of 766 consecutive CRC patients (341 females and 425 males, mean age 58.96 ± 12.03 years, age range 19–87 years) who were treated between May 2012 and December 2015 were enrolled in our study according to the following inclusion criteria: (i) performance of standard contrast-enhanced CT examination less than 10 days before any treatment; (ii) pathological confirmation of CRC; (iii) performance of LN dissection; (iv) availability of complete CT datasets and reconstructed images; and (v) availability of clinical and pathological information. The exclusion criteria were as follows: (i) preoperative neoadjuvant chemotherapy or radiotherapy; and (ii) presence of other tumor diseases during the same period. The patients were allocated to a training and a validation set at a ratio of 6:4 by the scanning date: the early data before the 60th percentile scanning date were allocated to the training set, and the other data were allocated to the validation set. The patient recruitment pathway is shown in Fig. 1.

Fig. 1
figure 1

Flow chart of patients’ recruitment pathway

The baseline clinical characteristics and pathologic data of each patient, including age, sex, preoperative carbohydrate antigen 19-9 (CA19-9) level, preoperative carcinoembryonic antigen (CEA) level, pathological grading, histotype, tumor location, tumor size and M stage, were all derived from medical records and were preoperatively available. The CT reports of the enrolled patients were also collected.

Image acquisition and segmentation

All patients underwent contrast-enhanced abdominal or pelvic CT according to standard clinical scanning protocols (120 kV, 200 mA, and slice thickness of 5 mm) with the Brilliance (Philips Healthcare) and the Sensation 64 (Siemens Healthcare) systems. All images were reconstructed with a standard reconstruction kernel as follows: slice thickness, 5.0 mm; pitch, 1.4 or 0.9; increment, 5.0 mm; matrix, 512 × 512; and field of view, 4.11 cm. We retrieved all reconstructed images from the picture archiving and communication system (PACS) in the hospital for image segmentation and analysis.

Three-dimensional semiautomatic segmentation was conducted by an operator (ML, graduate student) with ITK-SNAP software (v3.6.0; and then validated by a senior expert radiologist (TT, with 11 years of experience in CRC). In this study, the regions of interest (ROIs) included the primary lesion and the peripheral LN, which were drawn on the portal venous enhanced CT images along the lesion contour on each consecutive slice within the borders of the primary tumor lesions, excluding adjacent air, vessels, fat and normal tissues.

Radiomics feature extraction and selection

We extracted radiomics features from each ROI with PyRadiomics ( Because of some impact on the imaging features, we normalized each CT image by centering it at the mean with standard deviation to eliminate the influence of the different ranges of gray values in PyRadiomics. The classes of the features used included first order, shape, gray level cooccurrence matrix, gray level run length matrix, gray level size zone matrix, gray level dependence matrix and neighboring gray tone difference matrix. Since some of the features were redundant, we used the Pearson correlation coefficient to remove the redundant features. We applied normalization on the feature matrix. Each feature vector was subtracted by the mean value and divided by the standard deviation.

Analysis of variance (ANOVA), Relief and recursive feature elimination (RFE) were used for feature selection. For the radiomics features that remained, we selected the top 20 features from all clinical and radiomics features, and the number of selected features was iterated from 1 to 20. The best feature number was found by comparing the performance of fivefold cross-validation on the training cohort by logistic regression (LR).

Model construction

The clinical model was constructed by multivariable LR analysis using nine clinical parameters, including age, sex, preoperative CA19-9 level, preoperative CEA level, pathological grading, histotype, tumor location, tumor size and M stage. Akaike’s information criterion was used as the stopping rule for backward step wise selection. Then, a multivariate LR model was established and its diagnostic performance was tested on the training cohort.

Two radiomics models were constructed with the selected radiomics features from the primary lesion and the peripheral LN ROI respectively.

We also constructed three clinical-radiomics models combining the radiomics and clinical features with a LR mode and evaluated the performance of the combined models.

Hence, a total of six models were constructed to preoperatively predict the LNM of CRC: 1 clinical-only model, 2 radiomics-only models (the LN radiomics model and the lesion radiomics model) and 3 combined clinical-radiomics models (clinical-lesion, clinical-LN and clinical-lesion-LN).

Model validation and comparison

Receiver operating characteristic (ROC) curve analysis was used to assess the model performance in the training and testing cohorts. The area under the ROC curve (AUC) was calculated for quantification. We regarded the model with the highest AUC value in the fivefold cross-validation of the training set as the candidate model. The accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were also calculated at a cutoff value that maximized the value of the Youden index. Moreover, we also boosted the estimation 1000 times and applied paired t-test to determine the 95% confidence intervals (CIs).

Nomogram development and decision curve analysis (DCA)

A nomogram was generated for model visualization and clinical application. To evaluate the added value of radiomics features to clinical features in individually predicting LNM in CRC patients, we developed six decision curves based on the clinical parameters, primary lesion radiomics features, LN radiomics features and the combined clinical-radiomics models. The clinical utility could be demonstrated by quantifying the net benefits of a series of threshold probabilities in the queue.


Clinical features

The LNM rates of the training set and the validation set were 44.15% and 44.54%, respectively. There were no significant differences in patient age, sex, preoperative CEA level, preoperative CA19-9 level, pathological grading, histotype, tumor location, tumor size and M stage between the training set and the validation set (P > 0.05), as shown in Table 1.

Table 1 Demographic comparison between training and validation cohorts

After multivariate LR analysis, age, sex, pathological grading, histotype, preoperative CA19-9 level and preoperative CEA level were independent predictors in the clinical model.

Feature extraction and model construction

A total of 222 radiomics features were extracted from CT images (111 lesion radiomics features and 111 peripheral LN radiomics features). ANOVA, RFE and Relief were used to reduce the feature number to 20. Then we evaluated the candidate feature number in the validation cohort by assessing the discrimination performance. The feature number was validated by fivefold cross-validation. We obtained an candidate subset of 6 clinical features (age, M stage, preoperative CA19-9 level, preoperative CEA level, pathological grading and tumor size), 3 primary lesion radiomics features (Lesion_glcm_ldmn, Lesion_glszm_ZonePercentage, and Lesion_glszm_LowGrayLevelZoneEmphases) and 3 peripheral LN radiomics features (Lymph_glrlm_GrayLevelNonUniformity, and Lymph_firstorder_Kurtosis, Lymph_gldm_GrayLevelNonUniformity) (Fig. 2). Then, we established six predictive models using the above selected features: 1 clinical model, 2 radiomics models and 3 combined clinical-radiomics models.

Fig. 2
figure 2

Feature selection. Selection of the tuning feature number in LR model via fivefold cross validation based on minimum criteria. The y-axis indicates AUC value. The x-axis indicates the feature number. The green point indicates model with highest AUC value in fivefold cross validation

Model comparison and validation

Figure 3 shows the performance of the clinical features, peripheral LN radiomics features and primary lesion radiomics features in the training and validation sets, respectively. By comparing the models, the clinical-lesion-LN model presented the optimal discrimination and best predictive stability with the highest AUC value in both the training cohort (AUC = 0.7606; 95% CI 0.7373–0.7833) and the validation cohort (AUC = 0.7509; 95% CI 0.6901–0.8071) (Table 2, Fig. 4). We also compared different parameters of the LR model for better performance, and there was limited influence on the AUC values in the testing set (Additional file 1: Table S1), therefore, we used the default values. Though the combined model seemed complex, both the lesion radiomics features and the LN radiomics features were extracted from the same CT images, so the combined model with great performance does not bring about an extra burden to clinical examinations. Hence, the clinical-lesion-LN model was identified as the candidate classifier model for LNM in CRC patients.

Fig. 3
figure 3

The ROC curves of the clinical features, peripheral lymph node radiomics features and primary lesion radiomics features in the training and validation sets, respectively

Table 2 Accuracy and predictive value between six models
Fig. 4
figure 4

Model performance in validation cohort. Compared with different features combination, the AUC value is increased when clinical parameters joined either lymph feature or lesion feature. We can also see that the more feature species used, the higher AUC value and the better model performance are

Clinical use

Based on this candidate model, we generated a clinical-lesion radiomics-LN radiomics nomogram for model visualization (Fig. 5). We found that the clinical features have higher classification contributions than the radiomics features in the candidate nomogram. This finding is consistent with the higher AUC value of the clinical model than that of the LN radiomics model and the lesion radiomics model. However, the LN radiomics features greatly help in the decision -making of the candidate model and increases the AUC value from 0.7075 to 0.7509. (Table 2, Fig. 6).

Fig. 5
figure 5

The developed clinical-primary lesion radiomics-peripheral lymph node radiomics nomogram for predicting the probability of lymph node metastases. For age, the number represents the age number. For M stage, 0 represents no distant metastasis while 1 represents distant metastasis. For the preCA19-9, 0 represents within the normal value while 1 represents above the normal value. For the preCEA, 0 represents within the normal value while 1 represents above the normal value. For the grade, 1 for high-middle differentiation and 0 for middle-low differentiation. To use, locate the patient’s age, draw a line straight up to the points axis to establish the score associated with that site. Repeat for the other covariates (M stage, preCA19-9, preCEA, grade and radiomics features). By summing the scores of each point and locating on the total score scale, the estimated probability of lymph node metastases could be determined

Fig. 6
figure 6

Weight of feature coefficients of the candidate model. Feature weights generated by the coefficients of logistic regression model. The “P” after the feature name indicates the positive correlation, and the “N” indicates the negative correlation. lesion GLCM IDMN lesion_CT_original_glcm_Idmn, lesion GLCM ZP lesion_CT_original_glcm_ ZonePercentage, lesion GLSZM LGLZE lesion_glszm_LowGrayLevelZoneEmphaseslymph, lymph GLDM GLNU lymph_glrlm_GrayLevelNonUniformity

DCAs based on the six models are shown in Fig. 7. Both of the single radiomics DCAs showed less benefit in predicting the risk of LNM than the single clinical DCA, which indicates that the clinical features outperformed the radiomics features with more accuracy in LNM prediction. With the addition of radiomic features, the combined clinical-radiomics DCAs achieved more clinical utility, especially the clinical-lesion-LN DCA, which indicated that the nomogram based on the candidate model was a reliable clinical treatment tool for predicting LNM in CRC patients. DCA indicated that when the threshold probability for a patient is within a range from approximately 0.3 to 0.7, the clinical-radiomics nomogram adds more net benefit than the “treat all” or “treat none” strategies.

Fig. 7
figure 7

Decision curve analysis (DCAs). The y-axis measures the net benefit. Threshold probability refers to the point at which a patient considers the benefit of treatment for intermediate to high-risk LNM equivalent to the harm of over-treatment for low-risk disease and thus reflects how the patient weights the benefits and harms associated with the decision. The higher curve at any given threshold probability is the optimal prediction to maximize net benefit. The decision curve showed that the model using all feature sets adds more net benefit than other models


In this study, we built a clinical-radiomics model for the individualized preoperative prediction of LNM in CRC patients, that consists of clinical risk factors and radiomics features (including imaging features of both the primary lesion and peripheral LNs). First, by multivariate LR analysis, we selected 6 clinical risk factors, 3 primary lesion radiomics features and 3 peripheral LN radiomics features as independent risk factors from 9 clinical features, 111 peripheral LN radiomics features and 111 primary lesion radiomics features, respectively. Then, by using the above independent risk factors, we constructed six predictive models. By comparing the models, the clinical-lesion radiomics-LN radiomics model had the highest accuracy in predicting LNM (AUC = 0.7509; 95% CI 0.6901–0.8071; accuracy: 73.7%; sensitivity: 60.29%; specificity: 84.3%: PPV: 75.23%; and NPV: 72.86% in the testing set) and was regarded as the candidate model. Finally, a nomogram was constructed based on the candidate combined model to achieve model visualization and provide clinical utility. Our study demonstrates that this combined clinical-radiomics model has potential as a clinical tool for preoperatively predicting LNM in CRC patients.

In terms of clinical features, our study found that age, M stage, preoperative CA19-9 level, preoperative CEA level, tumor size and pathological grading were independent risk factors associated with LNM in CRC patients, which is consistent with the findings of previous studies [8, 20,21,22]. Among the five potential clinical risk factors, the preoperative CA19-9 level and preoperative CEA level have been considered clinical indicators closely related to the LNM of CRC [20]. Our study also suggested that high preoperative CA19-9 levels and high preoperative CEA levels were important predictors of LNM in CRC patients. This finding might be due to the higher level of CEA and CA19-9, the higher TNM stage of CRC and the stronger tumor cell proliferation ability, which implies poorer differentiation, increased invasiveness and stronger metastasis of the tumor [23]. In addition, young patients with CRC were more likely to have LNM, which might be related to the high metabolism of young patients and the occurrence of mostly poorly differentiated tumors. However, most young people do not have obvious symptoms or are not actively undergoing physical examinations, leading to a low clinical detection rate of CRC in this population. In addition, most of them are generally in advanced stages and have a poor prognosis. Therefore, young people should be encouraged to actively undergo early screening so that early diagnosis and early treatment can be performed. There is no consensus on the correlation between tumor size and LNM which might be related to differences in the number of clinical case samples and methods [15, 24, 25]. However, Wolmark et al. [24] also indicated that the tumor volume of Dukes C phase’s rectal cancer was consistently smaller than that of Dukes B tumor. Similarly, we observed a negative correlation between tumor size and LNM in this study. A recent propensity score matching study indicated that a smaller tumor size was an independent risk factor for cancer-specific survival (CSS) in patients with stage I-III CRC [26]. Hence, clinically, multivariate analysis should be used to select the best clinical treatment method to reduce the residual lesion and micro-metastases, thereby reducing the possibility of tumor recurrence and improving the patient’s prognosis and tumor-free survival.

In addition to assessing clinical risk factors, we also tried to explore and find additional imaging information on contrast-enhanced CT images. Compared with traditional imaging methods that can be analyzed from only the level of anatomical changes, the advantage of radiomics is that high-throughput calculations can extract large numbers of quantitative features from the ROI that reflect the inherent heterogeneity of the lesion, which has been widely used in clinical diagnosis, efficacy evaluations, prognostic evaluations and other aspects [27,28,29]. In 2016, Huang et al. [20] proposed a predictive model of LNM in CRC patients that combined radiomics and clinical indicators (CI = 0.736). However, they analyzed only the radiomics features of the primary lesion. The innovation of our study lies in that we not only analyzed the radiomics features of the primary lesion, but also explored the radiomics features of the peripheral LN themselves. In our study, 3 peripheral LN radiomics features were selected from the 111 screened peripheral LN radiomics features, and these features helped greatly improve the AUC of the final candidate model. This finding indicated that the radiomics features of the LNs themselves might be of greater value in predicting LNM and worthy of further exploration.

Since the combined clinical-lesion-LN radiomics model, with the highest AUC and greatest net benefits across most of the threshold probabilities in DCA, outperformed the single clinical features, single radiomics features and other combined models, it may be the most promising approach to guide clinical management. To facilitate clinical applications, we constructed a clinical-radiomics nomogram combining the radiomics features with preoperative clinical characteristics. The scoring system can generate the probability of LNM to realize the individualized preoperative prediction of the risk of LNM in CRC by clinicians, which is in line with the current development trend of individualized precision medicine.

In conclusion, we recommend that patients with a younger age, distant metastasis, a lower tumor grading, a smaller tumor size and higher preoperative CA19-9 and CEA levels should have regular follow-ups, and the progression of the disease should be closely monitored in these patients. In addition, we suggest that patients with a higher risk of LNM, as screened by the nomogram, should be considered potential surgical candidates to extend survival. The clinical application of this nomogram can reduce the cost of subsequent diagnosis, help develop more reasonable and effective treatment plans and prevent patients from having a poor prognosis.

However, the present study has several limitations. First, as a retrospective study, there may be inevitable selection bias, hence, prospective and external validation studies are needed. Second, the result of this study was from a single institution, so multicenter validation is needed to extend the versatility of the experimental results. Third, only one imaging modality was used in this study, which leads to a limited number of extracted radiomics features. If more imaging modalities (such as MRI and PET-CT) are combined, the feature pool will be effectively expanded to obtain more valuable radiomics information.


In summary, our study established a clinical-radiomics nomogram that combined clinical risk factors and radiomics features (including radiomics features of the primary lesion and peripheral LNs), which can be used as an individualized preoperative noninvasive tool for predicting LNM in patients with CRC, assisting in clinical treatment decision making and achieving precision treatment.

Availability of data and materials

All data generated or analysed during this study are included in this published article.



lymph node metastasis


colorectal cancer


carbohydrate antigen 19-9


carcinoembryonic antigen


analysis of variance


recursive feature elimination


receiver operating characteristic curve


area under the curve


decision curve analysis


early colorectal cancer


lymph node metastasis rate


lymph node


computed tomography


the picture archiving and communication system


regions of interest


logistic regression


positive predictive value


negative predictive value


confidence intervals


decision curve analyses

lesion GLCM IDMN:


lesion GLCM ZP:

lesion_CT_original_glcm_ ZonePercentage



lymph GLDM GLNU:



  1. Ferlay J, et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer. 2015;136(5):E359–86.

    Article  CAS  Google Scholar 

  2. Xue H, et al. Predictive value of lymph node ratio for postoperative distant metastasis of stage III colorectal cancer. Nan Fang Yi Ke Da Xue Xue Bao. 2014;34(4):458–62.

    PubMed  Google Scholar 

  3. Chen SL, Bilchik AJ. More extensive nodal dissection improves survival for stages I to III of colon cancer: a population-based study. Ann Surg. 2006;244(4):602–10.

    PubMed  PubMed Central  Google Scholar 

  4. Engstrom PF, et al. NCCN clinical practice guidelines in oncology: colon cancer. J Natl Compr Cancer Netw. 2009;7(8):778–831.

    Article  Google Scholar 

  5. Watanabe T, et al. Japanese Society for Cancer of the Colon and Rectum (JSCCR) guidelines 2016 for the treatment of colorectal cancer. Int J Clin Oncol. 2018;23(1):1–34.

    Article  Google Scholar 

  6. Iversen LH, et al. Seasonal variation in short-term mortality after surgery for colorectal cancer? Colorectal Dis. 2010;12(7 Online):e31–6.

    CAS  PubMed  Google Scholar 

  7. Han J, et al. Predictive factors for lymph node metastasis in submucosal invasive colorectal carcinoma: a new proposal of depth of invasion for radical surgery. World J Surg. 2018;42(8):2635–41.

    Article  Google Scholar 

  8. Choi JY, et al. Meta-analysis of predictive clinicopathologic factors for lymph node metastasis in patients with early colorectal carcinoma. J Korean Med Sci. 2015;30(4):398–406.

    Article  Google Scholar 

  9. Sabbagh C, et al. A lymph node ratio of 10% is predictive of survival in stage III colon cancer: a French regional study. Int Surg. 2014;99(4):344–53.

    Article  Google Scholar 

  10. Kwon TS, et al. Novel methods of lymph node evaluation for predicting the prognosis of colorectal cancer patients with inadequate lymph node harvest. Cancer Res Treat. 2016;48(1):216–24.

    Article  Google Scholar 

  11. Li DG, et al. Predictive value of the number of harvested lymph nodes and cut-off for lymph node ratio in the prognosis of stage II and III colorectal cancer patients. J Invest Surg. 2017;32:1–7.

    CAS  Google Scholar 

  12. Zhang Y, He W, Zhang S. Seeking for correlative genes and signaling pathways with bone metastasis from breast cancer by integrated analysis. Front Oncol. 2019;9:138.

    Article  Google Scholar 

  13. Zhang S, et al. Icotinib enhances lung cancer cell radiosensitivity in vitro and in vivo by inhibiting MAPK/ERK and AKT activation. Clin Exp Pharmacol Physiol. 2018;45(9):969–77.

    Article  CAS  Google Scholar 

  14. Angell HK, et al. The immunoscore: colon cancer and beyond. Clin Cancer Res. 2019.

    Article  PubMed  Google Scholar 

  15. Dev K, Veerenderkumar KV, Krishnamurthy S. Incidence and predictive model for lateral pelvic lymph node metastasis in lower rectal cancer. Indian J Surg Oncol. 2018;9(2):150–6.

    Article  Google Scholar 

  16. Van den Eynde M, et al. The link between the multiverse of immune microenvironments in metastases and the survival of colorectal cancer patients. Cancer Cell. 2018;34(6):1012.e3–1026.e3.

    Google Scholar 

  17. Dighe S, et al. Diagnostic precision of CT in local staging of colon cancers: a meta-analysis. Clin Radiol. 2010;65(9):708–19.

    Article  CAS  Google Scholar 

  18. Lambin P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;48(4):441–6.

    Article  Google Scholar 

  19. Kumar V, et al. Radiomics: the process and the challenges. Magn Reson Imaging. 2012;30(9):1234–48.

    Article  Google Scholar 

  20. Huang YQ, et al. Development and validation of a radiomics nomogram for preoperative prediction of lymph node metastasis in colorectal cancer. J Clin Oncol. 2016;34(18):2157–64.

    Article  Google Scholar 

  21. Caputo D, et al. T1 colorectal cancer: poor histological grading is predictive of lymph-node metastases. Int J Surg. 2014;12(3):209–12.

    Article  Google Scholar 

  22. Ali K, et al. Predictive Factors of thoracic lymph node metastasis accompanying pulmonary metastasis from colorectal cancer. Thorac Cardiovasc Surg. 2018;67(08):683–7.

    PubMed  Google Scholar 

  23. Wu XZ, Ma F, Wang XL. Serological diagnostic factors for liver metastasis in patients with colorectal cancer. World J Gastroenterol. 2010;16(32):4084–8.

    Article  CAS  Google Scholar 

  24. Wolmark N, et al. Tumor size and regional lymph node metastasis in colorectal cancer. A preliminary analysis from the NSABP clinical trials. Cancer. 1983;51(7):1315–22.

    Article  CAS  Google Scholar 

  25. Wolmark N, et al. The relationship of depth of penetration and tumor size to the number of positive nodes in Dukes C colorectal cancer. Cancer. 1984;53(12):2707–12.

    Article  CAS  Google Scholar 

  26. Li X, et al. Prognostic value of the tumor size in resectable colorectal cancer with different primary locations: a retrospective study with the propensity score matching. J Cancer. 2019;10(2):313–22.

    Article  CAS  Google Scholar 

  27. Suzuki C, et al. Preoperative CT-based predictive factors for resectability and medium-term overall survival in patients with peritoneal carcinomatosis from colorectal cancer. Clin Radiol. 2018;73(8):756-e11.

    Article  Google Scholar 

  28. Liu Z, et al. Radiomics analysis for evaluation of pathological complete response to neoadjuvant chemoradiotherapy in locally advanced rectal cancer. Clin Cancer Res. 2017;23(23):7253–62.

    Article  CAS  Google Scholar 

  29. Huang Y, et al. Individualized prediction of perineural invasion in colorectal cancer: development and validation of a radiomics prediction model. Chin J Cancer Res. 2018;30(1):40–50.

    Article  Google Scholar 

Download references


We appreciate the help from other teammates.


National Natural Science Foundation of China (Grant Nos. 81971687, 61731009). Shanghai Sailing Program (Grant No. 19YF1409900).

Author information

Authors and Affiliations



All authors have had access to the data and all drafts of the manuscript. Specific contributions are as follows: study design: TT, GY, GC; data collection: ML, WD; data management and analysis: ML, JZ, YD, YY, TT; manuscript drafting: ML, JZ; manuscript review: all. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Guang Yang or Tong Tong.

Ethics declarations

Ethics approval and consent to participate

Our Institutional Review Board (Fudan University Shanghai Cancer Center Medical Ethics Committee) approved this study and waived the need for informed consent from patients.

Consent for publication

All authors have read and approved the content and agree to submit for consideration for publication in the journal.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1.

Additional material about the additional descriptions of the clinical features, model building and statistical for software and Table S1 (Performance of different parameters of model).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, M., Zhang, J., Dan, Y. et al. A clinical-radiomics nomogram for the preoperative prediction of lymph node metastasis in colorectal cancer. J Transl Med 18, 46 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: