Skip to main content

Translational value of IDH1 and DNA methylation biomarkers in diagnosing lung cancers: a novel diagnostic panel of stage and histology-specificity



Lung cancer is the leading cause of cancer-related death worldwide, and the timely and serial assessment of low-dose computed tomography (LDCT) in high-risk populations remains a challenge. Furthermore, testing a single biomarker for the diagnosis of lung cancers is of relatively low effectiveness. Thus, a stronger diagnostic combination of blood biomarkers is needed to improve the diagnosis of non-small cell lung cancer (NSCLC).


The blood levels of individual biomarkers [IDH1, DNA methylation of short stature homeobox 2 gene (SHOX2), and prostaglandin E receptor 4 gene (PTGER4)] were measured and statistically analyzed in samples from healthy controls and patients with lung cancer. In total, 221 candidates were enrolled and randomly assigned into two groups for the training and validation of a diagnostic panel. Additionally, a subgroup analysis was performed in the whole cohort.


A newly combined 3-marker diagnostic model for lung cancers was established and validated with area under the receiver operating characteristic (ROC) curve (AUC) values ranging from 0.835 to 0.905 in independent groups showing significantly stronger diagnostic value compared with a single tested biomarker. The sensitivity of the diagnostic model was as high as 86.1% and 80.0% in the training and validation sets, respectively. Although no apparent differences were found between the 3-marker and 2-marker models, the high clinical T-stage and histological type specificity of IDH1 and two other methylated DNA biomarkers were demonstrated in the subgroup analysis.


The combination of single biomarkers with high stage-specificity and histological type specificity (SHOX2 and PTGER4 DNA methylation and IDH1) showed better diagnostic performance in the detection of lung cancers compared with single marker assessment. A greater clinical utility of the panel may be developed by adding demographic/epidemiologic characteristics.


Despite great progress in the surgical and medical management of malignant tumors, lung cancer still ranks first as the most commonly diagnosed malignant disease and is the leading cause of cancer death for both males and females worldwide [1]. Although the randomized National Lung Screening Trial has demonstrated the advantage of low-dose computed tomography (LDCT) compared to chest X-rays for screening lung cancers [2], annual screening was performed on only 4% of patients in high-risk populations [3]; moreover, its relatively lower specificity for indeterminate lung nodules, poor repeatability and related economic burden on the public health system increased the diagnostic cost for lung cancers. Owing to the untimely diagnosis and limited treatment options, the 5-year survival rate for advanced-stage lung cancers is only 5%, which is extremely low compared to localized and regional stage lung cancers (56% and 29%, respectively) [4]. Thus, developing a novel diagnostic panel is of priority to improve the treatment modality and survival rate of lung cancer patients.

As the discriminative diagnostic potential of biological fluid-based biomarkers for the detection of lung cancers has been shown [5,6,7,8,9], the improvement of cancer management and treatment have increasingly relied on a combination of biomarkers. Abnormal epigenetic events are suggested to occur in the early phase of carcinogenesis [10, 11], making DNA methylation biomarkers available for the early detection and monitoring of cancers [12, 13]. Based on previous studies, short stature homeobox 2 (SHOX2) gene DNA was confirmed to have higher methylation levels in patients with small cell lung cancer (SCLC) and squamous cell carcinoma (SCC) than in patients with adenocarcinoma (Ade) and in healthy controls [14,15,16], indicating its potential as a diagnostic biomarker for specific types of lung cancers. Additionally, other clinical applications such as the assessment of lung cancer stages by measuring the SHOX2 methylation levels of lymph nodes [17] and its efficiency for monitoring the response to chemotherapy [18] have been studied. The prostaglandin E2 receptor 4 gene (PTGER4) product, EP4, is important in tumor progression mediated by prostaglandin E2 [19,20,21], which has been confirmed to have an upregulated expression in cancer tissues compared with that in normal tissues [20, 22]. Although, the discriminative ability in detection of lung cancers of the SHOX2/PTGER4 DNA methylation marker panel has been confirmed. However, it was concluded in a population containing relative higher percentage of non-Ade lung cancers (Ade: non-Ade, 46:71) [23]. Thus, in our present study, the two methylation DNA biomarkers were enrolled as candidate markers for analysis in a population mainly containing lung Ades. In addition, isocitrate dehydrogenase 1 (IDH1) has been proven to have an important role in promoting tumor growth in NSCLC [24] and could be used as a blood biomarker for the detection of NSCLC, particularly lung Ade [25].

From the perspective of building a marker panel with high diagnostic efficiency and wide coverage of lung cancers, the plasma levels of protein and methylation levels of circulating DNA were measured and analyzed.


Blood samples and patient details

Between January 2017 and December 2018, 221 candidates from the Cancer Hospital of the Chinese Academy of Medical Sciences were enrolled in our study and randomly divided into the training and validation groups. Blood samples were collected from lung cancer patients who met the following criteria: (a) no history of other specific malignant diseases; and (b) no anticancer treatments before the blood sample collection process. Healthy individuals in the control group were selected from Physical Examination Centers during the same period; these individuals were confirmed to have no lung nodules by chest X-ray or thin-sliced computed tomography as well as no history of malignant tumors. This study was conducted following national institutional ethical policies and approved by the Chinese Academy of Medical Sciences Institutional Review Board.

DNA preparation and bisulfite conversion from plasma specimens

To ensure objectivity, laboratory personnel were blinded to the identities of the samples. For each subject, 10 mL of blood was collected in 10 mL BD Vacutainer ethylenediaminetetraacetic acid tubes (BD Biosciences, San Jose, CA). Each tube was centrifuged for 12 min at 1350×g ± 150×g at room temperature. Plasma was transferred without disturbing the buffy coat to a clean 15 mL conical tube. The sample was centrifuged a second time for 12 min at 1350×g ± 150×g. Plasma was transferred without disturbing the pellet to a clean 15 mL conical tube and stored at − 20 °C. If not assayed immediately, plasma was stored at − 20 °C for up to 2 weeks. The DNA extraction from plasma samples, bisulfite conversion, and purification were performed using the Plasma Preparation Kit (Biochain (Beijing) Technology Co., Ltd., Beijing, China). DNA was eluted in 60 μL of elution buffer. If not used immediately, the eluted DNA was stored at − 20 °C for up to 3 days.

Real-time PCR

The PCR method was performed as described in the kit for gene methylation (real-time PCR) (Biochain Technology Co., Ltd., Beijing, China). PCR amplification was performed in triplicate for each sample. SHOX2, PTGER4 and beta-actin (ACTB) control reactions were performed in the same reaction. Real-time PCR was performed on a 7500 Real-time PCR instrument (Applied Biosystems, CA) using the following cycling conditions: activation at 95 °C for 20 min, 45 cycles of 62 °C for 5 s, 55.5 °C for 35 s, 93 °C for 30 s and final cooling to 40 °C for 5 s. The heating rates were 2.4 °C/s, and the cooling rates were 2.4 °C/s. Data were acquired at the end of each 55.5 °C step. Analysis was performed using the ABI 7500 SDS software V2.0.5.

ELISA detection of serum IDH1

For each individual, 4 mL of blood was collected in 4 mL BD Vacutainer ethylenediaminetetraacetic acid tubes (BD Biosciences, San Jose, CA) and then centrifuged at 3000×g for 5 min. The supernatant was divided into 500 μL aliquots, stored at − 80 °C and detected simultaneously. The detection of serum IDH1 levels was conducted with an ELISA kit (Biochain Technology Co., Ltd., Beijing, China) following the manufacturer’s instructions. Before the assay, the serum samples and test kits were equilibrated to room temperature. Then, 50 μL of each sample was added to the appropriate wells and covered with a film. The plate was incubated at 37 °C for 2 h. The liquid was discarded from each well, and the plate was washed 5 times. Then, 50 μL of testing solution A was added to each well and covered with a film. The plate was incubated at 25 °C for 1 h. After 5 washes with buffer, 50 μL of testing solution B was added to each well, and the plate was incubated at 25 °C for 1 h. Then, 50 μL of mixed substrates A and B were added to each well, and the plate was incubated at 25 °C for 15 min. To stop the reaction, 50 μL of stop solution was added. The optical density (O.D.) was detected at 450 nm on a Multiskan FC microplate photometer (ThermoFisher Scientific, Waltham, USA). The concentration of IDH1 was calculated with a quadratic polynomial fitting curve.

Statistical analysis of the data

The Mann–Whitney U test or t test was used for the statistical analysis of biomarker levels between the normal controls and lung cancer patients. To build a predictive model for the discrimination of lung cancers, the forward logistic regression method was used to screen variables. The area under the receiver operating characteristic (ROC) curve (AUC) with a 95% confidence interval (CI) was applied to compare the diagnostic performance of single markers and the model. In addition, Youden’s index of the training group was used to determine the cut-off value of the predictive model. The diagnostic performances of our model in different subgroups were analyzed by comparing the AUCs of the ROC curves. Other descriptive statistics, such as sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and standard deviation (SD), were calculated in this study.

Software including SPSS 24.0, GraphPad Prism 5.0, MedCalc (version and Microsoft Excel were used for statistical analysis. p values were two tailed. Differences were considered statistically significant with a p value less than 0.05.


Basic characteristics of the two randomly assigned groups

In our study, the whole enrolled population was randomly assigned into the training and validation groups. There were 170 candidates (55 healthy controls and 115 patients with lung cancers) in the training group, and the remaining 16 healthy controls and 35 lung cancer patients were randomly selected as the validation cohort. In both groups, the ratios of healthy controls to patients were near 1:2. And, in the training and validation groups, most patients (78.3% and 62.9%, respectively) had a diagnosis with lung Ade. Furthermore, the enrolled population was divided according to tumor size and pathological type, as shown in Table 1, to analyze the diagnostic performances of subgroups. Other basic information is shown in Table 1.

Table 1 Basic information

Basic analysis of single biomarkers in the training and validation groups

The levels of single biomarkers in lung cancer patients and healthy controls, as shown in Fig. 1, were separately assessed in the two cohorts. In the training group, the methylation levels of SHOX2 and PTGER4 in cancer patients were significantly higher than those in healthy controls (p < 0.05). In addition, the levels of IDH1 in lung cancer patients were significantly elevated (p < 0.05).

Fig. 1
figure 1

Comparisons of biomarkers between lung cancer patients and healthy controls in the training and validation cohorts. Shox2 short stature homeobox 2 gene, ep4 prostaglandin E2 receptor 4 gene (PTGER4), idh1 isocitrate dehydrogenase 1, y-axis cycle threshold, H healthy controls, C lung cancer patients, **p value < 0.05

Similar conditions in the validation group, i.e., significantly increased methylation levels of PTGER4 and elevated levels of IDH1, were observed in lung cancer patients compared with healthy controls (p < 0.05). Although nonsignificant, the methylation levels of SHOX2 in patients with lung cancers were higher than those in healthy controls.

Construction, analysis and validation of the three-biomarker diagnostic panel in two independent groups

By testing the levels of the three biomarkers together, a predictive model was built based on the results of the forward logistic regression analysis (Additional file 1: Table S1). As a result, all three measured biomarkers were enrolled, contributing to the final panel = 16.821 + 0.435 * IDH1 − 0.147 * SHOX2 − 0.28 * PTGER4.

The diagnostic abilities of the individual biomarkers and the panel were analyzed in three groups separately (Table 2). Among the three enrolled tested biomarkers, IDH1 had the highest AUCs of 0.781 (95% CI 0.711–0.841), 0.755 (95% CI 0.615–0.865), and 0.78 (95% CI 0.719–0.833) in the training, validation and whole group, respectively (Fig. 2a–c). However, in the training cohort, the diagnostic panel was confirmed with the highest AUC of 0.835 (95% CI 0.770–0.887), which was significantly better than any single marker (p < 0.05). In addition, the panel exhibited the highest AUCs of 0.905 (95% CI 0.790–0.969) and 0.856 (95% CI 0.803–0.899) in the validation group and the whole group, respectively, demonstrating a significantly stronger diagnostic value compared with any single candidate biomarker (p < 0.05, Table 2).

Table 2 Comparisons of diagnostic models and individual markers in different groups
Fig. 2
figure 2

Receiver operating characteristic (ROC) curves for each single tested biomarker and diagnostic model in different cohorts. Shox2 short stature homeobox 2 gene, ep4 prostaglandin E2 receptor 4 gene (PTGER4), idh1 isocitrate dehydrogenase 1. ac Comparison of the diagnostic ability of the primary 3-marker model (IDH1 + SHOX2 + PTGER4) and each biomarker in the training cohort, the validation cohort, and the whole cohort, respectively. d Analysis of the diagnostic ability of the primary 3-marker model and the 2-marker models in the whole cohort. eh Subgroup analysis of the model in the whole cohort. e, f Diagnostic performance in the group of early stage lung cancers and advanced-stage lung cancers, respectively. g, h ROCs of the primary model in the non-Ade lung cancer group (g) and Ade lung cancer group (h)

In this study, a Youden’s index of 0.534 was chosen to select the cut-off value, 0.569 (Fig. 3). Based on the cut-off value of the diagnostic model, the sensitivity of the training group and validation group were calculated as 86.1% and 80.0%, respectively. The specificity of the model in the validation group was as high as 87.5%; however, it was relatively lower in the training group, 67.3%. In addition, the high diagnostic accuracy, 80.0% and 82.4% in the training and validation groups, respectively, demonstrated its clinical advantage in discriminating lung cancers and healthy controls over any single biomarker. Other statistical information is specifically listed in Table 3.

Fig. 3
figure 3

Resulting sensitivity and specificity of the primary 3-marker diagnostic model in the training group when using the alternative cut-offs. A cut-off of 0.596 was selected with a specificity of 67.3% and a sensitivity of 86.1%

Table 3 Basic assessment of the model in the training and validation groups

Analysis of the diagnostic model with different histological subtypes and T-stages in the whole cohort

To investigate the diagnostic ability with respect to different pathological types and different tumor sizes of lung cancers, a subgroup analysis of the model was performed in the whole group. First, IDH1 and one methylation DNA biomarker were selected to build two-marker models based on the whole cohort for further analysis. As shown in Fig. 2d, no significant differences were found between the primary three-marker panel and two other two-marker panels. However, the diagnostic abilities of the two-marker panels were significantly stronger than each enrolled individual biomarker (Additional file 1: Table S2, Additional file 2: Figures S1–2).

Second, a tumor size of 5 cm was chosen for dividing the patients into subgroups of early stage (T1–2 stage) and advanced stage (T3–4 stage) according to the 8th TNM staging system. In the group of patients with early stage lung cancers, the diagnostic panel demonstrated significantly better performance than the two DNA methylated biomarkers, with an AUC of 0.832 (95% CI 0.771–0.881) (p < 0.05, Table 4). Furthermore, the AUC (0.967, 95% CI 0.909 − 0.992) of the 3-biomarker panel was significantly higher than IDH1 and the PTGER4 gene in patients with tumors larger than 5 cm (p < 0.0001 and p = 0.0038, respectively, Table 4). Among the individual biomarkers, IDH1 had the significantly highest AUC value compared with the two other methylation biomarkers in the group of early stage lung cancers (Fig. 2e, Additional file 1: Table S3). However, in the T3–4 stage group, the AUC values of the two enrolled methylated biomarkers were higher than 0.92 (Fig. 2f), showing significantly better discriminative ability in contrast to IDH1 (AUC: 0.731, 95% CI 0.732–0.815, Table 4).

Table 4 Subgroup analysis of the 3-biomarker model

Additionally, the diagnostic abilities of our panel in different pathological types of lung cancers were assessed. As the proportion of lung Ade was higher than other subtypes in the studied population, patients were divided into two groups of Ade and non-Ade lung cancers. As shown in Table 4, the panel showed significantly stronger diagnostic ability (AUC: 0.963, 95% CI 0.909–0.990) in diagnosing patients with non-Ade than the PTGER4 and IDH1 biomarkers, with p = 0.0034 and p < 0.0001, respectively (Fig. 2g). And, the AUC value of IDH1 (AUC: 0.791, 95% CI 0.724–0.847) was lower than that of the established three-biomarker combination (AUC: 0.819, 95% CI 0.756–0.872) in the lung Ade group, but nonsignificantly (p = 0.2415, Fig. 2h). The individual comparisons of single biomarkers demonstrated that the IDH1 biomarker had significantly lower efficacy in discriminating patients with SCC or SCLC than the DNA methylation of the SHOX2 gene (p = 0.0106, Additional file 1: Table S3).


Although the substantial improvements of radiological technology and advanced clinical treatment options, such as the shift from chest X-ray to high-resolution computed tomography (HRCT), have contributed to the progress of the curative modality of lung cancer, lung cancer patients are generally diagnosed at advanced stages [26] with a poor survival rate [4, 27]. Additionally, Ade has exceeded SCC, becoming the most common histological subtype of lung cancer [28, 29] due to the design of cigarette filter ventilation and changes in the composition of cigarettes [29,30,31]. Thus, understanding the epidemiological changes in lung cancers is essential for building a diagnostic tool with stability and comprehensive ability for discriminating lung cancers in the general population.

Based on our previous study, the diagnostic effectiveness of IDH1 in lung cancers, especially in patients with Ade, has been validated. However, its plasma level in SCC was significantly lower than that in Ade [25] and was uncertain when compared with that in SCLC. Since the result of the low predictive value of individual candidate biomarkers could be enhanced by combination [32,33,34], IDH1 was chosen to construct a panel to discriminate lung cancers as a candidate biomarker of the most common lung Ade. On the other hand, almost all cancers exhibit altered DNA methylation, an epigenetic marker that contributes to cancer development [35] and progression [36], making these epigenetic events a source of biomarkers and means for the detection of cancers [37, 38]. Therefore, the DNA methylation of the SHOX2 gene has been qualified as a sensitive biomarker for lung cancer diagnosis and staging, specifically for SCC and SCLC [14, 15, 17, 39], accompanied by an analysis of the methylation level of PTGER4 in our study. Moreover, the AUC value of IDH1 in both the training and validation groups was nonsignificantly higher than that of any other biomarkers, except for the levels of IDH1 and methylated PTGER4 DNA in the training group (AUCs of 0.781 and 0.67, respectively, p = 0.0351, Additional file 1: Table S4). Between the two methylation DNA biomarkers, nonsignificant differences were observed either in the training and validation cohorts (Additional file 1: Table S4). Although no significant differences were found between our primary 3-marker model and the other two 2-marker models (Fig. 2d), however in the subgroup analysis of non-Ade lung cancers, the primary 3-marker showed a significantly higher value of AUC than that of the other two 2-marker panels as shown in the Additional file 1: Table S5, Additional file 3: Figure S3. These results indicate that the diagnostic efficacy of the three biomarkers was basically comparable and demonstrate the necessity of their combination. In the statistical analysis, logistic regression was performed on the randomly assigned training group, and the final configuration enrolled the three tested blood biomarkers for diagnosing lung cancers, which was validated in an independent group.

Prof. Gunter had confirmed the discriminative performance of the SHOX2/PTGER4 DNA methylation marker panel with AUC values of 0.91–0.98, in a population with up to 61% non-Ade lung cancers [23]. In contrast, the ratio of Ade lung cancer and non-Ade lung cancers in our training group was nearly 4:1 (Ade:non-Ade, 90:25) indicating that our panel was more consistent with the current epidemiological changes of lung cancers [28, 29]. A further analysis of the single enrolled biomarkers, as shown in Fig. 2e–h, implied high specificity in groups of different tumor stages and histological types. Elevated expression levels of IDH1 in tissues of earlier-stage lung cancers have been previously confirmed, but no significant difference was found between Ade and SCC [24]. Therefore, further studies are needed to explore the reason for the high specificity of IDH1 in Ade lung cancers. On the other hand, the relatively stronger stage-specificity of methylation biomarkers in advanced-stage lung cancers may be explained by the higher amount of cell-free DNA in the bloodstream of patients with larger-sized tumors [14]. Additionally, as SCC and SCLC are commonly located centrally in the hilum, non-Ade lung cancers are more likely to invade large blood vessels [40,41,42]. As a consequence, higher levels of candidate DNA methylation will be detected by the blood test compared to the peripherally located Ade lung cancers in the lung lobes. With attributable risks of 85% and 60% to lung cancers in males and females caused by cigarette smoking [2, 43], respectively, the association between alterations of DNA methylation status and tobacco smoking has also been discussed and studied [44, 45]. Hence, we speculate that the strength of the predictive model may be increased when adding the smoking habit as a covariate in the final configuration.

Interestingly, the ROC curve of our primary 3-marker model almost overlapped with those of the other two 2-marker panels (combination of IDH1 and one single methylation biomarker), as shown in Fig. 2d. However, the 3-marker panels showed significantly stronger diagnostic performance compared with any 2-marker model in the analysis of non-Ade group (Additional file 1: Table S5, Additional file 3: Figure S3) demonstrating the advantage of a combined assessment over individual marker tests. In addition, based on the outstanding performance of our 3-marker model (AUC of 0.905) in the validation cohort, the probability of overfitting could be excluded as well. In our study, Youden’s index was used for the assessment of our diagnostic model. Although the specificity in the training group was below 70%, the PPVs in the training and validation groups were as high as 84.6% and 93.3%, respectively. Because radiological technology has been developed to provide more useful information, a comprehensive assessment of CT information and blood biomarkers may further enhance the PPV. In addition, healthy controls who were misclassified as the positive group should be closely followed-up with annual examinations in cases of lung cancers appearing in the absence of symptoms.

At the aspect of underlying molecular mechanisms of high diagnostic efficiency of wild-type IDH1 in NSCLC, we speculate that it is an important enzyme for redox state, DNA repair and epigenetic regulation in malignant tumors and its activation through mutation or overexpression plays a critical role in the initiation and development of different cancers, including NSCLC. Furthermore, we suggest that the combination of plasma IDH1 levels and DNA methylation levels of SHOX2 and/or PTGER4 showed better efficiency because the overexpression of wild-type IDH1 could produce alpha-ketoglutarate and its subsequent activation of DNA-demethylating enzymes such as TET2 might alter the DNA methylation levels of SHOX2/PTGER4 in NSCLC [46]. Therefore, the specific underlying mechanisms still need our further validation in the following researches.


Individual biomarkers of good stage-specificity and histological type specificity were confirmed and combined in our study. Additionally, our newly established panel showed highly significantly stronger diagnostic efficacy compared with the enrolled single factors. Thus, the diagnostic efficacy of lung cancers could be improved by testing blood markers in conjunction.


Some limitations existed in our study as follows. First, a smoking history of healthy controls was not available, resulting in no case matching based on smoking habits. Second, our enrolled cohort was from a single medical institution, and a larger number of cohorts from multiple medical centers will be studied to eliminate underlying confounding variables in future studies. Third, blood levels of conventional biomarkers for lung cancers, such as CEA, CA125, CYFRA 21-1 or SCC, were not routinely tested. Consequently, the differences in the diagnostic value between our model and these commonly used markers could not be assessed.

Availability of data and materials

Not applicable.





area under the curve


cancer antigen 125


carcinoembryonic antigen


95% confidence interval

CYFRA 21-1:

cytokeratin 19 fragment


prostaglandin E2 receptor 4 gene (PTGER4)


high-resolution computed tomography


isocitrate dehydrogenase 1


low-dose computed tomography


National Lung Screening Trial


negative predictive value


non-small cell lung cancer


positive predictive value


receiver operating characteristic


squamous cell carcinoma


small cell lung cancer




short stature homeobox 2 gene




standard deviation


Youden’s index


  1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424.

    Article  PubMed  Google Scholar 

  2. National Lung Screening Trial Research T, Aberle DR, Adams AM, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med. 2011;365(5):395–409.

    Article  Google Scholar 

  3. Jemal A, Fedewa SA. Lung cancer screening with low-dose computed tomography in the United States-2010 to 2015. JAMA Oncol. 2017;3(9):1278–81.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2018. CA Cancer J Clin. 2018;68(1):7–30.

    Article  PubMed  Google Scholar 

  5. Hubers AJ, Heideman DA, Burgers SA, et al. DNA hypermethylation analysis in sputum for the diagnosis of lung cancer: training validation set approach. Br J Cancer. 2015;112(6):1105–13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Integrative Analysis of Lung Cancer E, Risk Consortium for Early Detection of Lung C, Guida F, et al. Assessment of lung cancer risk on the basis of a biomarker panel of circulating proteins. JAMA Oncol. 2018;4:e182078.

    Article  Google Scholar 

  7. Vargas AJ, Harris CC. Biomarker development in the precision medicine era: lung cancer as a case study. Nat Rev Cancer. 2016;16(8):525–37.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Pan J, Song G, Chen D, et al. Identification of serological biomarkers for early diagnosis of lung cancer using a protein array-based approach. Mol Cell Proteom. 2017;16(12):2069–78.

    Article  CAS  Google Scholar 

  9. Roos L, van Dongen J, Bell CG, et al. Integrative DNA methylome analysis of pan-cancer biomarkers in cancer discordant monozygotic twin-pairs. Clin Epigenet. 2016;8:7.

    Article  CAS  Google Scholar 

  10. Ting AH, McGarvey KM, Baylin SB. The cancer epigenome–components and functional correlates. Genes Dev. 2006;20(23):3215–31.

    Article  CAS  PubMed  Google Scholar 

  11. Vaz M, Hwang SY, Kagiampakis I, et al. Chronic cigarette smoke-induced epigenomic changes precede sensitization of bronchial epithelial cells to single-step transformation by KRAS mutations. Cancer Cell. 2017;32(3):360.e366–376.e366.

    Article  CAS  Google Scholar 

  12. Sandoval J, Mendez-Gonzalez J, Nadal E, et al. A prognostic DNA methylation signature for stage I non-small-cell lung cancer. J Clin Oncol. 2013;31(32):4140–7.

    Article  PubMed  Google Scholar 

  13. Dor Y, Cedar H. Principles of DNA methylation and their implications for biology and medicine. Lancet (London, England). 2018;392(10149):777–86.

    Article  CAS  Google Scholar 

  14. Kneip C, Schmidt B, Seegebarth A, et al. SHOX2 DNA methylation is a biomarker for the diagnosis of lung cancer in plasma. J Thorac Oncol. 2011;6(10):1632–8.

    Article  PubMed  Google Scholar 

  15. Schmidt B, Liebenberg V, Dietrich D, et al. SHOX2 DNA methylation is a biomarker for the diagnosis of lung cancer based on bronchial aspirates. BMC Cancer. 2010;10:600.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Dietrich D, Kneip C, Raji O, et al. Performance evaluation of the DNA methylation biomarker SHOX2 for the aid in diagnosis of lung cancer based on the analysis of bronchial aspirates. Int J Oncol. 2012;40(3):825–32.

    CAS  PubMed  Google Scholar 

  17. Darwiche K, Zarogoulidis P, Baehner K, et al. Assessment of SHOX2 methylation in EBUS-TBNA specimen improves accuracy in lung cancer staging. Ann Oncol. 2013;24(11):2866–70.

    Article  CAS  PubMed  Google Scholar 

  18. Schmidt B, Beyer J, Dietrich D, Bork I, Liebenberg V, Fleischhacker M. Quantification of cell-free mSHOX2 plasma DNA for therapy monitoring in advanced stage non-small cell (NSCLC) and small-cell lung cancer (SCLC) patients. PLoS ONE. 2015;10(2):e0118195.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  19. Mutoh M, Watanabe K, Kitamura T, et al. Involvement of prostaglandin E receptor subtype EP(4) in colon carcinogenesis. Cancer Res. 2002;62(1):28–32.

    CAS  PubMed  Google Scholar 

  20. Chell SD, Witherden IR, Dobson RR, et al. Increased EP4 receptor expression in colorectal cancer progression promotes cell growth and anchorage independence. Cancer Res. 2006;66(6):3106–13.

    Article  CAS  PubMed  Google Scholar 

  21. Yokoyama U, Iwatsubo K, Umemura M, Fujita T, Ishikawa Y. The prostanoid EP4 receptor and its signaling pathway. Pharmacol Rev. 2013;65(3):1010–52.

    Article  PubMed  CAS  Google Scholar 

  22. Chang SH, Liu CH, Conway R, et al. Role of prostaglandin E2-dependent angiogenic switch in cyclooxygenase 2-induced breast cancer progression. Proc Natl Acad Sci USA. 2004;101(2):591–6.

    Article  CAS  PubMed  Google Scholar 

  23. Weiss G, Schlegel A, Kottwitz D, Konig T, Tetzner R. Validation of the SHOX2/PTGER4 DNA methylation marker panel for plasma-based discrimination between patients with malignant and nonmalignant lung disease. J Thorac Oncol. 2017;12(1):77–84.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Tan F, Jiang Y, Sun N, et al. Identification of isocitrate dehydrogenase 1 as a potential diagnostic and prognostic biomarker for non-small cell lung cancer by proteomic analysis. Mol Cell Proteom. 2012;11(2):M111.008821.

    Article  CAS  Google Scholar 

  25. Sun N, Chen Z, Tan F, et al. Isocitrate dehydrogenase 1 is a novel plasma biomarker for the diagnosis of non-small cell lung cancer. Clin Cancer Res. 2013;19(18):5136–45.

    Article  CAS  PubMed  Google Scholar 

  26. Walters S, Maringe C, Coleman MP, et al. Lung cancer survival and stage at diagnosis in Australia, Canada, Denmark, Norway, Sweden and the UK: a population-based study, 2004-2007. Thorax. 2013;68(6):551–64.

    Article  PubMed  Google Scholar 

  27. Zeng H, Chen W, Zheng R, et al. Changing cancer survival in China during 2003-15: a pooled analysis of 17 population-based cancer registries. Lancet Glob Health. 2018;6(5):e555–67.

    Article  PubMed  Google Scholar 

  28. Thun MJ, Lally CA, Flannery JT, Calle EE, Flanders WD, Heath CW Jr. Cigarette smoking and changes in the histopathology of lung cancer. J Natl Cancer Inst. 1997;89(21):1580–6.

    Article  CAS  PubMed  Google Scholar 

  29. Thun MJ, Carter BD, Feskanich D, et al. 50-year trends in smoking-related mortality in the United States. N Engl J Med. 2013;368(4):351–64.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Song MA, Benowitz NL, Berman M, et al. Cigarette filter ventilation and its relationship to increasing rates of lung adenocarcinoma. J Natl Cancer Inst. 2017.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Services UDoHaH. The health consequences of smoking—50 years of progress: a report of the surgeon general. National Library of Medicine Cataloging in Publication, Atlanta (GA). 2014.

  32. Kersting M, Friedl C, Kraus A, Behn M, Pankow W, Schuermann M. Differential frequencies of p16(INK4a) promoter hypermethylation, p53 mutation, and K-ras mutation in exfoliative material mark the development of lung cancer in symptomatic chronic smokers. J Clin Oncol. 2000;18(18):3221–9.

    Article  CAS  PubMed  Google Scholar 

  33. Tsou JA, Hagen JA, Carpenter CL, Laird-Offringa IA. DNA methylation analysis: a powerful new tool for lung cancer diagnosis. Oncogene. 2002;21(35):5450–61.

    Article  CAS  PubMed  Google Scholar 

  34. Virmani AK, Tsou JA, Siegmund KD, et al. Hierarchical clustering of lung cancer cell lines using DNA methylation markers. Cancer Epidemiol Biomark Prev. 2002;11(3):291–7.

    CAS  Google Scholar 

  35. Feinberg AP, Ohlsson R, Henikoff S. The epigenetic progenitor origin of human cancer. Nat Rev Genet. 2006;7(1):21–33.

    Article  CAS  PubMed  Google Scholar 

  36. Nakajima T, Enomoto S, Ushijima T. DNA methylation: a marker for carcinogen exposure and cancer risk. Environ Health Prev Med. 2008;13(1):8–15.

    Article  CAS  PubMed  Google Scholar 

  37. Sharma S, Kelly TK, Jones PA. Epigenetics in cancer. Carcinogenesis. 2010;31(1):27–36.

    Article  CAS  PubMed  Google Scholar 

  38. Shivapurkar N, Gazdar AF. DNA methylation based biomarkers in non-invasive cancer screening. Curr Mol Med. 2010;10(2):123–32.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Schneider KU, Dietrich D, Fleischhacker M, et al. Correlation of SHOX2 gene amplification and DNA methylation in lung cancer tumors. BMC Cancer. 2011;11:102.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Rosado-de-Christenson ML, Templeton PA, Moran CA. Bronchogenic carcinoma: radiologic-pathologic correlation. Radiographics. 1994;14(2):429–446; quiz 447–428.

    Article  CAS  PubMed  Google Scholar 

  41. Nichols L, Saunders R, Knollmann FD. Causes of death of patients with lung cancer. Arch Pathol Lab Med. 2012;136(12):1552–7.

    Article  PubMed  Google Scholar 

  42. Socinski MA, Obasaju C, Gandara D, et al. Current and emergent therapy options for advanced squamous cell lung cancer. J Thorac Oncol. 2018;13(2):165–83.

    Article  CAS  PubMed  Google Scholar 

  43. Simonato L, Agudo A, Ahrens W, et al. Lung cancer and cigarette smoking in Europe: an update of risk estimates and an assessment of inter-country heterogeneity. Int J Cancer. 2001;91(6):876–87.

    Article  CAS  PubMed  Google Scholar 

  44. Baglietto L, Ponzi E, Haycock P, et al. DNA methylation changes measured in pre-diagnostic peripheral blood samples are associated with smoking and lung cancer risk. Int J Cancer. 2017;140(1):50–61.

    Article  CAS  PubMed  Google Scholar 

  45. Fasanelli F, Baglietto L, Ponzi E, et al. Hypomethylation of smoking-related genes is associated with future lung cancer in four prospective cohorts. Nat Commun. 2015;6:10192.

    Article  CAS  PubMed  Google Scholar 

  46. Molenaar RJ, Maciejewski JP, Wilmink JW, van Noorden CJF. Wild-type and mutated IDH1/2 enzymes and therapy responses. Oncogene. 2018;37(15):1949–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


We especially appreciate the critical support from Mr. Xiaoliang Han, Mr. Zhen Wu and Mrs. Jing Xie.


This work was supported by the National Key Research and Development Program of China (Grant Numbers 2016YFC1303201, 2016YFC0901400), the National Natural Science Foundation of China, CAMS Innovation Fund for Medical Sciences (2016-I2M-1-001, 2017-I2M-1-005), the National Natural Science Foundation of China (81802299, 81502514), the Fundamental Research Funds for the Central Universities (3332018070), and the National Key Basic Research Development Plan (2018YFC1312105).

Author information

Authors and Affiliations



RZ, NS, SG and JH conceived of and designed the study and carried out data interpretation; RZ wrote the manuscript; RZ, XW, RJ, JH, CL, SZ, FZ, QW collected blood samples; Ruochuan Zang and Runsen Jin acquired data and performed the statistical analysis. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Nan Sun, Shugeng Gao or Jie He.

Ethics declarations

Ethics approval and consent to participate

The research was approved by the Institute Research Medical Ethics Committee of the National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital in Chinese Academy of Medical Sciences and Peking Union Medical College in Beijing.

Consent for publication

All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Multivariable logistic regression. Table S2. Comparisons of the 2-marker models in the whole cohort. Table S3. Individual biomarker comparisons in subgroups of the whole cohort. Table S4. Comparisons between methylation DNA biomarkers in the training and validation cohorts. Table S5. Comparisons between the primary 3-marker model and the 2-maker models in the subgroup of non-Ade lung cancers.

Additional file 2: Figure S1–S2.

ROCs of the 2-marker model in the whole cohort.

Additional file 3: Figure S3.

ROCs of the primary 3-marker model and the 2-marker models in the non-Ade subgroup of the whole cohort.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zang, R., Wang, X., Jin, R. et al. Translational value of IDH1 and DNA methylation biomarkers in diagnosing lung cancers: a novel diagnostic panel of stage and histology-specificity. J Transl Med 17, 430 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: