Scoring mechanisms of p16INK4a immunohistochemistry based on either independent nucleic stain or mixed cytoplasmic with nucleic expression can significantly signal to distinguish between endocervical and endometrial adenocarcinomas in a tissue microarray study

Background Endocervical adenocarcinomas (ECAs) and endometrial adenocarcinomas (EMAs) are malignancies that affect uterus; however, their biological behaviors are quite different. This distinction has clinical significance, because the appropriate therapy may depend on the site of tumor origin. The purpose of this study is to evaluate 3 different scoring mechanisms of p16INK4a immunohistochemical (IHC) staining in distinguishing between primary ECAs and EMAs. Methods A tissue microarray (TMA) was constructed using formalin-fixed, paraffin-embedded tissue from hysterectomy specimens, including 14 ECAs and 24 EMAs. Tissue array sections were immunostained with a commercially available antibody of p16INK4a. Avidin-biotin complex (ABC) method was used for antigens visualization. The staining intensity and area extent of the IHC reactions was evaluated using the semi-quantitative scoring system. The 3 scoring methods were defined on the bases of the following: (1) independent cytoplasmic staining alone (Method C), (2) independent nucleic staining alone (Method N), and (3) mean of the sum of cytoplasmic score plus nucleic score (Method Mean of C plus N). Results Of the 3 scoring mechanisms for p16INK4a expression, Method N and Method Mean of C plus N showed significant (p-values < 0.05), but Method C showed non-significant (p = 0.245) frequency differences between ECAs and EMAs. In addition, Method Mean of C plus N had the highest overall accuracy rate (81.6%) for diagnostic distinction among these 3 scoring methods. Conclusion According to the data characteristics and test effectiveness in this study, Method N and Method Mean of C plus N can significantly signal to distinguish between ECAs and EMAs; while Method C cannot do. Method Mean of C plus N is the most promising and favorable means among the three scoring mechanisms.


Conclusion:
According to the data characteristics and test effectiveness in this study, Method N and Method Mean of C plus N can significantly signal to distinguish between ECAs and EMAs; while Method C cannot do. Method Mean of C plus N is the most promising and favorable means among the three scoring mechanisms.

Background
The histomorphologic overlap of ECA and EMA can make differentiation difficult on H&E in small pre-operative biopsy or curetting specimens. Ascertaining the site of cancer origin may be difficult, but plays an important role in guiding treatment. For the EMA, staging is surgical; however, for the primary ECA, staging is clinical. Treatment protocols may differ substantially between both of them. [1][2][3] Previous studies have shown that certain immunohistochemical markers may be helpful in distinguishing between ECAs and EMAs. A traditional 3-marker panel (ER/Vim/CEA) has previously been proposed to make the distinction. A positive ER, Vim and a negative CEA result indicates an EMA; a negative ER, Vim and positive CEA result indicates an ECA. There are, however, many unexpected aberrant immunoexpressions not characteristic of either primary ECAs or EMAs. No study has identified one marker that clearly and consistently makes this distinction in all cases. [4][5][6][7][8] Recent study has focused on other markers, such as p16 INK4a , which may express in different intensities, staining patterns and subcellular localizations in various malignancies and tissues. It is also reported that ECAs tends to be positively and diffusely expressed by p16 INK4a , whereas EMAs tends to be negatively or focally expressed by p16 INK4a in routine whole-sectioned tissue slides. [9][10][11][12][13] To date, there is not yet consensus to define the optimal scoring methods of p16 INK4a immunoexpression in various tissue samples, especially in those small sizes of pre-operative biopsy or curetting specimens of endocervix or endometrium. In this study, our objective was to propose the appropriately scoring methods and to report that these methods can be easily applied to p16 INK4a immunohistochemistry (IHC) as a diagnostic adjunct in distinguishing between ECAs and EMAs. [14][15][16][17][18]

Study materials
The study material consisted of slides and selected formalin-fixed, paraffin-embedded tissue blocks from 38 hysterectomy specimens retrieved from the archives of the Tissue Bank, Clinical Trial Center, Chung-Shan Medical University Hospital. These specimens of known origin, endocervix or endometrium, were accessioned between 2004 and 2008. The cases studied included EMAs (n = 24), as well as ECAs (n = 14). Two board-certified pathologists (CP Han and LF Kok) reviewed all H&E stained slides for these cases. A slide with tumor representative was selected and circled from each case. In the next step, the area corresponding to the selected area on the slide was also circled on the block with an oil marker pen. All these donors' tissue blocks were sent to the Biochiefdom International Co. LTD, Taiwan for tissue microarray construction. They were cored with a 1.5 mm diameter needle and transferred to a recipient paraffin block. The recipient block was sectioned at 5 um, and transferred to silanized glass slides.

Immunohistochemical staining
Using the Avidin-Biotin Complex (ABC) technique, immunohistochemistry and antigen retrieval methods were applied in the same manner as described in previous literature. [17] Briefly, all the 1.5 mm and 5 um cores of tissue array specimens embedded in paraffin slice on coated slides, were washed in xylene to remove the paraffin, rehydrated through serial dilutions of alcohol, followed by washings with a solution of PBS (pH 7.2). All subsequent washes were buffered via the same protocol. Treated sections were then placed in a citrate buffer (pH 6.0) and heated in a microwave for two 5-minute sessions. The samples were then incubated with a monoclonal antimouse p16 INK4a antibody (F12, sc-1661, Santa Cruz, 1:200 dilution) for 60 minutes at 25°C. The conventional streptavidin peroxidase method (DAKO, LSAB Kit K675, Copenhagen, Denmark) was performed for signal development and the cells were counter-stained with hematoxylin. Negative controls were obtained by excluding the primary antibody, and positive controls were simultaneously obtained by staining tissues of squamous cell carcinoma of uterine cervix. This slide was mounted with gum for examination and capture by the Olympus BX51 microscopic/DP71 Digital Camera System for study comparison.

Statistical analysis
The threshold for differentiating between final positive and negative immunostaining was set at 4 for interpretation. This optimal cut-off value was determined by using the receiver operating characteristic (ROC) curve analysis (Metz, 1978;Zweig & Campbell, 1993) in this study. [24,25] Score of 4 points or greater was considered positive for p16 INK4a expression. A negative stain was classified as having an immunostaining score of 0 to 3 (essentially negative) and indicated a diagnosis of an EMA; whereas a positive stain was classified as having an immunostaining score of 4 to 12 (at least moderately positive in at least 11-50% of cells) and indicated a diagnosis of an ECA. A chi-squared or Fisher's exact test was performed to test the frequency difference of p16 INK4a immunostaining (positive vs. negative) between groups of two primary adenocarcinomas (ECAs vs EMAs). A nonparametric analysis of Mann-Whitney U-test was used to test the immunostaining raw scores between the two adenocarcinomas, given the fact that the analytical IHC scores were not normally distributed. In addition, we also examined associations among the 3 different scoring mechanisms, based on the subcellular localizations of p16 INK4a expression, including (1) Method C, (2) Method N, and (3) Method Mean of C plus N. The nonparametric Spearman's rho correlation coefficient was used to analyze associations between pairs of these three types of p16 INK4a scores. Data were analyzed using standard statistical software (SPSS, Inc., Chicago, IL). All tests were 2-sided and the significance level was 0.05.
To evaluate and compare the patterns of p16 INK4a expression in making a diagnostic distinction of primary ECAs from primary EMAs, the sensitivity, specificity, accuracy, and the positive and negative predictive values (PPV and NPV respectively) were compared and displayed. Sensitivity is defined as the probability of positive p16 INK4a stain in primary ECAs. Specificity is, on the other hand, defined, as the probability of negative p16 INK4a stain in primary EMAs. 18 Overall accuracy is the proportion of true diagnosis of ECAs and EMAs in total number of p16 INK4a scoring tests. Positive predictive value is the probability that a patient with a positive p16 INK4a expression has a primary adenocarcinoma of endocervical origin. Negative predictive value is the probability that a person with a negative p16 INK4a expression has a primary adenocarcinoma of endometrial origin. [26] In order to assess whether the test results were statistically different from each other based on correct diagnosis, McNemar's test was performed. A p-value < 0.05 was considered significant.

Results
For evaluation of p16 INK4a immunohistochemistry, nucleic and cytoplasmic stains were taken into account separately as well as collectively for all cases. H&E ( Figure  1a and Figure Table 1. By using score of 4 as a cut-off point, except for Method C, the other two scoring mechanisms based on N, and Mean of C plus N, showed significant frequency differences between immunostaining (positive vs. negative) in tissues from the two adenocarcinomas (ECA vs. EMA) in origin. Individually, (1) Method C stained positive in 5 out of 14 (35.7%) ECA tumors and 4 out of 24 (16.7%) stained positive in EMA tumors (p = 0.245), with median staining score and range of 2 (0-12) and 2 (0-12), respectively (p = 0.152); (2) Method N stained positive in 11 out of 14 (78.6%) ECA tumors and 7 out of 24 (29.2%) stained positive in EMA tumors (p < 0.001), with median staining score and range of 5 (2-12) and 2 (0-9), respectively (p < 0.001); (3) Method Mean of C plus N stained positively in 10 out of 14 (71.4%) ECA tumors and 3 out of 24 (12.5%) stained positively in EMA tumors (p < 0.001), with median staining score and range of 4.25 (2-12) and 2 (0-10.5), respectively (p < 0.001). In summary, Method C did not show statistically significant, whereas Method N and Method Mean of C plus N revealed statistically significant fre-quency differences (p < 0.05) in distinguishing between ECAs and EMAs.
The associations between these three scoring methods in ECAs and EMAs were also explored and shown in ECAs and EMAs. Table 2 shows the diagnostic performance of these three different scoring mechanisms for measuring p16 INK4a expression in distinguishing 14 ECAs from 24 EMAs. (1) When using Method C, the sensitivity of positively stained ECAs was 35.7% (5/14) and PPV was 55.6%, whereas the specificity of negatively stained EMAs was 83.3% (20/24) and NPV was 69.0%. The overall accuracy rate was 65.8%. (2) When using Method N, the sensitivity was 78.6% (11/14) and PPV was 61.1%, whereas the specificity was 70.8% (17/24) and NPV was 85%. The overall accuracy rate was 71.4%. (3) When using Method Mean of C plus N, the sensitivity was 71.4% (10/14) and PPV was 76.9%, whereas the specificity (21/24) was 87.5% and NPV was 84.0%. The overall accuracy rate was 81.6%, the highest among the three scoring methods (table 2). Furthermore, the 95% confidence intervals (CIs) of these performance parameters were calculated and provided in Table 2 for these three scoring methods. It was clearly that the 95% CI of the sensitivity value for Method C did not overlap with Method N, and Method Mean of C plus N as well. This implied that the performances of sensitivities were different among these three scoring methods especially for Method C. To confirm this finding, McNemar's test was further used to compare the Immunohistochemical analysis of p16 INK4a staining in endometrial adenocarcinomas

Discussion
Distinguishing between ECAs and EMAs before planning the patient treatment is clinically important. When the tumor involves both the uterine endometrium and the endocervix, it becomes difficult to distinguish the primary site of the tumor during preoperative assessment of the limited sizes of biopsy or curetting specimens. In this  • The "a" is Chi-squared test with continuity correction or Fisher exact test, the "b" is Mann-Whitney U test using exact significant.
•Using score 4 points as a cutoff, the immunostains are defined "negative" for scores from 0 to 3, and "positive" for scores from 4 to 12 points.
Scatter plots showing the Spearman's rho correlation coefficients (p value) for the associations between pairs of these three types of p16 INK4a scoring mechanisms in endocervical adenocarcinomas and endometrial adenocarcinomas study, we evaluate various p16 INK4a expression patterns in both ECAs and EMAs. We also investigate the most appropriate and effective p16 INK4a IHC scoring methods in distinguishing these two types of gynecologic cancers in Taiwanese women. Our valuable domestic data can be extrapolated to women in general and will be helpful in referring and managing such cases worldwide.
The p16 INK4a (cyclin-dependent kinase inhibitor 4) is a tumor suppressor protein that binds to cyclin-cdk4/6 complexes, which blocks kinase activity and inhibits progression to the S phase of the cell cycle in the nucleus. [10,15,[27][28][29][30][31][32][33][34][35] However, interpretation of IHC data of p16 INK4a staining results is complicated because of its unclear biological significance of cytoplasmic staining and lack of universal accepted algorithm in scoring methodology. Cytoplasmic reactivity is often regarded as unexpected, unspecific event. [36] Some consider only nucleic p16 INK4a labeling in tumor cells to be positive and ignore cytoplasmic staining. [16,32] Others state that both nucleic and cytoplasmic immunoreactivities in tumor cells are characteristic and are indeed due to p16 INK4a expression. [24][25][26][27][28] It has also been reported that strong cytoplasmic staining in mammary carcinomas is associated with negative prognostic factors, such as low differentiation, p53, Ki-67 labeling etc. We have learned that despite nucleic expression, p53 tumor suppressor protein is localized on cell cytoplasm, where it is regarded as a way of functional inactivation. [29][30][31]  There are a variety of IHC scoring methods including computer-based plans in literatures, and still seems to be no generally accepted protocols in research laboratories and clinical practices for rating and scoring the immunostaining results. Comparing commercially derived computerbased programs with the conventional analyses by pathologists, there are still lacks of optimized and standardized IHC scoring algorithms. As a result, the objective accuracy did not significantly improve clinical outcome measures. [33][34][35][36] There are also various quantitative scoring mechanisms of p16 McCluggage WG (2003) and Mittal K (2007) stated that a diffuse, strong staining pattern of p16 INK4a , involving nearly all tumor cells tends to be an ECA, whereas, focal, patchy staining pattern of p16 INK4a involving 0-50% of cells tends to be an EMA in routine whole-sectioned tissue slides. [15,17] We did not use the patchy or diffuse pattern of p16 INK4a IHC staining as a diagnostically distinctive criterion between ECAs and EMAs in this TMA study, because we think that cases with primary EMA may seem to over-express p16 INK4a beyond the limited 1.5 mm core area and therefore mimic a diffuse pattern of ECA primary. Instead, we preferred to use the semi-quantitative scoring system in considering the 0-3 points of staining intensity and 0-4 points of staining area extent by multiplying both, yielding a range of score 0 to 12 points. We then divided the results by an appropriate cut-off threshold of 4 to a two-tier of negative (0-3 points) or positive (4-12 points) for interpretation. The mixed cytoplasmic and nucleic stains with varying degrees of intensity and

Conclusion
Although careful gross and histologic examinations usually allows a confidant distinction between ECAs and EMAs, diagnostic dilemma may occur when tumor is localized in both endometrial and endocervical biopsies, histomorphologic overlaps and preoperative imaging studies may also confuse in establishing the site of origin. True diagnosis may require the assistance of ancillary IHC stains. The p16 INK4a marker tends to be positively and diffusely expressed in ECAs, but tends to be negatively or focally expressed in EMAs. However, there is still a lack of optimized consensus or standard for p16 INK4a IHC scoring mechanisms. According to the scientific results in this study, we found that Method Mean of C plus N and Method N can help to distinguish between ECAs and EMAs, but Method C is of no use to do so. Based on the data characteristics and test effectiveness, Method Mean of C plus N is the most promising score-calculating means among the three p16 INK4a IHC scoring mechanisms in diagnostic distinction between these two gynecologic malignancies (ECAs vs. EMAs).