Development and validation of prognostic nomogram for young patients with gastric cancer

Chaoran Yu; Yujie Zhang

doi:10.21037/atm.2019.10.77

Original Article

Development and validation of prognostic nomogram for young patients with gastric cancer

Chaoran Yu^1,2, Yujie Zhang³

¹Fudan University Shanghai Cancer Center, Fudan University, Shanghai 200032, China;²Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China;³Department of Gastrointestinal Surgery, Gastrointestinal Cancer Research Institute, Molecular Medicine Center, Tongji Hospital, Tongji Medical College in Huazhong University of Science and Technology, Wuhan 430030, China

Contributions: (I) Conception and design: All authors; (II) Administrative support: All authors; (III) Provision of study materials or patients: All authors; (IV) Collection and assembly of data: All authors; (V) Data analysis and interpretation: All authors; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

Correspondence to: Chaoran Yu, MD, PhD. Fudan University Shanghai Cancer Center, Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China. Email: chaoran_yu@yeah.net; chaoran_yu@sjtu.edu.cn; Yujie Zhang. Department of Gastrointestinal Surgery, Gastrointestinal Cancer Research Institute, Molecular Medicine Center, Tongji Hospital, Tongji Medical College in Huazhong University of Science and Technology, Wuhan 430030, China. Email: yujiezhang@outlook.com.

Background: This study was to establish nomogram models for prognostic evaluation of early-onset gastric cancer (EOGC) in both overall survival (OS) and cancer-specific survival (CSS).

Methods: EOGC patients from 2004 to 2015 were retrieved from the surveillance, epidemiology and end results (SEER) and further randomly assigned to training and validation sets. Univariate and multivariate cox analysis was used to screen out significant variables for construction of nomogram. Nomogram models were assessed by concordance index (C-index), calibration plot, receiver operating characteristics (ROCs) curve and decision curve analysis (DCA).

Results: A total of 549 EOGC were selected in this process. OS nomogram was constructed based on tumor size and tumor site. CSS nomogram was constructed based on tumor size, SEER stage and tumor site. In training set, C-index for the OS nomogram was 0.688 [95% confidence intervals (95% CI): 0.629–0.747], CSS nomogram 0.785 (95% CI: 0.735–0.835). In the external validation, the C-index for the OS nomogram was 0.633 (95% CI: 0.579–0.687), while for the CSS nomogram 0.733 (95% CI: 0.686–0.780). High quality of calibration plots both in OS and OS nomogram models was noticed. Nomograms displayed a comparable result to tumor-node-metastasis (TNM) stage and SEER stage for EOGC based on DCA.

Conclusions: The nomogram models provided an insightful and applicable tool to evaluate the prognosis of EOGC both in OS and CSS.

Keywords: Gastric cancer (GC); nomogram; overall survival (OS); cancer-specific survival (CSS); early-onset

Submitted Jun 21, 2019. Accepted for publication Aug 26, 2019.

doi: 10.21037/atm.2019.10.77

Introduction

Gastric cancer (GC) remains one of the major malignancies in Asia, particularly in China, Japan and Korea (1-4). Both incidence and mortality of GC have been significantly reduced by solid medical interventions and systematic screening techniques (4). Although curative surgical resection is the primary therapy for most advanced GC cases, exact surgical strategies remain largely controversial between eastern and western countries (5,6). Meanwhile, postoperative recurrence is also one of the major challenges disturbing the outcome of GC. Previously, 194 out of 417 Chinese GC patients underwent curative resection (46.5%) suffered from recurrence (7). In fact, the mean recurrence time was less than 3 years (7).

Early-onset gastric cancer (EOGC) is defined as GC patients with age of young than 45 (8,9). EOGC is featured by diffuse histology, closer association in genetic factors and higher metastatic risk (8,10). In fact, the incidence of EOGC in ages 25 to 39 has been increasing in US whereas the incidence of GC in older group dramatically declines (11).

Rona et al. reported that GC patients aged <45 years had a higher proportion of poorly-differentiated histology, signet-cell type and advanced stage (12). Of note, surgical intervention was not significantly associated with better outcome (12). Meanwhile, Takatsu et al. also reported that lymph node metastasis was identified as a strong recurrence-indicator in young patient group, even they were featured by significantly fewer comorbidities and postoperative complications (13). However, two studies both indicated that no significant difference for stage-specific survival was identified between early-onset and control groups. Moreover, in country specific, US young GC patients were featured by proximal tumor and preoperative chemotherapy while Chinese young GC patients were featured by distal position and more advanced stage (14). However, both OS and disease-specific survival (DSS) was not significantly different between China and US (14).

Nomogram-based clinical modeling has been one of the most widely used statistic methods in clinical investigations. Featured by visual and mathematical advantages, nomogram facilitates the clinical implementation and probability calculation of risk factor or other predictor variables.

Generally, in the real world, the age-specific risk factors correlated to prognosis remain largely unnoticed. In fact, prognostic nomogram for EOGC is yet to be fully developed and validated. Given the comparable unsatisfied prognosis of EOGC, we believe that actual clinical outcome of EOGC patients requires a more solid, specific and statistical-power enhanced clinical model rather a general one for all GC. In fact, surveillance, epidemiology and end results (SEER) program enables the establishment of the nomogram for EOGC with sufficient registered cases.

Based on these premises, an EOGC-specific nomogram model is developed and validated to aid the prognosis evaluation for each EOGC individual using the data retrieved from SEER.

Methods

Data retrieved from SEER

Specific clinicopathological data and prognostic outcome of EOGC patients from 2004 to 2015 were retrieved from the SEER using reference number 14622-Nov2017 (15,16). This study did not require a local ethics approval or a statement. Because all the data used in this study were retrieved from the SEER database with public available approach. The identification of GC was based on the histological code and the cancer staging scheme (version 0204). The inclusion criteria included: (I) age <45 years old; (II) with complete tumor-node-metastasis (TNM) stage information; (III) only one primary tumor lesion (GC); (IV) with surgery performed; (V) complete survival data; (VI) without missing data in SEER cause-specific death classification; (VII) without unknown tumor size; (VIII) without unknown grade and race. All included EOGC were randomly assigned into training set and validation set. The median follow-up length was 20 [0–71] months. The training set included 276 EOGC patients and median follow-up length was 18 [0–71] months. The median follow-up length of validation set was 22 [0–71] months.

Clinical variables of EOGC

Clinical variables included sex, age, race, cancer staging scheme, grade, tumor size, American Joint Committee on Cancer (AJCC) TNM stage, SEER stage, tumor site, SEER cause-specific death classification, survival related information. Overall survival (OS) was the primary endpoint whereas cancer-specific survival (CSS) was the secondary endpoint. Age and tumor size were categorically divided based on the optimal cut-off value generated by X-tile software version 3.6.1 (Yale University School of Medicine, US).

Construction and validation of nomogram model

The Kaplan-Meier method and log-rank test were used for survival analysis while chi-square test was used for the comparison of categorical variables. Univariate cox analysis was used to screen out significant variables (P<0.2) for further multivariate analysis and construction of nomogram. In validation, the concordance index (C-index) was used for the measurement of nomogram between performance and predicted results. Calibration plots were used for the comparison between nomogram-predicted and actual outcome using a 45-degree line as an optimal model. Receiver operating characteristics (ROCs) curve was used for the sensitivity and specificity of nomogram. Furthermore, decision curve analysis (DCA) was used for the threshold probabilities range of nomogram in association with TNM stage and SEER stage. In addition, the nomogram was also compared to the TNM stage, SEER stage in terms of C-index. R software version 3.3.0 (R foundation for Statistical Computing, Vienna, Austria) was used for all analysis. Statistically significant cutoff value was set up as P<0.05. However, P<0.2 was selected as filter value for univariate to multivariate analysis.

Results

Input data from SEER

A total of 549 EOGC were selected in this process, of which 276 were randomly assigned to the training set while 273 cases were into validation set (Figure 1). For all patients, 280 (51.0%) were male and 363 (66.1%) were White. Based on the optimal cutoff value in age (age <33, 33–42, age >42), 309 (56.3%) were between 33–42 years old. Meanwhile, based on the optimal cutoff value in tumor size (tumor size <3.7, 3.7–6.8, >6.8 cm), 229 (41.7%) were less than 3.7 cm. The majority of grade is III (78.7%) while 83.8% were in M0 stage. The most common tumor site for EOGC was gastric antrum (including pylorus) (30.1%), followed by cardia (21.1%) and body of stomach (12.4%). Moreover, 59.4% of all patients were regional in SEER stage classification (Table S1).

Figure 1 Flow diagram of the EOGC patients with training and validation sets. EOGC, early-onset gastric cancer; AJCC, American Joint Committee on Cancer; TNM, tumor-node-metastasis; SEER, surveillance, epidemiology and end results.

Table S1 Patients’ demographics and clinicopathological characteristics
Full table

Construction of nomogram

Tumor size, TNM stage, tumor site and SEER stage were significantly identified in univariate analysis in the training set (Table 1). However, only tumor size and tumor site were identified as significantly associated with OS in multivariate analysis. Next, the OS nomogram was constructed based on these two independent factors (Figure 2A). Moreover, in CSS analysis, tumor size, TNM stage, tumor site and SEER stage were identified by univariate analysis. However, only tumor size, SEER stage and tumor site were significantly identified in multivariate analysis and further subject to a CSS nomogram (Table 2, Figure 2B).

Table 1 Univariate and multivariate analysis of OS in the training set (n=276)
Full table

Figure 2 OS and CSS associated nomograms for EOGC patients. (A) OS nomograms for EOGC in 3- and 5-year; (B) CSS nomograms for EOGC in 3- and 5-year. OS, overall survival; CSS, cancer-specific survival; EOGC, early-onset gastric cancer; SEER, surveillance, epidemiology and end results.

Table 2 Univariate and multivariate analysis of CSS in the training set (n=276)
Full table

Nomogram validation

The OS and CSS nomograms were validated both internally and externally. In the internal validation, the C-index for the OS nomogram was 0.688 (95% CI: 0.629–0.747), while for the CSS nomogram 0.785 (95% CI: 0.735–0.835) (Table S2). In the external validation, the C-index for the OS nomogram was 0.633 (95% CI: 0.579–0.687), while for the CSS nomogram 0.733 (95% CI: 0.686–0.780) (Table S2). Meanwhile, high quality of calibration plots both in OS and OS nomogram models had been identified (Figures 3,4). In addition, high area under ROC curve (AUC) was noticed for both training and validation sets, respectively (Figure S1). The DCA results also indicated that nomograms showed a comparable clinical applicability similar to TNM stage and SEER stage (Figure S2). Specifically, both in training and validation sets, OS nomogram displayed a significantly better performance than TNM stage while CSS nomogram displayed better than SEER stage (Table S2). In addition, the etiology of non-cancer-related death in this study has been displayed. There are six types of non-cancer-related death that have been mentioned in the SEER concerning this study, including (I) accidents and adverse effects; (II) complications of pregnancy, childbirth, puerperium; (III) diseases of heart; (IV) nephritis, nephrotic syndrome and nephrosis; (V) other cause of death (COD); (VI) septicemia (Table S3).

Table S2 Comparison of C-indexes between the nomograms, TNM and SEER stages in EOGC patients
Full table

Figure 3 Calibration plots of OS associated nomograms in both training and validation sets. (A,B) Calibration plots of 3- and 5-year OS in training set; (C,D) calibration plots of 3- and 5-year OS in validation set. OS, overall survival.

Figure 4 Calibration plots of CSS associated nomograms in both training and validation sets. (A,B) Calibration plots of 3- and 5-year CSS in training set; (C,D) calibration plots of 3- and 5-year CSS in validation set. CSS, cancer-specific survival.

Figure S1 ROCs curve for the nomograms. (A) The ROC curve of nomogram with 3-year OS in training set; (B) the ROC curve of nomogram with 5-year OS in training set; (C) the ROC curve of nomogram with 3-year OS in validation set; (D) the ROC curve of nomogram with 5-year OS in validation set; (E) the ROC curve of nomogram with 3-year CSS in training set; (F) the ROC curve of nomogram with 5-year CSS in training set; (G) the ROC curve of nomogram with 3-year CSS in validation set; (H) the ROC curve of nomogram with 5-year CSS in validation set. ROC, receiver operating characteristic; OS, overall survival; CSS, cancer-specific survival; AUC, area under ROC curve; FP, false positive; TP, true positive.

Figure S2 DCA of the nomograms for OS and CSS in both training and validation sets. (A,B) The DCA of nomogram in training set for both OS and CSS; (C,D) the DCA of nomogram in validation set for both OS and CSS. DCA, decision curve analysis; OS, overall survival; CSS, cancer-specific survival; SEER, surveillance, epidemiology and end results; TNM, tumor-node-metastasis.

Table S3 The etiology of non-cancer-related death
Full table

Discussion

This study developed and validated prognostic nomogram models for both OS and CSS EOGC based on the public database SEER. By both internal and external validation, the nomograms used displayed comparably outcome to the TNM stage and SEER stage. The prognostic nomograms could facilitate the clinical prognostic evaluation and personalized treatment.

There are two reasons why this study focused on the nomogram of EOGC. Firstly, given the epidemiological facts, the incidence of EOGC remains largely unsatisfactory, particularly in patients between 25 and 39. However, it remains unclear whether the entire range of EOGC could be of high prognostic risk. Therefore, to develop a more specific nomogram that directly targets EOGC, instead of general patients’ group, could be of greater clinical value.

Secondly, given the fact that older GC patients are characterized by significantly declined incidence as well as potential reduced mortality rate, it is possible that those facts could contribute to the confounding bias of general prognostic indicator, particularly when focusing on the EOGC. In fact, the nomogram models in this study also reflected an individualized therapeutic management.

However, we admit that numerous variables, including sex and race, were not identified as prognostic significant. It is reasonable that in EOGC, potential prognostic significant factors may be diverse from general GC patients.

Moreover, the prognostic nomograms in this study may not display drastic different to older GC patients. However, it is also reasonable that the difference, whether or not existed between the EOGC-nomogram and older GC nomogram, did not lower the power of the nomogram developed in this study.

Of note, the nomogram used in OS included tumor size and tumor site while the nomogram used in CSS included tumor size, tumor site and SEER stage. Given the fact that optimal-cutoff categorized tumor size had been identified as significant independent factors in univariate/multivariate analysis, it is reasonably to presume the potential impact of tumor size in EOGC. In fact, conventional categorization based on 5 and 10 cm did not fully reflect the prognostic significance and clinical implications of tumor size in EOGC. Tumor size could be of great value to demonstrate the OS and CSS prognostic risk in this situation. In fact, based on our result, risk significantly increased in tumor size between 3.7–6.8 cm compared to tumor size <3.7 cm. Moreover, tumor size >6.8 cm demonstrated the highest risk compared to tumor size <3.7 cm. Previously, Saito et al. discovered that large size group (tumor size ≥8 cm) had been identified as an independent prognostic factor with worse prognosis (17). Of note, compared to small size group, large size group was featured by cases with undifferentiated histological type and lymphatic and venous invasion (17). Of note, tumor size >6.8 cm was a strong prognostic indicator both in OS and CSS. It may be potentially correlated with more advanced AJCC stage and occult disseminated tumor cells. Moreover, enhanced tumor cell proliferation and angiogenesis capability may also contribute to the biological features of larger tumors as well as extracellular matrix and stromal. In fact, intrinsic features beneath tumor size remain largely unexplored. Simple cutoff of tumor size, such as >5 or 10 cm, is not sufficient to fully characterize the prognostic and other clinical values. Thus, “bedside to bench” investigation is further warranted.

Up to now, SEER provides the largest sample size across various types of cancers. Moreover, similar nomogram models on several tumors by SEER have been explored (18,19). Although SEER does not fully provide numerous prognostic factors, for examples, surgical details (lymphadenectomy extent, D1, D2 or D2+), it remains one of the widely used datasets for nomogram modeling.

Meanwhile, there remains some limitation. Larger sample sizes are warranted for further independent validation and prognostic stratification analysis. The majority race of this data was from white, therefore could be with potential race heterogeneity.

Conclusions

The nomogram models provided an insightful and applicable tool to evaluate the prognosis of EOGC both in OS and CSS.

Acknowledgments

None.

Footnote

Conflict of Interest: The authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

References

Siegel RL, Miller KD, Jemal A. Cancer statistics, 2016. CA Cancer J Clin 2016;66:7-30. [Crossref] [PubMed]
Chen W, Zheng R, Baade PD, et al. Cancer statistics in China, 2015. CA Cancer J Clin 2016;66:115-32. [Crossref] [PubMed]
Tsugane S, Sasazuki S. Diet and the risk of gastric cancer: review of epidemiological evidence. Gastric Cancer 2007;10:75-83. [Crossref] [PubMed]
Ajani JA, D'Amico TA, Almhanna K, et al. Gastric cancer, version 3.2016, NCCN clinical practice guidelines in oncology. J Natl Compr Canc Netw 2016;14:1286-312. [Crossref] [PubMed]
Songun I, Putter H, Kranenbarg EM, et al. Surgical treatment of gastric cancer: 15-year follow-up results of the randomised nationwide Dutch D1D2 trial. Lancet Oncol 2010;11:439-49. [Crossref] [PubMed]
Ajani JA, Bentrem DJ, Besh S, et al. Gastric cancer, version 2.2013: featured updates to the NCCN guidelines. J Natl Compr Canc Netw 2013;11:531-46. [Crossref] [PubMed]
Kang WM, Meng QB, Yu JC, et al. Factors associated with early recurrence after curative surgery for gastric cancer. World J Gastroenterol 2015;21:5934-40. [Crossref] [PubMed]
Milne AN, Offerhaus GJ. Early-onset gastric cancer: learning lessons from the young. World J Gastrointest Oncol 2010;2:59-64. [Crossref] [PubMed]
Milne AN, Carvalho R, Morsink FM, et al. Early-onset gastric cancers have a different molecular expression profile than conventional gastric cancers. Mod Pathol 2006;19:564-72. [Crossref] [PubMed]
Bautista MC, Jiang SF, Armstrong MA, et al. Impact of age on clinicopathological features and survival of patients with noncardia gastric adenocarcinoma. J Gastric Cancer 2014;14:238-45. [Crossref] [PubMed]
Anderson WF, Camargo MC, Fraumeni JF Jr, et al. Age-specific trends in incidence of noncardia gastric cancer in US adults. JAMA 2010;303:1723-8. [Crossref] [PubMed]
Rona KA, Schwameis K, Zehetner J, et al. Gastric cancer in the young: an advanced disease with poor prognostic features. J Surg Oncol 2017;115:371-5. [Crossref] [PubMed]
Takatsu Y, Hiki N, Nunobe S, et al. Clinicopathological features of gastric cancer in young patients. Gastric Cancer 2016;19:472-8. [Crossref] [PubMed]
Strong VE, Russo A, Yoon SS, et al. Comparison of young patients with gastric cancer in the United States and China. Ann Surg Oncol 2017;24:3964-71. [Crossref] [PubMed]
Hankey BF, Ries LA, Edwards BK. The surveillance, epidemiology, and end results program: a national resource. Cancer Epidemiol Biomarkers Prev 1999;8:1117-21. [PubMed]
Hayat MJ, Howlader N, Reichman ME, et al. Cancer statistics, trends, and multiple primary cancer analyses from the surveillance, epidemiology, and end results (SEER) program. Oncologist 2007;12:20-37. [Crossref] [PubMed]
Saito H, Osaki T, Murakami D, et al. Macroscopic tumor size as a simple prognostic indicator in patients with gastric cancer. Am J Surg 2006;192:296-300. [Crossref] [PubMed]
Song W, Lv CG, Miao DL, et al. Development and validation of a nomogram for predicting survival in patients with gastrointestinal stromal tumours. Eur J Surg Oncol 2018;44:1657-65. [Crossref] [PubMed]
Zhuang Y, Liu Y, Zhao H, et al. Age-related disparity in patients with colorectal cancer: a population-based study from SEER program. Int J Clin Exp Med 2017;10:3305-12.

Cite this article as: Yu C, Zhang Y. Development and validation of prognostic nomogram for young patients with gastric cancer. Ann Transl Med 2019;7(22):641. doi: 10.21037/atm.2019.10.77

Development and validation of prognostic nomogram for young patients with gastric cancer

Introduction

Methods

Data retrieved from SEER

Clinical variables of EOGC

Construction and validation of nomogram model

Results

Input data from SEER

Construction of nomogram

Nomogram validation

Discussion

Conclusions

Acknowledgments

Footnote

References

Article Options

Download Citation

Share