Kun-Zhe Wu,Xiao-Hua Xu,Cui-Ping Zhan,Jing Li,Jin-Lan Jiang
Kun-Zhe Wu,Jin-Lan Jiang,Scientific Research Center,China-Japan Union Hospital of Jilin University,Changchun 130000,Jilin Province,China
Xiao-Hua Xu,Jing Li,Department of Nephrology,China-Japan Union Hospital of Jilin University,Changchun 130000,Jilin Province,China
Cui-Ping Zhan,Department of Ultrasound,China-Japan Union Hospital of Jilin University,Changchun 130000,Jilin Province,China
Abstract BACKGROUND Gastric carcinoma(GC)is one of the most aggressive primary digestive cancers.It has unsatisfactory therapeutic outcomes and is difficult to diagnose early.AIM To identify prognostic biomarkers for GC patients using comprehensive bioinformatics analyses.METHODS Differentially expressed genes(DEGs)were screened using gene expression data from The Cancer Genome Atlas and Gene Expression Omnibus databases for GC.Overlapping DEGs were analyzed using univariate and multivariate Cox regression analyses.A risk score model was then constructed and its prognostic value was validated utilizing an independent Gene Expression Omnibus dataset(GSE15459).Multiple databases were used to analyze each gene in the risk score model.High-risk score-associated pathways and therapeutic small molecule drugs were analyzed and predicted,respectively.RESULTS A total of 95 overlapping DEGs were found and a nine-gene signature(COL8A1,CTHRC1,COL5A2,AADAC,MAMDC2,SERPINE1,MAOA,COL1A2,and FNDC1)was constructed for the GC prognosis prediction.Receiver operating characteristic curve performance in the training dataset(The Cancer Genome Atlas-stomach adenocarcinoma)and validation dataset(GSE15459)demonstrated a robust prognostic value of the risk score model.Multiple database analyses for each gene provided evidence to further understand the nine-gene signature.Gene set enrichment analysis showed that the high-risk group was enriched in multiple cancer-related pathways.Moreover,several new small molecule drugs for potential treatment of GC were identified.CONCLUSION The nine-gene signature-derived risk score allows to predict GC prognosis and might prove useful for guiding therapeutic strategies for GC patients.
Key Words:Gastric carcinoma;Bioinformatic analysis;Prognosis;Overall survival;Differentially expressed genes
Gastric carcinoma(GC)is a lethal digestive malignant tumor that ranks as the fifth most commonly occurring cancer and the third cause of cancer-related death worldwide.In 2018,the global incidence and mortality of GC were estimated at 1033000 and 783000,respectively[1].Despite advances in various therapeutic strategies,the 5-year survival rate for GC is still less than 30% and 70% of patients with GC are usually diagnosed at an advanced stage[2,3].Therefore,it is necessary to search for a multiple-gene signature-derived model for predicting prognosis and accurately identifying anti-cancer targeted therapies to improve the prognostic stratification and personalized therapy for GC patients.
With the popularization and advancement of high throughput sequencing technologies,there is a real possibility of establishing multiple-gene signatures based on data integration and bioinformatics analysis in cancer research.The Gene Expression Omnibus(GEO)database and The Cancer Genome Atlas(TCGA)project provide invaluable resources for researchers worldwide to query gene expression and other functional genomics data[4,5].For example,Zhaoet al[6]constructed a five-gene signature based on data from TCGA databases that accurately predicted GC prognosis.Similarly,an 11-microRNA signature-derived risk score module was demonstrated to effectively predict prognosis in GCviaa comprehensive genome-wide profiling analysis[7].Therefore,it is necessary to identify genes that are significantly correlated with progression in GC patients and to further establish robust multiple-gene signatures,which could provide early diagnosis and optimized therapy for GC patients.
In the current study,GC gene expression data from TCGA and GEO datasets were first evaluated using a comprehensive bioinformatics analysis that filtered out overlapping differentially expressed genes(DEGs).Multivariate Cox regression was applied to construct a nine-gene signature based on these identified DEGs to estimate prognosis and therapeutic outcomes in GC.The high-risk group was verified to be associated with tumor-associated pathways based on the nine-gene signature derived risk score model,which also identified promising small molecule drugs.
The two independent GC microarray datasets GSE54129(containing 111 GC and 21 non-cancerous samples)and GSE26899(including 96 GC and 12 non-cancerous samples)were obtained from the GEO database and normalized using the robust multi-array average method[8].Gene sequencing data and corresponding clinical information containing 375 GC samples and 32 non-cancerous samples were extracted from the TCGA-STAD(stomach adenocarcinoma)database.Subsequently,DEGs were filtrated out from the three-gene expression datasets.A flowchart of this study is showed in Figure 1.
After standardization and log2 transformation of data from the original GEO datasets using the Affy package,the DEGs in GC were compared to normal gastric tissues and analyzed using the limma package in R software(version 3.2.1,https://www.rproject.org/)[9].The |log2FoldChange(log2FC)| ≥ 1 and adjustedPvalue <0.05 were defined as the cut-off criteria for identifying DEGs.In addition,the EdgeR package in R was used to explore DEGs for the RNA-Seq count from the TCGA database[10].Data cut-off criteria were the same as described above.The upregulated/downregulated genes in the TCGA-GC cohort,GSE54129,and GSE26899 were overlapped to identify common and robust DEGs in GC.
Gene ontology functional enrichment analysis was performed to expound potential biological processes,molecular functions,and cellular components for the common DEGs.Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis was performed to explore potential signaling pathways associated with overlapping genes,which might influence GC survival.All of the above analyses were performed by utilizing the online Database for Annotation,Visualization,and Integrated Discovery(DAVID,version 6.8,https://david.ncifcrf.gov/)[11].
To further clarify the relationship between overlapping DEGs and the overall survival(OS)in GC patients,a univariate Cox proportional-hazards regression model in the TCGA-STAD cohort was utilized.Genes with a hazard ratio(HR)<1 or >1 were considered protective or risky,respectively.Subsequently,a multivariate Cox proportional-hazards regression analysis was performed to construct multiple DEG signatures.A risk score model was established using the Formula 1.In this equation,“coefx” represents the regression coefficient of geneXand “Exprx” is the expression value of geneXin the signature.
Gene expression data in the TCGA-STAD cohort were classified into high-and lowrisk groups according to median cutoff of the risk score to evaluate the prognostic value of the risk score model.Survival differences between the two groups were compared using Kaplan-Meier(KM)survival analysis with the log-rank test.Reliability of the risk score model was assessed using the area under curve(AUC)of the receiver operating characteristic(ROC)curve.
Moreover,the reliability and prognostic value were validated using the ROC and Kaplan–Meier curves in an additional dataset GSE15459 containing 200 GC samples from the GEO database to explore whether the nine-gene signature functions as an independent prognostic factor.
The cBioPortal for the Cancer Genomics(http://www.cbioportal.org)database was utilized to verify a connection between genetic alterations and the nine genes.Then,Gene Expression Profiling Interactive Analysis(GEPIA,http://gepia.cancerpku.cn/detail.php)was utilized to explore the expression of the nine genes at the transcriptional and translational levels,respectively.Furthermore,an OS analysis of each gene in patients with GC was analyzed using the KM plotter database(http://kmplot.com/analysis/).
Figure 1 Flow diagram showing study scheme and main procedures.
Gene set enrichment analysis(GSEA)(http://software.broadinstitute.org/gsea)was used to identify the promising signaling pathways for the high-risk group based on the risk score module[12].Pvalue <0.05 and |normalized enrichment score(NES)| >0.65 were utilized to determine which functions to explore further.
The connectivity map(CMap)online database(http://www.broadinstitute.org)allows to investigate the interrelation among small molecule drugs,DNA microarray data,and diseases[13].It was used to predict promising small molecule drugs involved in the overlapping DEGs from the GEO database and TCGA project that might be useful for treatment of GC.
Kaplan–Meier curves and log-rank method were utilized to validate the statistical criteria of observed differences in OS for low-and high-risk GC patients.The univariate and multivariate Cox proportional-hazards regression analyses were performed to estimate prognostic effects of independent genes and potential multiplegene signatures.An ROC curve was used to evaluate the diagnostic performance of the nine-gene signature by calculating the AUC.All statistical analyses were performed using SPSS 20.0(SPSS Inc.,Chicago,IL,United States)and Prism 7.0(GraphPad Software Inc.,La Jolla,CA,United States)software.APvalue <0.05 was considered statistically significant.
Using the cut-off criteria,whereP<0.05 and |log2FC| >1.0,1297 upregulated genes and 1165 downregulated genes in GSE54129,331 upregulated genes and 173 downregulated genes in GSE26889,and 1034 highly expressed genes and 694 lowly expressed genes in the TCGA-STAD cohort were identified(Figure 2A).Furthermore,a total of 95 overlapping DEGs were screened out from the GEO microarray datasets and TCGA-STAD dataset,of which 59 were significantly upregulated and 36 were downregulated(Figure 2B).Hierarchical cluster heatmaps were used to explore DEG details between GC and non-cancerous tissues in each GC dataset(Figure 2C).Detailed information from the GEO datasets is shown in Supplementary Table 1.
Gene Ontology and Genes and Genomes Pathway analyses were used to further elucidate the potential biological function and promising signaling pathways of the overlapping genes in GC.The biological processes analysis indicated that the most genes were enriched during the cellular response to amino acid stimulus,cell chemotaxis,doxorubicin metabolic process,and extracellular matrix organization.The cellular components analysis showed that the genes were enriched in the extracellular space,extracellular region,and proteinaceous extracellular matrix.The molecular functions analysis indicated that the genes were enriched in the extracellular matrix structural constituent,oxidoreductase activity,and protease binding.Biological pathways were mainly enriched with chemical carcinogenesis,focal adhesion,drug metabolism-cytochrome P450,and PI3K/Akt signaling pathways(Figure 3).
To determine promising biomarkers in connection with the prognosis of patients with GC,univariate Cox regression was performed to measure 95 overlapped genes in the TCGA-STAD cohort.A total of 46 genes(P<0.05)were significantly correlated to OS in GC(Supplementary Table 2).These genes were then evaluated using multivariate Cox regression analysis.
Finally,a nine-gene signature(COL8A1,CTHRC1,COL5A2,AADAC,MAMDC2,SERPINE1,MAOA,COL1A2,andFNDC1)was constructed to assess the prognostic risk for each patient as follows:Risk score = βCOL8A1*ECOL8A1 +βCTHRC1*ECTHRC1 + βCOL5A2*ECOL5A2 + βAADAC*EAADAC +βMAMDC2*EMAMDC2 + βSERPINE1*ESERPINE1 + βMAOA*EMAOA +βCOL1A2*ECOL1A2 + ΒFNDC1*EFNDC1(Table 1),where “E” is the expression level of the genes obtained from multivariate Cox regression analysis based on the TCGASTAD dataset.
Subsequently,patients with GC were classified into high-and low-risk groups according to the median risk score of the nine-gene signature.The ROC and KM curves were used to evaluate prognostic capacity of the nine-gene signature in GC.The AUC reached 0.751,suggesting that this nine-gene signature was relatively sensitive and specific in prognostic prediction for GC patients(Figure 4A).Moreover,results of the Kaplan–Meier curve for the two collectives indicated that patients in the low-risk group had a better OS than those in the high-risk group(P<0.001;Figure 4B).Taken together,the results demonstrated that the nine-gene signature-derived risk score was significantly different for prognosis and OS between the two groups.
To validate the repeatability and robustness of the nine-gene risk signature,an independent dataset GSE15459 was used as an external validation with the same formula.Patients in the dataset GSE15459 were divided into a high-or low-risk group with the same cutoff value as the training cohort.AUC for the nine-gene signature was calculated to be 0.682,which indicated that the model had a good prognostic capability for the survival of patients with GC in the testing collective(Figure 4C).Furthermore,in accordance with the training dataset,patients in the high-risk group had a significantly shorter OS than those in the low-risk group(P= 0.011;Figure 4D).These data further showed that the nine-gene signature could predict the prognosis of patients with GC.
Genetic alterations in the nine genes were analyzed by exploring 375 GC samples in the cBioPortal database.The results indicated that 158(44%)samples had genetic alterations in the nine genes.Sequence mutations for each gene are shown in Figure 5A.Furthermore,expression levels for the nine genes were significantly different(COL8A1,CTHRC1,COL5A2,SERPINE1,COL1A2,andFNDC1were upregulated andAADAC,MAOA,andMAMDC2were downregulated)in GC tumor tissues compared to non-cancerous tissues based on the GEPIA database(Figure 5B).KM plotter was used to study the prognostic performance of each gene in GC.The results identified that highCOL8A1,CTHRC1,COL5A2,SERPINE1,COL1A2,MAMDC2,andFNDC1expression was related to a worse prognosis,while highMAOA,andAADACexpression was related to a better prognosis in GC patients(Figure 6).
Table 1 Nine prognosis-associated genes for establishing the risk score system
GSEA analysis was performed to explore potential signaling pathways associated with the high-risk group based on the nine-gene signature-derived risk score.The cut-off value was set atP<0.05 and |enrichment score(ES)| >0.65.Results showed that multiple tumor-associated pathways,such as angiogenesis,epithelial-mesenchymal transition,hedgehog signaling,Kirsten rat sarcoma viral oncogene homologue signaling,Notch signaling,and transforming growth factor(TGF)-β signaling,were enriched in the high-risk group GC patients(Figure 7).
The nine-gene signature was further analyzed in the CMap database to predict potential small molecule drugs for GC.Ten small molecule drugs were revealed using the high connectivity score andPvalue <0.05(Table 2).A total of nine small molecule candidates were negatively correlated.The 3D conformers for the top five most significant candidates are shown in Supplementary Figure 1.All findings indicated that these drugs had potential therapeutic applications in GC.
Despite considerable development in the arena of various therapeutic GC strategies,including surgery,radiotherapy,chemotherapy,and targeted precise treatment,the OS of advanced GC patients has remained poor and the therapeutic effect is often unsatisfying.The current prognostic model established based on clinical prognostic factors,such as age,TNM stage,and pathology grade,is a routine predictive model for GC.However,because of the high GC heterogeneity,a conventional prognostic model cannot accurately predict the outcomes for GC patients.Multiple-gene assays,by contrast,are of great importance for precision medicine for GC patients[14].Therefore,exploring potential molecular mechanisms and effective therapeutic targets is important for GC therapy and prevention.This study performed an integrated bioinformatics analysis to establish a nine-gene risk score model associated with prognosis and treatment response in GC patients.The high-risk group was identified to relate to tumor-associated signaling pathways based on the nine-gene signaturederived risk score and several novel small molecule drugs were discovered for potential GC treatment.
The initial step in this study was to identify the DEGs in GC using analysis of gene expression data from the TCGA and GEO datasets.These results showed that a total of 95 overlapping DEGs were identified compared to normal gastric tissue.The common genes were further evaluated using functional enrichment analyses.The results indicated that common DEGs play a crucial important role in cancerous development.Subsequently,univariate and multivariate Cox regression analyses were performed to explore the relationship between DEGs and GC survival.A total of six genes(COL8A1,CTHRC1,COL5A2,SERPINE1,COL1A2,andFNDC1)were upregulated in GC and inversely correlated with OS(β >0,HR >1),whereas three genes(AADAC,MAMDC2,andMAOA)were downregulated and positively correlated with survival(β <0,HR <1).A novel multi-gene signature-derived risk score model was constructed using these nine DEGs.A comprehensive examination of the nine-gene signature prognostic value in the training(TCGA-STAD)and testing(GSE15459)datasets was carried out.ROC curve and Kaplan–Meier analysis performance in the training and validation datasets underscored the robust prognostic value of the risk score model.
Table 2 Results of connectivity map analysis
Among these nine genes,COL5A2,COL1A2,andCOL8A1are members of the collagen family,which is the main structural component of the extracellular matrix in tumors.COL5A2encodes alpha 2 chain in type V collagen and is aberrantly expressed in ductal cancerin situand invasive ductal cancer.It promotes the progression of cancerin situto invasive cancer.Consistent with the present study,Haoet al[15]identifiedCOL5A2as a key gene in GC that serves as an oncogene associated with poor OS.Similarly,COL1A2was reported to inhibit GC cell apoptosis and promote GC cell proliferation,invasion,and migrationviathe PI3k/Akt signaling pathway[16].COL8A1was found to be upregulated and relevant to the poor clinical outcomes in multiple carcinomas,such as colon adenocarcinoma and bladder cancer[17,18].Its regulatory mechanism in GC remains unclear.CTHRC1 is a major glycosylated protein that has been demonstrated to be associated with cell proliferation,metastasis,and invasionviapromoter demethylation and TGF-β1.It is also an independent prognostic predictor[19,20].SERPINE1,also known asPAI-1,can increase GC metastasis and promote peritoneal tumor growth and formation of bloody ascites in a mouse model of GC metastasis,which serves as an important prognostic gene in GC[21,22].FDNC1is a principal component of the fibronectin structural domain that accelerates GC cell proliferation,differentiation,and metastasisviathe epithelial-mesenchymal mechanism pathway.It also plays an important role in carcinogenesis in multiple cancers[23-25].AADACis a microsomal serine esterase that mainly exists in the liver and gastrointestinal tract.Its main function is involved in drug hydrolysis,as well as triglyceride metabolism and Gilles de la Tourette syndrome[26-28].Liuet al[17]analyzed the prognostic genes in GC using bioinformatics analysis and found thatADACCis a significant tumor suppressor gene.MAOAdegrades monoamine neurotransmitters and dietary amines and was also deemed to be a tumor suppressor in liver cancer[29],pancreatic cancer[30],and cholangiocarcinoma[31]and a tumor promoter in prostate carcinoma[32],breast carcinoma[33],and non-small cell lung carcinoma[34].Its role in GC progression is poorly understood.There are few studies on the role ofMAMDC2in tumors.A meta-analysis study indicated that down-regulatedMAMDC2was related to a poor disease-free survival in breast carcinoma[35].Another study reported that miR-196a promotes head and neck squamous cell cancer migration,invasion,and adhesion to fibronectinvia MAMDC2[36].The multiple database analysis of physiological and pathological functions for each gene in GC has provided important evidence helping to understand the prognostic and predictive capacity of the ninegene signature.
Figure 2 Differentially expressed genes between gastric carcinoma and normal gastric tissues.A:Volcano plots visualizing the differentially expressed genes(DEGs)between gastric carcinoma and non-cancerous tissues in GSE54129,GSE26899,and The Cancer Genome Atlas datasets.Red dots represent significantly up-regulated genes;green dots represent significantly down-regulated genes;black dots represent non-differentially expressed genes.P <0.05 and |log2 FC| >1.0 were considered significant;B:Venn diagrams showing the upregulated overlapped DEGs(left)and downregulated overlapped DEGs(right)in three datasets;C:Heatmaps of the common genes in GSE54129,GSE26899,and The Cancer Genome Atlas datasets(top 50).The common genes include 59 upregulated genes and 36 downregulated genes.Each row represents the expression level of a gene,and each column represents a sample:Red for gastric carcinoma and blue for non-cancerous samples.
GSEA analysis was utilized in order to provide a deeper insight into the molecular mechanisms for prognosis prediction of the nine-gene signature.Multiple cancerassociated signaling pathways were highlighted as a result,including angiogenesis,epithelial-mesenchymal transition,hedgehog signaling,Kirsten rat sarcoma viral oncogene homologue signaling,Notch signaling,and TGF-β signaling,which suggested that the nine-gene signature has predictive ability for prognosis and can reveal potential therapeutic targets in GC.
Therefore,the CMap database was utilized to explore promising small molecule drugs that have effective treatment response against GC.Levomepromazine,which belongs to antihistaminic compounds,is mainly used for treating breast cancer by binding to the translationally controlled tumor protein and induction of cell differentiation[37].Lasalocid is a carboxylic ionophore antibiotic produced by Streptomyces lasaliensis that is recognized as a choice for prostate cancer therapy because it increases cytotoxic apoptosis and cytoprotective autophagy[38].Trichostatin A is a histone deacetylase inhibitor that inhibits proliferation,migration,and invasion of GC cells and shows a good therapeutic effect in GC patients[39,40].Similarly,vorinostat is a histone deacetylase inhibitor approved by the United States Food and Drug Administration for cutaneous T-cell lymphoma.It is a promising therapeutic candidate in GC when combined with chemotherapeutic agents[41].Therefore,the present study suggested that these small molecule drugs could serve as novel therapeutic strategies for the high-risk GC group with a poor prognostic response.
One study reported that six genes related to GC prognosis based on DNA microarray data of 65 patients successfully prognosticated relapse in GC patients[42].A recent study identified a three-gene signature,which could predict GC survival using DNA microarray data of 129 GC patients[43].These studies were limited due to the small sample size or lack of suitable verification datasets,which limits the possibility of clinical application of the genes related to the GC prognosis.In the present study,a novel nine-gene signature was identified by examining the gene expression profile of 582 GC patients,which had a robustly effective prognostic capacity in GC.
However,this study includes some limitations.First,because the main sources of data in this study were downloaded from public databases that are constructed using available retrospective data,it is necessary to assess the probable utilization of molecular signatures for prognosis evaluation.Second,further studies including a greater number of GC patients are needed to verify the efficiency of the nine-gene signature in GC patients.A greater number of normal samples should also be included in the differential expression analyses.Moreover,multivariate Cox regression analysis was performed to obtain the expression level of multiple genes.More clinical events will be included to verify the prognosis effect of the nine-gene signature in further studies.
In conclusion,using a series of comprehensive bioinformatics analyses and validations,a novel nine-gene signature was constructed.The signature-derived risk score model had a robust prognostic capacity and therapeutic response in GC.Several small molecule drugs were identified to serve as potential therapeutic candidates for GC using bioinformatics.Further experimental studies are necessary to validate these findings and to elucidate the mechanisms for GC-related signaling pathways.
Figure 3 Functional analysis of differentially expressed genes.Significantly enriched gene ontology biological processes of differentially expressed genes in gastric carcinoma are shown.A:Biological Process;B:Cellular Components;C:Molecular Function;and D:Significantly Enriched Kyoto Encyclopedia of Genes and Genomes Pathways of DEGs in Gastric carcinoma.KEGG:Kyoto Encyclopedia of Genes and Genomes.
Figure 4 Performance of the risk score model in the training(the Cancer Genome Atlas-stomach adenocarcinoma)and validation(GSE15459)datasets.A:Receiving operating characteristic curve of the nine-gene signature in the training dataset(area under the curve = 0.751);B:Kaplan–Meier survival curve for gastric carcinoma patients in the training dataset(P <0.001).C:Receiving operating characteristic curve of the nine-gene signature in the validation datasets(area under the curve = 0.682);D:Kaplan–Meier survival curve for gastric carcinoma patients in the validation dataset(P = 0.011).The blue curve represents low risk score group.The red curve represents high risk score group.AUC:Area under the curve.
Figure 5 Genetic alterations and expression of the nine prognostic genes in gastric carcinoma.A:Alteration proportion for the nine genes in 375 gastric carcinoma samples in the cBioPortal database;B:Gene expression levels of the nine genes between gastric carcinoma and normal gastric tissues in Gene Expression Profiling Interactive Analysis database.Red represents P <0.05.
Figure 6 Prognostic value of the nine prognostic genes in gastric carcinoma.A:COL8A1(aP <0.001);B:CTHRC1(bP <0.001),C:COL5A2(cP =0.004);D:AADAC(dP = 0.004);E:MAMDC2(eP <0.001);F:SERPINE1(fP <0.001);G:MAOA(gP = 0.003);H:COL1A2(hP <0.001);I:MAOA(iP <0.001).
Figure 7 Promising signaling pathways identified by Gene Set Enrichment Analysis.Only six of the most significant tumor-associated pathways enriched in high-risk group gastric carcinoma patients are listed.A:Hedgehog signaling;B:Kirsten rat sarcoma viral oncogene homologue signaling;C:Notch signaling;D:TGF-β signaling;E:Angiogenesis;F:Epithelial-mesenchymal transition.
With the popularization and advancement of high throughput sequencing technologies,there is a real possibility of establishing multiple-gene signatures based on data integration and bioinformatics analysis in cancer research.The present study aimed to identify prognostic biomarkers for gastric carcinoma(GC)patients using comprehensive bioinformatics analyses.
GC is one of the most aggressive primary digestive tumors.It has unsatisfactory therapeutic outcomes and is difficult to diagnose early.Therefore,it is necessary to search for a multiple-gene signature-derived model for predicting prognosis and accurately identifying anti-cancer targeted therapies to improve the prognostic stratification and personalized therapy for GC patients.
We aimed to explore the potential multiple-gene prognostic biomarkers and effective therapeutic targets for GC.In this study,we performed integrated bioinformatics analysis to establish a nine-gene risk score model(COL8A1,CTHRC1,COL5A2,SERPINE1,COL1A2,FNDC1 AADAC,MAOA,andMAMDC2)associated with prognosis and treatment response in GC patients.The nine-gene signature-derived risk score allows to predict GC prognosis and might prove useful for guiding therapeutic strategies for GC patients.
Differentially expressed genes(DEGs)were screened using gene expression data from The Cancer Genome Atlas and GEO databases for GC.Overlapping DEGs were analyzed using univariate and multivariate Cox regression analyses.A risk score model was then constructed and signature prognostic values were validated utilizing an independent GEO dataset(GSE15459).CBioPortal,GEPIA,and KM-plotter databases were used to analyze each gene in the risk score model.Gene set enrichment analysis and the connectivity map database were used to predict high-risk scoreassociated pathways and therapeutic small molecule drugs,respectively.
A total of 95 overlapping DEGs were found and a nine-gene signature(COL8A1,CTHRC1,COL5A2,AADAC,MAMDC2,SERPINE1,MAOA,COL1A2,andFNDC1)was constructed for the GC prognosis prediction.Receiver operating characteristic curve performance in the training dataset(The Cancer Genome Atlas-stomach adenocarcinoma)and validation dataset(GSE15459)demonstrated a robust prognostic value of the risk score model.Multiple database analyses for each gene provided evidence to further understand the nine-gene signature.Gene set enrichment analysis showed that the high-risk group was enriched in multiple cancer-related pathways.Moreover,several new small molecule drugs for potential treatment of GC were identified.
Using a series of comprehensive bioinformatics analyses and validations,a novel ninegene signature was constructed.The signature-derived risk score model had a robust prognostic capacity and therapeutic response in GC.Several small molecule drugs were identified to serve as potential therapeutic candidates for GC using bioinformatics.Further experimental studies are necessary to validate these findings and to elucidate the mechanisms for GC-related signaling pathways.
Multiple-gene assays are of great importance for precision medicine of GC patients.To further verify the prognostic capacity of the nine-gene signature,our future study may pay more attention to exploring the potential regulatory mechanisms how the ninegene signature affects the development of GC.
World Journal of Gastrointestinal Oncology2020年9期