METHODS: We conducted a gene-environment interaction (GxE) analysis including 8,255 cases and 11,900 controls from four pancreatic cancer genome-wide association study (GWAS) datasets (Pancreatic Cancer Cohort Consortium I-III and Pancreatic Cancer Case Control Consortium). Obesity (body mass index ≥30 kg/m2) and diabetes (duration ≥3 years) were the environmental variables of interest. Approximately 870,000 SNPs (minor allele frequency ≥0.005, genotyped in at least one dataset) were analyzed. Case-control (CC), case-only (CO), and joint-effect test methods were used for SNP-level GxE analysis. As a complementary approach, gene-based GxE analysis was also performed. Age, sex, study site, and principal components accounting for population substructure were included as covariates. Meta-analysis was applied to combine individual GWAS summary statistics.
RESULTS: No genome-wide significant interactions (departures from a log-additive odds model) with diabetes or obesity were detected at the SNP level by the CC or CO approaches. The joint-effect test detected numerous genome-wide significant GxE signals in the GWAS main effects top hit regions, but the significance diminished after adjusting for the GWAS top hits. In the gene-based analysis, a significant interaction of diabetes with variants in the FAM63A (family with sequence similarity 63 member A) gene (significance threshold P < 1.25 × 10-6) was observed in the meta-analysis (P GxE = 1.2 ×10-6, P Joint = 4.2 ×10-7).
CONCLUSIONS: This analysis did not find significant GxE interactions at the SNP level but found one significant interaction with diabetes at the gene level. A larger sample size might unveil additional genetic factors via GxE scans.
IMPACT: This study may contribute to discovering the mechanism of diabetes-associated pancreatic cancer.
METHODS: We conducted a large agnostic pathway-based meta-analysis of GWAS data using the summary-based adaptive rank truncated product method to identify gene sets and pathways associated with pancreatic ductal adenocarcinoma (PDAC) in 9040 cases and 12 496 controls. We performed expression quantitative trait loci (eQTL) analysis and functional annotation of the top SNPs in genes contributing to the top associated pathways and gene sets. All statistical tests were two-sided.
RESULTS: We identified 14 pathways and gene sets associated with PDAC at a false discovery rate of less than 0.05. After Bonferroni correction (P ≤ 1.3 × 10-5), the strongest associations were detected in five pathways and gene sets, including maturity-onset diabetes of the young, regulation of beta-cell development, role of epidermal growth factor (EGF) receptor transactivation by G protein-coupled receptors in cardiac hypertrophy pathways, and the Nikolsky breast cancer chr17q11-q21 amplicon and Pujana ATM Pearson correlation coefficient (PCC) network gene sets. We identified and validated rs876493 and three correlating SNPs (PGAP3) and rs3124737 (CASP7) from the Pujana ATM PCC gene set as eQTLs in two normal derived pancreas tissue datasets.
CONCLUSION: Our agnostic pathway and gene set analysis integrated with functional annotation and eQTL analysis provides insight into genes and pathways that may be biologically relevant for risk of PDAC, including those not previously identified.
METHODS: To discover novel pancreatic cancer risk loci and possible causal genes, we performed a pancreatic cancer transcriptome-wide association study in Europeans using three approaches: FUSION, MetaXcan, and Summary-MulTiXcan. We integrated genome-wide association studies summary statistics from 9040 pancreatic cancer cases and 12 496 controls, with gene expression prediction models built using transcriptome data from histologically normal pancreatic tissue samples (NCI Laboratory of Translational Genomics [n = 95] and Genotype-Tissue Expression v7 [n = 174] datasets) and data from 48 different tissues (Genotype-Tissue Expression v7, n = 74-421 samples).
RESULTS: We identified 25 genes whose genetically predicted expression was statistically significantly associated with pancreatic cancer risk (false discovery rate < .05), including 14 candidate genes at 11 novel loci (1p36.12: CELA3B; 9q31.1: SMC2, SMC2-AS1; 10q23.31: RP11-80H5.9; 12q13.13: SMUG1; 14q32.33: BTBD6; 15q23: HEXA; 15q26.1: RCCD1; 17q12: PNMT, CDK12, PGAP3; 17q22: SUPT4H1; 18q11.22: RP11-888D10.3; and 19p13.11: PGPEP1) and 11 at six known risk loci (5p15.33: TERT, CLPTM1L, ZDHHC11B; 7p14.1: INHBA; 9q34.2: ABO; 13q12.2: PDX1; 13q22.1: KLF5; and 16q23.1: WDR59, CFDP1, BCAR1, TMEM170A). The association for 12 of these genes (CELA3B, SMC2, and PNMT at novel risk loci and TERT, CLPTM1L, INHBA, ABO, PDX1, KLF5, WDR59, CFDP1, and BCAR1 at known loci) remained statistically significant after Bonferroni correction.
CONCLUSIONS: By integrating gene expression and genotype data, we identified novel pancreatic cancer risk loci and candidate functional genes that warrant further investigation.
METHODS: In total, DNA samples were obtained from 14,525 case subjects with invasive EOC and from 23,447 controls from 43 sites in the Ovarian Cancer Association Consortium (OCAC). Two hundred seventy nine SNPs, representing 131 genes, were genotyped using an Illumina Infinium iSelect BeadChip as part of the Collaborative Oncological Gene-environment Study (COGS). SNP analyses were conducted using unconditional logistic regression under a log-additive model, and the FDR q<0.2 was applied to adjust for multiple comparisons.
RESULTS: The most significant evidence of an association for all invasive cancers combined and for the serous subtype was observed for SNP rs17216603 in the iron transporter gene HEPH (invasive: OR = 0.85, P = 0.00026; serous: OR = 0.81, P = 0.00020); this SNP was also associated with the borderline/low malignant potential (LMP) tumors (P = 0.021). Other genes significantly associated with EOC histological subtypes (p<0.05) included the UGT1A (endometrioid), SLC25A45 (mucinous), SLC39A11 (low malignant potential), and SERPINA7 (clear cell carcinoma). In addition, 1785 SNPs in six genes (HEPH, MGST1, SERPINA, SLC25A45, SLC39A11 and UGT1A) were imputed from the 1000 Genomes Project and examined for association with INV EOC in white-European subjects. The most significant imputed SNP was rs117729793 in SLC39A11 (per allele, OR = 2.55, 95% CI = 1.5-4.35, p = 5.66x10-4).
CONCLUSION: These results, generated on a large cohort of women, revealed associations between inherited cellular transport gene variants and risk of EOC histologic subtypes.
METHODS: In a population of 15,596 epithelial ovarian cancer (EOC) cases and 23,236 controls, we measured genetic associations of 1,351 SNPs in Treg cell pathway genes with odds of ovarian cancer and tested pathway and gene-level associations, overall and by histotype, for the 25 genes, using the admixture likelihood (AML) method. The most significant single SNP associations were tested for correlation with expression levels in 44 ovarian cancer patients.
RESULTS: The most significant global associations for all genes in the pathway were seen in endometrioid ( p = 0.082) and clear cell ( p = 0.083), with the most significant gene level association seen with TGFBR2 ( p = 0.001) and clear cell EOC. Gene associations with histotypes at p < 0.05 included: IL12 ( p = 0.005 and p = 0.008, serous and high-grade serous, respectively), IL8RA ( p = 0.035, endometrioid and mucinous), LGALS1 ( p = 0.03, mucinous), STAT5B ( p = 0.022, clear cell), TGFBR1 ( p = 0.021 endometrioid) and TGFBR2 ( p = 0.017 and p = 0.025, endometrioid and mucinous, respectively).
CONCLUSIONS: Common inherited gene variation in Treg cell pathways shows some evidence of germline genetic contribution to odds of EOC that varies by histologic subtype and may be associated with mRNA expression of immune-complex receptor in EOC patients.
METHODS: We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly coexpressed with each selected TF gene in the unified microarray dataset of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this dataset were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls).
RESULTS: Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P < 0.05 and FDR < 0.05). These results were replicated (P < 0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network.
CONCLUSION: We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development.
IMPACT: Network analysis integrating large, context-specific datasets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization.