OBJECTIVE: To determine whether genetic variants in and around SPINK5 are associated with IgE-mediated food allergy.
METHOD: We genotyped 71 "tag" single nucleotide polymorphisms (tag-SNPs) within a region spanning ~263 kb including SPINK5 (~61 kb) in n=722 (n=367 food-allergic, n=199 food-sensitized-tolerant and n=156 non-food-allergic controls) 12-month-old infants (discovery sample) phenotyped for food allergy with the gold standard oral food challenge. Transepidermal water loss (TEWL) measures were collected at 12 months from a subset (n=150) of these individuals. SNPs were tested for association with food allergy using the Cochran-Mantel-Haenszel test adjusting for ancestry strata. Association analyses were replicated in an independent sample group derived from four paediatric cohorts, total n=533 (n=203 food-allergic, n=330 non-food-allergic), mean age 2.5 years, with food allergy defined by either clinical history of reactivity, 95% positive predictive value (PPV) or challenge, corrected for ancestry by principal components.
RESULTS: SPINK5 variant rs9325071 (A⟶G) was associated with challenge-proven food allergy in the discovery sample (P=.001, OR=2.95, CI=1.49-5.83). This association was further supported by replication (P=.007, OR=1.58, CI=1.13-2.20) and by meta-analysis (P=.0004, OR=1.65). Variant rs9325071 is associated with decreased SPINK5 gene expression in the skin in publicly available genotype-tissue expression data, and we generated preliminary evidence for association of this SNP with elevated TEWL also.
CONCLUSIONS: We report, for the first time, association between SPINK5 variant rs9325071 and challenge-proven IgE-mediated food allergy.
RESULTS: We analyzed the whole-genome deep sequencing data (~ 30×) of five native trios from Peninsular Malaysia and North Borneo, and characterized the genomic variants, including single nucleotide variants (SNVs), small insertions and deletions (indels) and copy number variants (CNVs). We discovered approximately 6.9 million SNVs, 1.2 million indels, and 9000 CNVs in the 15 samples, of which 2.7% SNVs, 2.3% indels and 22% CNVs were novel, implying the insufficient coverage of population diversity in existing databases. We identified a higher proportion of novel variants in the Orang Asli (OA) samples, i.e., the indigenous people from Peninsular Malaysia, than that of the North Bornean (NB) samples, likely due to more complex demographic history and long-time isolation of the OA groups. We used the pedigree information to identify de novo variants and estimated the autosomal mutation rates to be 0.81 × 10- 8 - 1.33 × 10- 8, 1.0 × 10- 9 - 2.9 × 10- 9, and ~ 0.001 per site per generation for SNVs, indels, and CNVs, respectively. The trio-genomes also allowed for haplotype phasing with high accuracy, which serves as references to the future genomic studies of OA and NB populations. In addition, high-frequency inherited CNVs specific to OA or NB were identified. One example is a 50-kb duplication in DEFA1B detected only in the Negrito trios, implying plausible effects on host defense against the exposure of diverse microbial in tropical rainforest environment of these hunter-gatherers. The CNVs shared between OA and NB groups were much fewer than those specific to each group. Nevertheless, we identified a 142-kb duplication in AMY1A in all the 15 samples, and this gene is associated with the high-starch diet. Moreover, novel insertions shared with archaic hominids were identified in our samples.
CONCLUSION: Our study presents a full catalogue of the genome variants of the native Malaysian populations, which is a complement of the genome diversity in Southeast Asians. It implies specific population history of the native inhabitants, and demonstrated the necessity of more genome sequencing efforts on the multi-ethnic native groups of Malaysia and Southeast Asia.
METHODS: We used a panel of 34 putative susceptibility genes to perform sequencing on samples from 60,466 women with breast cancer and 53,461 controls. In separate analyses for protein-truncating variants and rare missense variants in these genes, we estimated odds ratios for breast cancer overall and tumor subtypes. We evaluated missense-variant associations according to domain and classification of pathogenicity.
RESULTS: Protein-truncating variants in 5 genes (ATM, BRCA1, BRCA2, CHEK2, and PALB2) were associated with a risk of breast cancer overall with a P value of less than 0.0001. Protein-truncating variants in 4 other genes (BARD1, RAD51C, RAD51D, and TP53) were associated with a risk of breast cancer overall with a P value of less than 0.05 and a Bayesian false-discovery probability of less than 0.05. For protein-truncating variants in 19 of the remaining 25 genes, the upper limit of the 95% confidence interval of the odds ratio for breast cancer overall was less than 2.0. For protein-truncating variants in ATM and CHEK2, odds ratios were higher for estrogen receptor (ER)-positive disease than for ER-negative disease; for protein-truncating variants in BARD1, BRCA1, BRCA2, PALB2, RAD51C, and RAD51D, odds ratios were higher for ER-negative disease than for ER-positive disease. Rare missense variants (in aggregate) in ATM, CHEK2, and TP53 were associated with a risk of breast cancer overall with a P value of less than 0.001. For BRCA1, BRCA2, and TP53, missense variants (in aggregate) that would be classified as pathogenic according to standard criteria were associated with a risk of breast cancer overall, with the risk being similar to that of protein-truncating variants.
CONCLUSIONS: The results of this study define the genes that are most clinically useful for inclusion on panels for the prediction of breast cancer risk, as well as provide estimates of the risks associated with protein-truncating variants, to guide genetic counseling. (Funded by European Union Horizon 2020 programs and others.).