MiniSTR loci has demonstrated to be an effective approach to recover genetic information from degraded sample, due to the improved PCR efficiency of their reduced PCR amplicon sizes. This study constructed a partial miniSGM panel and investigated the performance of four miniSTR loci, D2S1338, D16S539, D18S51 and FGA, in three ethnic populations residing in Singapore. The suitability of the miniSTR primers for Singapore populations was assessed for loci D16S539, D18S51 and FGA.
PURPOSE: To study interethnic variation in myopia prevalence and severity in young adult males in Singapore and to determine whether these variations are related to differences in education level.
METHODS: A population-based survey of refractive errors in a cohort of 15,095 military conscripts between July 1996 and June 1997 using noncycloplegic autorefraction and a standard questionnaire. Prevalence rates of myopia (
Amelogenin paralogs on Chromosome X (AMELX) and Y (AMELY) are commonly used sexing markers. Interstitial deletion of Yp involving the AMELY locus has previously been reported. The combined frequency of the AMELY null allele in Singapore and Malaysia populations is 2.7%, 0.6% in Indian and Malay ethnic groups respectively. It is absent among 541 Chinese screened. The null allele in this study belongs to 3 Y haplogroups; J2e1 (85.7%), F* (9.5%) and D* (4.8%). Low and high-resolution STS mapping, followed by sequence analysis of breakpoint junction confirmed a large deletion of 3 to 3.7-Mb located at the Yp11.2 region. Both breakpoints were located in TSPY repeat arrays, suggesting a non-allelic homologous recombination (NAHR) mechanism of deletion. All regional null samples shared identical breakpoint sequences according to their haplogroup affiliation, providing molecular evidence of a common ancestry origin for each haplogroup, and at least 3 independent deletion events recurred in history. The estimated ages based on Y-SNP and STR analysis were approximately 13.5 +/- 3.1 kyears and approximately 0.9 +/- 0.9 kyears for the J2e1 and F* mutations, respectively. A novel polymorphism G > A at Y-GATA-H4 locus in complete linkage disequilibrium with J2e1 null mutations is a more recent event. This work re-emphasizes the need to include other sexing markers for gender determination in certain regional populations. The frequency difference among global populations suggests it constitutes another structural variation locus of human chromosome Y. The breakpoint sequences provide further information to a better understanding of the NAHR mechanism and DNA rearrangements due to higher order genomic architecture.
The human amylase gene locus at chromosome 1p21.1 is structurally complex. This region contains two pancreatic amylase genes, AMY2B, AMY2A, and a salivary gene AMY1. The AMY1 gene harbors extensive copy number variation (CNV), and recent studies have implicated this variation in adaptation to starch-rich diets and in association to obesity for European and Asian populations. In this study, we showed that by combining quantitative PCR and digital PCR, coupled with careful experimental design and calibration, we can improve the resolution of genotyping CNV with high copy numbers (CNs). In two East Asian populations of Chinese and Malay ethnicity studied, we observed a unique non-normal distribution of AMY1 diploid CN genotypes with even:odd CNs ratio of 4.5 (3.3-4.7), and an association between the common AMY2A CN = 2 genotype and odd CNs of AMY1, that could be explained by the underlying haplotypic structure. In two further case-control cohorts (n = 932 and 145, for Chinese and Malays, respectively), we did not observe the previously reported association between AMY1 and obesity or body mass index. Improved methods for accurately genotyping multiallelic CNV loci and understanding the haplotype complexity at the AMY1 locus are necessary for population genetics and association studies.
Whole-genome sequencing across multiple samples in a population provides an unprecedented opportunity for comprehensively characterizing the polymorphic variants in the population. Although the 1000 Genomes Project (1KGP) has offered brief insights into the value of population-level sequencing, the low coverage has compromised the ability to confidently detect rare and low-frequency variants. In addition, the composition of populations in the 1KGP is not complete, despite the fact that the study design has been extended to more than 2,500 samples from more than 20 population groups. The Malays are one of the Austronesian groups predominantly present in Southeast Asia and Oceania, and the Singapore Sequencing Malay Project (SSMP) aims to perform deep whole-genome sequencing of 100 healthy Malays. By sequencing at a minimum of 30× coverage, we have illustrated the higher sensitivity at detecting low-frequency and rare variants and the ability to investigate the presence of hotspots of functional mutations. Compared to the low-pass sequencing in the 1KGP, the deeper coverage allows more functional variants to be identified for each person. A comparison of the fidelity of genotype imputation of Malays indicated that a population-specific reference panel, such as the SSMP, outperforms a cosmopolitan panel with larger number of individuals for common SNPs. For lower-frequency (<5%) markers, a larger number of individuals might have to be whole-genome sequenced so that the accuracy currently afforded by the 1KGP can be achieved. The SSMP data are expected to be the benchmark for evaluating the value of deep population-level sequencing versus low-pass sequencing, especially in populations that are poorly represented in population-genetics studies.