Affiliations 

  • 1 Department of Paediatrics, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150 Kelantan, Malaysia
  • 2 Human Genome Centre, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150 Kelantan, Malaysia
  • 3 Department of Medical Microbiology and Parasitology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150 Kelantan, Malaysia
  • 4 National Center for Genetic Engineering and Biotechnology (BIOTEC), Thailand Science Park, Pathum Thani 12120, Thailand
  • 5 Department of Paediatrics, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150 Kelantan, Malaysia
Forensic Sci Int Genet, 2017 09;30:152-159.
PMID: 28743033 DOI: 10.1016/j.fsigen.2017.07.005

Abstract

The Malays, the main ethnic group in Peninsular Malaysia, comprise various sub-ethnic groups such as Melayu Banjar, Melayu Bugis, Melayu Champa, Melayu Java, Melayu Kedah, Melayu Kelantan, Melayu Minang and Melayu Patani. Using data retrieved from the MyHVP (Malaysian Human Variome Project) database, we analysed a total of 135 individuals from these sub-ethnic groups, profiled with the Affymetrix GeneChip Mapping Xba 50-K single nucleotide polymorphism (SNP) array, to identify SNPs that serve as ancestry-informative markers (AIMs) for Malays of Peninsular Malaysia. Prior to selecting the AIMs, the genetic structure of Malays was explored with reference to 11 other populations obtained from the Pan-Asian SNP Consortium database using principal component analysis (PCA) and ADMIXTURE. Iterative pruning principal component analysis (ipPCA) was further used to identify sub-groups of Malays. Subsequently, we constructed an AIMs panel for Malays using the informativeness for assignment (In) of genetic markers, and the K-nearest neighbor (KNN) classifier was used to train the classification models. A model of 250 SNPs ranked by In correctly classified Malay individuals with an accuracy of up to 90%. The identified SNPs could be used as a panel of AIMs to ascertain the specific ancestry of Malays, which may be useful in disease association studies, biomedical research or forensic investigations.
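
The marker-ranking and classification steps described above can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes biallelic SNPs, allele frequencies already estimated per population, genotypes coded as 0/1/2 allele counts, and squared Euclidean distance for KNN. The function names (`informativeness`, `rank_snps`, `knn_predict`) and the toy data are hypothetical. In is computed with the standard definition of informativeness for assignment (Rosenberg et al., 2003), using natural logarithms.

```python
import math
from collections import Counter


def informativeness(freqs):
    """In for one biallelic SNP.

    freqs: frequency of one allele in each of K populations (assumed input).
    In = sum over alleles j of [ -p_j ln p_j + (1/K) sum_i p_ij ln p_ij ],
    where p_j is the mean frequency of allele j across populations.
    """
    k = len(freqs)

    def xlogx(p):
        # Convention: 0 * ln(0) = 0
        return p * math.log(p) if p > 0 else 0.0

    total = 0.0
    for allele_freqs in (freqs, [1.0 - p for p in freqs]):
        mean = sum(allele_freqs) / k
        total += -xlogx(mean) + sum(xlogx(p) for p in allele_freqs) / k
    return total


def rank_snps(freq_table, top):
    """Return the `top` SNP ids with the highest In.

    freq_table: dict mapping SNP id -> list of per-population frequencies.
    """
    ranked = sorted(freq_table,
                    key=lambda snp: informativeness(freq_table[snp]),
                    reverse=True)
    return ranked[:top]


def knn_predict(train, labels, sample, k=3):
    """Majority vote among the k training genotypes closest to `sample`."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(genotype, sample)), label)
        for genotype, label in zip(train, labels)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


# Toy data (hypothetical): three SNPs, two populations.
toy_freqs = {
    "snp_a": [0.5, 0.5],   # identical frequencies: uninformative
    "snp_b": [1.0, 0.0],   # fixed difference: maximally informative
    "snp_c": [0.7, 0.3],   # intermediate
}
best = rank_snps(toy_freqs, top=2)

# Toy genotypes restricted to two selected SNPs, coded as allele counts.
train = [[0, 0], [0, 1], [2, 2], [2, 1]]
labels = ["A", "A", "B", "B"]
predicted = knn_predict(train, labels, [0, 0], k=3)
```

For two populations, In ranges from 0 (identical allele frequencies) to ln 2 (a fixed difference), so ranking by In puts the SNPs with the largest frequency contrasts first. A panel such as the 250-SNP model in the abstract corresponds to taking the top-ranked markers before fitting the classifier.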

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.