Affiliations 

  • 1 Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
  • 2 Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia. Electronic address: [email protected]
Comput Biol Chem, 2023 Feb;102:107801.
PMID: 36528019 DOI: 10.1016/j.compbiolchem.2022.107801

Abstract

A high-quality reference genome is an important resource that can help decipher the genetic basis of traits in combination with linkage or association analyses. The publicly available oil palm draft genome sequence of AVROS pisifera (EG5) accounts for 1.535 Gb of the 1.8 Gb oil palm genome. However, the assemblies are fragmented, and the earlier assembly only had 43% of the sequences placed on pseudo-chromosomes. By integrating a number of SNP and SSR-based genetic maps, a consensus map (AM_EG5.1), comprising of 828.243 Mb genomic scaffolds anchored to 16 pseudo-chromosomes, was generated. This accounted for 54% of the genome assembly, which is a significant improvement to the original assembly. The total length of N50 scaffolds anchored to the pseudo-chromosomes increased by ∼18% compared to the previous assembly. A total of 139 quantitative trait loci for agronomically important quantitative traits, sourced from literature, were successfully mapped on the new pseudo-chromosomes. The improved assembly could also be used as a reference to identify potential errors in placement of specific markers in the linkage groups of the genetic maps used to assemble the consensus map. The 3422 unique markers from five genetic maps, anchored to the pseudo-chromosomes of AM_EG5.1, are an important resource that can be used preferentially to either construct new maps or fill gaps in existing genetic maps. Synteny analysis further revealed that the AM_EG5.1 had high collinearity with the date palm genome cultivar 'Barhee BC4' and shared most of its segmental duplications. This improved chromosomal-level genome is a valuable resource for genetic research in oil palm.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.