Chromosome identification is essential for linking sequence and chromosomal maps, verifying sequence assemblies, showing structural variations and tracking inheritance or recombination of chromosomes and chromosomal segments during evolution and breeding programs. Unfortunately, identification of individual chromosomes and chromosome arms has been a major challenge for some economically important crop species with a near-continuous chromosome size range and similar morphology. Here, we developed oligonucleotide-based chromosome-specific probes that enabled us to establish a reference chromosome identification system for oil palm (Elaeis guineensis Jacq., 2n = 32). Massive oligonucleotide sequence pools were anchored to individual chromosome arms using dual and triple fluorescent in situ hybridization (EgOligoFISH). Three fluorescently tagged probe libraries were developed to contain, in total 52,506 gene-rich single-copy 47-mer oligonucleotides spanning each 0.2-0.5 Mb across strategically placed chromosome regions. They generated 19 distinct FISH signals and together with rDNA probes enabled identification of all 32 E. guineensis chromosome arms. The probes were able to identify individual homoeologous chromosome regions in the related Arecaceae palm species: American oil palm (Elaeis oleifera), date palm (Phoenix dactylifera) and coconut (Cocos nucifera) showing the comparative organization and concerted evolution of genomes in the Arecaceae. The oligonucleotide probes developed here provide a valuable approach to chromosome arm identification and allow tracking chromosome transfer in hybridization and breeding programs in oil palm, as well as comparative studies within Arecaceae.
MAIN CONCLUSION: Karyotyping using high-density genome-wide SNP markers identified various chromosomal aberrations in oil palm (Elaeis guineensis Jacq.) with supporting evidence from the 2C DNA content measurements (determined using FCM) and chromosome counts. Oil palm produces a quarter of the world's total vegetable oil. In line with its global importance, an initiative to sequence the oil palm genome was carried out successfully, producing huge amounts of sequence information, allowing SNP discovery. High-capacity SNP genotyping platforms have been widely used for marker-trait association studies in oil palm. Besides genotyping, a SNP array is also an attractive tool for understanding aberrations in chromosome inheritance. Exploiting this, the present study utilized chromosome-wide SNP allelic distributions to determine the ploidy composition of over 1,000 oil palms from a commercial F1 family, including 197 derived from twin-embryo seeds. Our method consisted of an inspection of the allelic intensity ratio using SNP markers. For palms with a shifted or abnormal distribution ratio, the SNP allelic frequencies were plotted along the pseudo-chromosomes. This method proved to be efficient in identifying whole genome duplication (triploids) and aneuploidy. We also detected several loss of heterozygosity regions which may indicate small chromosomal deletions and/or inheritance of identical by descent regions from both parents. The SNP analysis was validated by flow cytometry and chromosome counts. The triploids were all derived from twin-embryo seeds. This is the first report on the efficiency and reliability of SNP array data for karyotyping oil palm chromosomes, as an alternative to the conventional cytogenetic technique. Information on the ploidy composition and chromosomal structural variation can help to better understand the genetic makeup of samples and lead to a more robust interpretation of the genomic data in marker-trait association analyses.