RESULTS: In this study we generated Whole Exome Sequencing (WES), Reduced Representation Bisulfite Sequencing (RRBS) and RNA sequencing (RNA-seq) data from samples with known mixtures of mouse and human DNA or RNA and from a cohort of human breast cancers and their derived PDTXs. We show that using an In silico Combined human-mouse Reference Genome (ICRG) for alignment discriminates between human and mouse reads with up to 99.9% accuracy and decreases the number of false positive somatic mutations caused by misalignment by >99.9%. We also derived a model to estimate the human DNA content in independent PDTX samples. For RNA-seq and RRBS data analysis, the use of the ICRG allows dissecting computationally the transcriptome and methylome of human tumour cells and mouse stroma. In a direct comparison with previously reported approaches, our method showed similar or higher accuracy while requiring significantly less computing time.
CONCLUSIONS: The computational pipeline we describe here is a valuable tool for the molecular analysis of PDTXs as well as any other mixture of DNA or RNA species.
METHODS: Blood samples were collected from P. knowlesi malaria patients within a period of 4 years (2008-2012). The pkmsp3 gene of the isolates was amplified via PCR, and subsequently cloned and sequenced. The full length pkmsp3 sequence was divided into Domain A and Domain B. Natural selection, genetic diversity, and haplotypes of pkmsp3 were analysed using MEGA6 and DnaSP ver. 5.10.00 programmes.
RESULTS: From 23 samples, 48 pkmsp3 sequences were successfully obtained. At the nucleotide level, 101 synonymous and 238 non-synonymous mutations were observed. Tests of neutrality were not significant for the full length, Domain A or Domain B sequences. However, the dN/dS ratio of Domain B indicates purifying selection for this domain. Analysis of the deduced amino acid sequences revealed 42 different haplotypes. Neighbour Joining phylogenetic tree and haplotype network analyses revealed that the haplotypes clustered into two distinct groups.
CONCLUSIONS: A moderate level of genetic diversity was observed in the pkmsp3 and only the C-terminal region (Domain B) appeared to be under purifying selection. The separation of the pkmsp3 into two haplotype groups provides further evidence of the existence of two distinct P. knowlesi types or lineages. Future studies should investigate the diversity of pkmsp3 among P. knowlesi isolates in North Borneo, where large numbers of human knowlesi malaria infection still occur.
RESULTS: Restriction-site associated DNA sequencing (RAD-seq) was employed to isolate sex-specific SNP markers for S. paramamosain. A total of 335.6 million raw reads were obtained from 20 individuals, of which 204.7 million were from 10 females and 130.9 million from 10 males. After sequence assembly and female-male comparison, 20 SNP markers were identified to be sex-specific. Furthermore, ten SNPs in a short sequence (285 bp) were confirmed heterozygous in females and homozygous in males in a large population by PCR amplification and sequencing. Subsequently, a female-specific primer was successfully designed according to the female-specific nucleotide which could amplify an expected band from females but not from males. Thus, a rapid and effective method for molecular sexing in S. paramamosain was developed, meanwhile, this method could successfully identify the sex of S. tranquebarica and S. serrata. Finally, nine and four female-specific SNP markers were detected in S. tranquebarica and S. serrata, respectively.
CONCLUSIONS: Sex-specific SNP markers were firstly identified in crab species and showed female heterogamety and male homogamety, which provided strong genetic evidence for a WZ/ZZ sex determination system in mud crabs S. paramamosain, S. tranquebarica and S. serrata. These findings will lay a solid foundation for the study of sex determination mechanism, sex chromosome evolution, and the development of mono-sex population in crustaceans.