RESULTS: SatA comprises c. 14.5% of the P. armeniacum genome and is specific to subgenus Parvisepalum. It is composed of four primary monomers that range from 230 to 359 bp and contains multiple inverted repeat regions with hairpin-loop motifs. A new karyotype of P. vietnamense (2n = 28) is presented and shows that the chromosome number in subgenus Parvisepalum is not conserved at 2n = 26, as previously reported. The physical locations of SatA sequences were visualised on the chromosomes of all seven Paphiopedilum species of subgenus Parvisepalum (2n = 26-28), together with the 5S and 45S rDNA loci using FISH. The SatA repeats were predominantly localisedin the centromeric, peri-centromeric and sub-telocentric chromosome regions, but the exact distribution pattern was species-specific.
CONCLUSIONS: We conclude that the newly discovered, highly abundant and rapidly evolving satellite sequence SatA is specific to Paphiopedilum subgenus Parvisepalum. SatA and rDNA chromosomal distributions are characteristic of species, and comparisons between species reveal that the distribution patterns generate a strong phylogenetic signal. We also conclude that the ancestral chromosome number of subgenus Parvisepalum and indeed of all Paphiopedilum could be either 2n = 26 or 28, if P. vietnamense is sister to all species in the subgenus as suggested by the ITS data.
RESULTS: In line with this, we have generated two small RNAs libraries from samples with contrasting lignin content using Illumina GAII sequencer. About 10 million sequence reads were obtained in secondary xylem of Am48 with high lignin content (41%) and a corresponding 14 million sequence reads were obtained in secondary xylem of Am54 with low lignin content (21%). Our results suggested that A. mangium small RNAs are composed of a set of 12 highly conserved miRNAs families found in plant miRNAs database, 82 novel miRNAs and a large proportion of non-conserved small RNAs with low expression levels. The predicted target genes of those differentially expressed conserved and non-conserved miRNAs include transcription factors associated with regulation of the lignin biosynthetic pathway genes. Some of these small RNAs play an important role in epigenetic silencing. Differential expression of the small RNAs between secondary xylem tissues with contrasting lignin content suggests that a cascade of miRNAs play an interconnected role in regulating the lignin biosynthetic pathway in Acacia species.
CONCLUSIONS: Our study critically demonstrated the roles of small RNAs during secondary wall formation. Comparison of the expression pattern of small RNAs between secondary xylem tissues with contrasting lignin content strongly indicated that small RNAs play a key regulatory role during lignin biosynthesis. Our analyses suggest an evolutionary mechanism for miRNA targets on the basis of the length of their 5' and 3' UTRs and their cellular roles. The results obtained can be used to better understand the roles of small RNAs during lignin biosynthesis and for the development of gene constructs for silencing of specific genes involved in monolignol biosynthesis with minimal effect on plant fitness and viability. For the first time, small RNAs were proven to play an important regulatory role during lignin biosynthesis in A. mangium.
RESULTS: To investigate the genomic properties and taxonomic status of these strains, we employed both 16S rRNA Sanger sequencing and whole-genome sequencing using the Illumina HiSeq X Ten platform with PE151 (paired-end) sequencing. Our analyses revealed that the draft genome of Actinomyces acetigenes ATCC 49340 T was 3.27 Mbp with a 68.0% GC content, and Actinomyces stomatis ATCC 51655 T has a genome size of 3.08 Mbp with a 68.1% GC content. Multi-locus (atpA, rpoB, pgi, metG, gltA, gyrA, and core genome SNPs) sequence analysis supported the phylogenetic placement of strains ATCC 51655 T and ATCC 49340 T as independent lineages. Digital DNA-DNA hybridization (dDDH), average nucleotide identity (ANI), and average amino acid identity (AAI) analyses indicated that both strains represented novel Actinomyces species, with values below the threshold for species demarcation (70% dDDH, 95% ANI and AAI). Pangenome analysis identified 5,731 gene clusters with strains ATCC 49340 T and ATCC 51655 T possessing 1,515 and 1,518 unique gene clusters, respectively. Additionally, genomic islands (GIs) prediction uncovered 24 putative GIs in strain ATCC 49340 T and 16 in strain ATCC 51655 T, contributing to their genetic diversity and potential adaptive capabilities. Pathogenicity analysis highlighted the potential human pathogenicity risk associated with both strains, with several virulence-associated factors identified. CRISPR-Cas analysis exposed the presence of CRISPR and Cas genes in both strains, indicating these strains might evolve a robust defense mechanism against them.
CONCLUSION: This study supports the classification of strains ATCC 49340 T and ATCC 51655 T as novel species within the Actinomyces, in which the name Actinomyces acetigenes sp. nov. (type strain ATCC 49340 T = VPI D163E-3 T = CCUG 34286 T = CCUG 35339 T) and Actinomyces stomatis sp. nov. (type strain ATCC 51655 T = PK606T = CCUG 33930 T) are proposed.
RESULTS: A set of sequences retrieved from IBD virus-infected chickens that did not map to the chicken reference genome were de novo assembled, clustered and analysed. From six inbred chicken lines, we managed to assemble 10,828 uni-transcripts and screened 618 uni-transcripts which were the most significant sequences to known genes, as determined by BLASTX searches. Based on the differentially expressed genes (DEGs) analysis, 12 commonly upregulated and 18 downregulated uni-genes present in all six inbred lines were identified with false discovery rate of q-value
RESULTS: In the genomic analysis, 33 homozygous and 1377 heterozygous mutations in the coding sequences of the genome of MT strain were detected. Among these heterozygous mutations, the proportion of mutated reads in each gene was different, ranging from 21 to 75%. These results suggest that the MT strain may contain multiple nuclei containing different mutations. We tried to isolate haploid spores from the MT strain to prove its ploidy, but this strain did not sporulate under the conditions tested. Heterozygous mutations detected in genes which are important for sporulation likely contribute to the sporulation deficiency of the MT strain. Homozygous and heterozygous mutations were found in genes encoding enzymes involved in amino acid metabolism, the TCA cycle, purine and pyrimidine nucleotide metabolism and the DNA mismatch repair system. One homozygous mutation in AgILV2 gene encoding acetohydroxyacid synthase, which is also a flavoprotein in mitochondria, was found. Gene ontology (GO) enrichment analysis showed heterozygous mutations in all 22 DNA helicase genes and genes involved in oxidation-reduction process.
CONCLUSION: This study suggests that oxidative stress and the aging of cells were involved in the riboflavin over-production in A. gossypii riboflavin over-producing mutant and provides new insights into riboflavin production in A. gossypii and the usefulness of disparity mutagenesis for the creation of new types of mutants for metabolic engineering.
RESULTS: The gene expression profile of SUB in the adult sheep was not affected by the pre- or early postnatal nutrition history. In PER, 993 and 186 differentially expressed genes (DEGs) were identified in LOW versus HIGH and NORM, respectively, but no DEG was found between HIGH and NORM. DEGs identified in the mismatched pre- and postnatal nutrition groups LOW-HCHF (101) and HIGH-HCHF (192) were largely downregulated compared to NORM-CONV. Out of 831 DEGs, 595 and 236 were up- and downregulated in HCHF versus CONV, respectively. The functional enrichment analyses revealed that transmembrane (ion) transport activities, motor activities related to cytoskeletal and spermatozoa function (microtubules and the cytoskeletal motor protein, dynein), and responsiveness to the (micro) environmental extracellular conditions, including endocrine and nervous stimuli were enriched in the DEGs of LOW versus HIGH and NORM. We confirmed that mismatched pre- and postnatal feeding was associated with long-term programming of adipose tissue remodeling and immunity-related pathways. In agreement with phenotypic measurements, early postnatal HCHF feeding targeted pathways involved in kidney cell differentiation, and mismatched LOW-HCHF sheep had specific impairments in cholesterol metabolism pathways.
CONCLUSIONS: Both pre- and postnatal malnutrition differentially programmed (patho-) physiological pathways with implications for adipose functional development associated with metabolic dysfunctions, and PER was a major target.
RESULTS: The study generated 846,762 high quality sequence reads, with an average length of 334 bp and totalling 283 Mbp. De novo assembly generated 36,384 and 35,269 unigene sequences for M. acuminata Calcutta 4 and Cavendish Grande Naine, respectively. A total of 64.4% of the unigenes were annotated through Basic Local Alignment Search Tool (BLAST) similarity analyses against public databases.Assembled sequences were functionally mapped to Gene Ontology (GO) terms, with unigene functions covering a diverse range of molecular functions, biological processes and cellular components. Genes from a number of defense-related pathways were observed in transcripts from each cDNA library. Over 99% of contig unigenes mapped to exon regions in the reference M. acuminata DH Pahang whole genome sequence. A total of 4068 genic-SSR loci were identified in Calcutta 4 and 4095 in Cavendish Grande Naine. A subset of 95 potential defense-related gene-derived simple sequence repeat (SSR) loci were validated for specific amplification and polymorphism across M. acuminata accessions. Fourteen loci were polymorphic, with alleles per polymorphic locus ranging from 3 to 8 and polymorphism information content ranging from 0.34 to 0.82.
CONCLUSIONS: A large set of unigenes were characterized in this study for both M. acuminata Calcutta 4 and Cavendish Grande Naine, increasing the number of public domain Musa ESTs. This transcriptome is an invaluable resource for furthering our understanding of biological processes elicited during biotic stresses in Musa. Gene-based markers will facilitate molecular breeding strategies, forming the basis of genetic linkage mapping and analysis of quantitative trait loci.
RESULTS: The chloroplast genome of Gracilaria firma maps as a circular molecule of 187,001 bp and contains 252 genes, which are distributed on both strands and consist of 35 RNA genes (3 rRNAs, 30 tRNAs, tmRNA and a ribonuclease P RNA component) and 217 protein-coding genes, including the unidentified open reading frames. The chloroplast genome of G. firma is by far the largest reported for Gracilariaceae, featuring a unique intergenic region of about 7000 bp with discontinuous vestiges of red algal plasmid DNA sequences interspersed between the nblA and cpeB genes. This chloroplast genome shows similar gene content and order to other Florideophycean taxa. Phylogenomic analyses based on the concatenated amino acid sequences of 146 protein-coding genes confirmed the monophyly of the classes Bangiophyceae and Florideophyceae with full nodal support. Relationships within the subclass Rhodymeniophycidae in Florideophyceae received moderate to strong nodal support, and the monotypic family of Gracilariales were resolved with maximum support.
CONCLUSIONS: Chloroplast genomes hold substantial information that can be tapped for resolving the phylogenetic relationships of difficult regions in the Rhodymeniophycidae, which are perceived to have experienced rapid radiation and thus received low nodal support, as exemplified in this study. The present study shows that chloroplast genome of G. firma could serve as a key link to the full resolution of Gracilaria sensu lato complex and recognition of Hydropuntia as a genus distinct from Gracilaria sensu stricto.
RESULTS: Having confirmed via histology, haematology and clinical biochemistry analyses that OPP is not toxic to mice, we further explored the gene expression changes caused by OPP through statistical and functional analyses using Illumina microarrays. OPP showed numerous biological activities in three major organs of mice, the liver, spleen and heart. In livers of mice given OPP, four lipid catabolism genes were up-regulated while five cholesterol biosynthesis genes were down-regulated, suggesting that OPP may play a role in reducing cardiovascular disease. OPP also up-regulated eighteen blood coagulation genes in spleens of mice. OPP elicited gene expression changes similar to the effects of caloric restriction in the hearts of mice supplemented with OPP. Microarray gene expression fold changes for six target genes in the three major organs tested were validated with real-time quantitative reverse transcription-polymerase chain reaction (qRT-PCR), and the correlation of fold changes obtained with these two techniques was high (R2 = 0.9653).
CONCLUSIONS: OPP showed non-toxicity and various pleiotropic effects in mice. This study implies the potential application of OPP as a valuable source of wellness nutraceuticals, and further suggests the molecular mechanisms as to how dietary phenolics work in vivo.
METHODS: We introduce a new node representation method based on initial information fusion, called FFANE, which amalgamates PPI networks and protein sequence data to enhance the precision of PPIs' prediction. A Gaussian kernel similarity matrix is initially established by leveraging protein structural resemblances. Concurrently, protein sequence similarities are gauged using the Levenshtein distance, enabling the capture of diverse protein attributes. Subsequently, to construct an initial information matrix, these two feature matrices are merged by employing weighted fusion to achieve an organic amalgamation of structural and sequence details. To gain a more profound understanding of the amalgamated features, a Stacked Autoencoder (SAE) is employed for encoding learning, thereby yielding more representative feature representations. Ultimately, classification models are trained to predict PPIs by using the well-learned fusion feature.
RESULTS: When employing 5-fold cross-validation experiments on SVM, our proposed method achieved average accuracies of 94.28%, 97.69%, and 84.05% in terms of Saccharomyces cerevisiae, Homo sapiens, and Helicobacter pylori datasets, respectively.
CONCLUSION: Experimental findings across various authentic datasets validate the efficacy and superiority of this fusion feature representation approach, underscoring its potential value in bioinformatics.
RESULTS: Restriction-site associated DNA sequencing (RAD-seq) was employed to isolate sex-specific SNP markers for S. paramamosain. A total of 335.6 million raw reads were obtained from 20 individuals, of which 204.7 million were from 10 females and 130.9 million from 10 males. After sequence assembly and female-male comparison, 20 SNP markers were identified to be sex-specific. Furthermore, ten SNPs in a short sequence (285 bp) were confirmed heterozygous in females and homozygous in males in a large population by PCR amplification and sequencing. Subsequently, a female-specific primer was successfully designed according to the female-specific nucleotide which could amplify an expected band from females but not from males. Thus, a rapid and effective method for molecular sexing in S. paramamosain was developed, meanwhile, this method could successfully identify the sex of S. tranquebarica and S. serrata. Finally, nine and four female-specific SNP markers were detected in S. tranquebarica and S. serrata, respectively.
CONCLUSIONS: Sex-specific SNP markers were firstly identified in crab species and showed female heterogamety and male homogamety, which provided strong genetic evidence for a WZ/ZZ sex determination system in mud crabs S. paramamosain, S. tranquebarica and S. serrata. These findings will lay a solid foundation for the study of sex determination mechanism, sex chromosome evolution, and the development of mono-sex population in crustaceans.
RESULTS: We propose a succinct representation of the distance matrices which tremendously reduces the space requirement. We give a complete solution, called SuperRec, for the inference of chromosomal structures from Hi-C data, through iterative solving the large-scale weighted multidimensional scaling problem.
CONCLUSIONS: SuperRec runs faster than earlier systems without compromising on result accuracy. The SuperRec package can be obtained from http://www.cs.cityu.edu.hk/~shuaicli/SuperRec .
RESULTS: Using a combination of short (10X Genomics) and long read (PacBio HiFi, PacBio CLR) sequencing and a genetic map for the GIFT strain, we generated a chromosome level genome assembly for the GIFT. Using genomes of two closely related species (O. mossambicus, O. aureus), we characterised the extent of introgression between these species and O. niloticus that has occurred during the breeding process. Over 11 Mb of O. mossambicus genomic material could be identified within the GIFT genome, including genes associated with immunity but also with traits of interest such as growth rate.
CONCLUSION: Because of the breeding history of elite strains, current reference genomes might not be the most suitable to support further studies into the GIFT strain. We generated a chromosome level assembly of the GIFT strain, characterising its mixed origins, and the potential contributions of introgressed regions to selected traits.