With the advent of high-throughput sequencing technologies, many staphylococcal genomes have been sequenced. Comparative analysis of these strains will provide better understanding of their biology, phylogeny, virulence and taxonomy, which may contribute to better management of diseases caused by staphylococcal pathogens. We developed StaphyloBase with the goal of having a one-stop genomic resource platform for the scientific community to access, retrieve, download, browse, search, visualize and analyse the staphylococcal genomic data and annotations. We anticipate this resource platform will facilitate the analysis of staphylococcal genomic data, particularly in comparative analyses. StaphyloBase currently has a collection of 754 032 protein-coding sequences (CDSs), 19 258 rRNAs and 15 965 tRNAs from 292 genomes of different staphylococcal species. Information about these features is also included, such as putative functions, subcellular localizations and gene/protein sequences. Our web implementation supports diverse query types and the exploration of CDS- and RNA-type information in detail using an AJAX-based real-time search system. JBrowse has also been incorporated to allow rapid and seamless browsing of staphylococcal genomes. The Pairwise Genome Comparison tool is designed for comparative genomic analysis, for example, to reveal the relationships between two user-defined staphylococcal genomes. A newly designed Pathogenomics Profiling Tool (PathoProT) is also included in this platform to facilitate comparative pathogenomics analysis of staphylococcal strains. In conclusion, StaphyloBase offers access to a range of staphylococcal genomic resources as well as analysis tools for comparative analyses. Database URL: http://staphylococcus.um.edu.my/.
Corynebacteria are used for a wide variety of industrial purposes but some species are associated with human diseases. With an increasing number of corynebacterial genomes having been sequenced, comparative analysis of these strains may provide better understanding of their biology, phylogeny, virulence and taxonomy that may lead to the discovery of beneficial industrial strains or contribute to better management of diseases. To facilitate the ongoing research on corynebacteria, a specialized central repository and analysis platform for the corynebacterial research community is needed to host the fast-growing amount of genomic data and facilitate the analysis of these data. Here we present CoryneBase, a genomic database for Corynebacterium with diverse functionality for the analysis of genomes that aims to provide: (1) annotated genome sequences of Corynebacterium where 165,918 coding sequences and 4,180 RNAs can be found in 27 species; (2) access to comprehensive Corynebacterium data through the use of advanced web technologies for interactive web interfaces; and (3) advanced bioinformatic analysis tools consisting of standard BLAST for homology search, VFDB BLAST for sequence homology search against the Virulence Factor Database (VFDB), Pairwise Genome Comparison (PGC) tool for comparative genomic analysis, and a newly designed Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomic analysis. CoryneBase offers access to a range of Corynebacterium genomic resources as well as analysis tools for comparative genomics and pathogenomics. It is publicly available at http://corynebacterium.um.edu.my/.
Listeria consists of both pathogenic and non-pathogenic species. Reports of similarities in genomic content between some pathogenic and non-pathogenic species necessitate the investigation of these species at the genomic level to understand the evolution of virulence-associated genes. With Listeria genome data growing exponentially, comparative genomic analysis may give better insights into the evolution, genetics and phylogeny of Listeria spp., leading to better management of the diseases caused by them.
Jeotgalibacillus spp. are halophilic bacteria within the family Planococcaceae. No genomes of Jeotgalibacillus spp. have been reported to date, and their metabolic pathways are unknown. How the bacteria survive in hypertonic conditions such as seawater is yet to be discovered. As only a few studies have been conducted on Jeotgalibacillus spp., potential applications of these bacteria are unknown. Here, we present the complete genome of J. malaysiensis D5(T) (=DSM 28777(T) =KCTC 33350(T)), which is invaluable in identifying interesting applications for this genus.
Sulfanilic acid (4-aminobenzenesulfonic acid) is a sulfonated aromatic amine widely used in chemical industries for synthesis of various organic dyes and sulfa drugs. There are quite a few microbial co-cultures or single isolates capable of completely degrading this compound. Novosphingobium resinovorum SA1 was the first single bacterium which could utilize sulfanilic acid as its sole carbon, nitrogen and sulfur source. The strain has versatile catabolic routes for the bioconversion of numerous other aromatic compounds. Here, the complete genome sequence of the N. resinovorum SA1 strain is reported. The genome consists of a circular chromosome of 3.8 Mbp and four extrachromosomal elements between 67 and 1 759.8 kbp in size. Three alternative 3-ketoadipate pathways were identified on the plasmids. Sulfanilic acid is decomposed via a modified 3-ketoadipate pathway and the oxygenases involved form a phylogenetically separate branch on the tree. Sequence analysis of these elements might provide a genetic background for deeper insight into the versatile catabolic metabolism of various aromatic xenobiotics, including sulfanilic acid and its derivatives. Moreover, this is also a good model strain for understanding the role and evolution of multiple genetic elements within a single strain.
The type strain Planococcus donghaensis JH1T is a psychrotolerant and halotolerant bacterium with starch-degrading ability. Here, we determine the carbon utilization profile of P. donghaensis JH1T and report the first complete genome of the strain. This study revealed the strain's ability to utilize pectin and d-galacturonic acid, and identified genes responsible for degradation of the polysaccharides. The genomic information provided may serve as a fundamental resource for full exploration of the biotechnological potential of P. donghaensis JH1T.
Although more than 100 genome sequences of Pasteurella multocida are available, comprehensive and complete genome sequence analysis is limited. This study describes the analysis of the complete genome sequence and pathogenomics of P. multocida strain PMTB2.1. The genome of PMTB2.1 has 2176 genes with more than 40 coding sequences associated with iron regulation and 140 virulence genes including the complete tad locus. The tad locus includes several previously uncharacterized genes such as the flp2, rcpC and tadV genes. A transposable phage resembling Mu phages was identified in P. multocida that has not been identified in any other serotype yet. The multi-locus sequence typing analysis assigned the PMTB2.1 genome sequence as type ST101, while the comparative genome analysis showed that PMTB2.1 is closely related to other P. multocida strains with a genomic distance of less than 0.13. The expression profile of the iron-regulating genes of PMTB2.1 was characterized under an iron-limited environment. Results showed significant changes in the expression profiles of iron-regulating genes (p < 0.05), and the highest expression, that of the fecE gene (281-fold) at 30 min, suggests utilization of the outer-membrane protein system in iron acquisition at an early stage of growth. This study showed the phylogenomic relatedness of P. multocida and provided improved annotation of important genes and functional characterization of iron-regulating genes of importance to bacterial growth.
To date, the genus Roseivirga consists of six species with one subspecies and is one of the least-studied genera in the family Flammeovirgaceae. In order to further explore this genus, the genome sequences of five Roseivirga spp. were compared and described in this study. The Roseivirga genomes have similar sizes in the range of 4.08-4.47 Mb with an average of 4.22 Mb. Several key proteins related to osmotic stress adaptation were identified in Roseivirga spp., including betaine transporter, choline dehydrogenase, and glutamate synthases. A significant number of proteins associated with amino acid transport and metabolism were also present in the Roseivirga genomes. All five Roseivirga spp. were able to grow in medium containing casamino acids (a mixture of amino acids) as the sole carbon or nitrogen source. Taken together, these findings suggest a potential role of Roseivirga in decomposing organic nitrogen matter in the marine environment.
Kocuria marina has recently emerged as a cause of catheter-related bloodstream infections in patients with underlying health complications. One K. marina strain was recently isolated from the lung tissues of a wild urban rat (Rattus rattus diardii) caught during rodent surveillance. Here, we present the draft genome of the first K. marina animal isolate, K. marina TRE150902.
Aeromonas is a pathogenic organism that is often found to infect humans. Here we report the draft genome of a clinical isolate from Malaysia, Aeromonas sp. strain 159, which shows N-acylhomoserine lactone production. In the draft genome of strain 159, luxI and luxR homologue genes were found on contig 47, and these genes are believed to be important for the quorum-sensing system present in this pathogen.
Acinetobacter sp. strain GG2 is a quorum-sensing and quorum-quenching bacterium isolated from the ginger rhizosphere. It degrades a broad range of N-acylhomoserine lactone molecules via lactonase. The genome sequence of strain GG2 may provide insights into the regulation of quorum-sensing and quorum-quenching mechanisms in this bacterium.
We report here the complete genome sequence of Salmonella enterica subsp. enterica serovar Typhi P-stx-12, a clinical isolate obtained from a typhoid carrier in India.
Here, we report the annotated genome sequence of Mycobacterium tuberculosis MTB221/11. The organism was isolated from the cerebrospinal fluid of a patient in Malaysia.
This is a report of an annotated genome sequence of Mycobacterium tuberculosis MTBR1/09. The organism was isolated from a sputum sample from a male patient in Malaysia.
Mycobacterium spp. are renowned for being the causative agents of diseases like leprosy, Buruli ulcer and tuberculosis in human beings. With more and more mycobacterial genomes being sequenced, any knowledge generated from comparative genomic analysis would provide better insights into the biology, evolution, phylogeny and pathogenicity of this genus, thus helping in better management of diseases caused by Mycobacterium spp. With this motivation, we constructed MycoCAP, a new comparative analysis platform dedicated to the important genus Mycobacterium. This platform currently provides information on 2108 genome sequences of at least 55 Mycobacterium spp. A number of intuitive web-based tools have been integrated in MycoCAP, particularly for comparative analysis, including the PGC tool for comparison between two genomes, PathoProT for comparing the virulence genes among the Mycobacterium strains, the SuperClassification tool for the phylogenetic classification of the Mycobacterium strains and a specialized classification system for strains of Mycobacterium abscessus. We hope the broad range of functions and easy-to-use tools provided in MycoCAP makes it an invaluable analysis platform to speed up the research discovery on mycobacteria for researchers. Database URL: http://mycobacterium.um.edu.my.
Primers corresponding to conserved bacterial repetitive BOX elements were used to show that BOX-DNA sequences are widely distributed in phosphate-solubilizing Pseudomonas strains. Phosphate-solubilizing Pseudomonas strains were isolated from oil palm fields (tropical soil) in Malaysia. BOX elements were used to generate genomic fingerprints of a variety of Pseudomonas isolates to identify strains that were not distinguishable by other classification methods. BOX-PCR-derived genomic fingerprints were generated from whole purified genomic DNA obtained from liquid cultures of phosphate-solubilizing Pseudomonas. BOX-PCR generated fingerprints specific to phosphate-solubilizing Pseudomonas, which were used to identify the relationships between these strains. This suggests that the distribution of BOX element sequences in phosphate-solubilizing Pseudomonas strains mirrors their genomic structure. Therefore, this method appears to be a rapid, simple, and reproducible way to identify and classify phosphate-solubilizing Pseudomonas strains, and it may be a useful tool for fast identification of potential biofertilizer strains.
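As an illustration of how band-pattern fingerprints such as those from BOX-PCR can be compared, the following is a minimal sketch only. The abstract does not state which similarity measure was used; the Jaccard coefficient is assumed here because it is a common choice for presence/absence band data. The isolate names and band sizes are hypothetical.

```python
from itertools import combinations


def jaccard(bands_a, bands_b):
    """Jaccard similarity between two sets of scored band sizes."""
    a, b = set(bands_a), set(bands_b)
    if not a and not b:
        return 1.0  # two empty fingerprints are treated as identical
    return len(a & b) / len(a | b)


# Hypothetical fingerprints: band sizes (bp) scored as present per isolate.
fingerprints = {
    "isolate_A": {300, 450, 700, 1200},
    "isolate_B": {300, 450, 700, 1500},
    "isolate_C": {250, 900},
}

# Pairwise similarities, keyed by (name1, name2) in sorted order.
similarities = {
    (x, y): jaccard(fingerprints[x], fingerprints[y])
    for x, y in combinations(sorted(fingerprints), 2)
}

for pair, score in similarities.items():
    print(pair, round(score, 2))
```

Isolates with a high pairwise score share most bands and would cluster together; a full analysis would typically feed such a matrix into a clustering method, which is outside this sketch.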
Social bacteria use chemical communication to coordinate and synchronize gene expression via the quorum-sensing (QS) regulatory pathway. In Pectobacterium, a causative agent of the blackleg and soft-rot diseases on potato plants and tubers, expression of the virulence factors is collectively controlled by the QS-signals N-acylhomoserine lactones (NAHLs). Several soil bacteria, such as the actinobacterium Rhodococcus erythropolis, are able to degrade NAHLs, and hence quench the chemical communication and virulence of Pectobacterium. Here, next-generation sequencing was used to investigate structural and functional genomics of the NAHL-degrading R. erythropolis strain R138. The R. erythropolis R138 genome (6.7 Mbp) contained a single circular chromosome, one linear (250 kbp) and one circular (84 kbp) plasmid. Growth of R. erythropolis and P. atrosepticum was not altered in mixed cultures as compared with monocultures on potato tuber slices. HiSeq-transcriptomics revealed that no R. erythropolis genes were differentially expressed when R. erythropolis was cultivated in the presence vs absence of the avirulent P. atrosepticum mutant expI, which is defective for QS-signal synthesis. By contrast, 50 genes (<1% of the R. erythropolis genome) were differentially expressed when R. erythropolis was cultivated in the presence vs absence of the NAHL-producing virulent P. atrosepticum. Among them, quantitative real-time reverse-transcriptase-PCR confirmed that the expression of some alkyl-sulfatase genes decreased in the presence of a virulent P. atrosepticum, as well as under deprivation of organic sulfur such as methionine, which is a key precursor in the synthesis of NAHL by P. atrosepticum.
The discordant prevalence of Helicobacter pylori and its related diseases has long fostered certain enigmatic situations observed in the countries of the southern world. Variation in H. pylori infection rates and disease outcomes among different populations in multi-ethnic Malaysia provides a unique opportunity to understand the dynamics of host-pathogen interaction and genome evolution. In this study, we extensively analyzed and compared genomes of 27 Malaysian H. pylori isolates and identified three major phylogeographic lineages: hspEastAsia, hpEurope and hpSouthIndia. The analysis of the virulence genes within the core genome, however, revealed a comparable pathogenic potential of the strains. In addition, we identified four genes limited to strains of East-Asian lineage. Our analyses identified a few strain-specific genes encoding restriction modification systems and outlined 311 core genes possibly under differential evolutionary constraints among the strains representing different ethnic groups. The cagA and vacA genes also showed variations in accordance with the host genetic background of the strains. Moreover, restriction modification genes were found to be significantly enriched in East-Asian strains. An understanding of these variations in the genome content would provide significant insights into various adaptive and host modulation strategies harnessed by H. pylori to effectively persist in a host-specific manner.
Mycobacterium iranicum is a newly reported mycobacterial species. We present the first comparative study of M. iranicum UM_TJL and other mycobacteria. We found M. iranicum to have a close genetic association with environmental mycobacteria infrequently associated with human infections. Nonetheless, UM_TJL is also equipped with many virulence genes (some of which appear to be the consequence of transduction-related gene transfer) that have been identified in established human pathogens. Taken together, our data suggest that M. iranicum is an environmental bacterium adapted for pathogenicity in the human host. This comparative study provides important clues and forms the basis for future functional studies on this mycobacterium.