The GrAfSS (Graph theoretical Applications for Substructure Searching) webserver is a platform to search for three-dimensional substructures of: (i) amino acid side chains in protein structures; and (ii) base arrangements in RNA structures. The webserver interfaces the functions of five different graph theoretical algorithms - ASSAM, SPRITE, IMAAAGINE, NASSAM and COGNAC - into a single substructure searching suite. Users will be able to identify whether a three-dimensional (3D) arrangement of interest, such as a ligand binding site or 3D motif, observed in a protein or RNA structure can be found in other structures available in the Protein Data Bank (PDB). The webserver also allows users to determine whether a protein or RNA structure of interest contains substructural arrangements that are similar to known motifs or 3D arrangements. These capabilities allow for the functional annotation of new structures that were either experimentally determined or computationally generated (such as the coordinates generated by AlphaFold2) and can provide further insights into the diversity or conservation of functional mechanisms of structures in the PDB. The computed substructural superpositions are visualized using integrated NGL viewers. The GrAfSS server is available at http://mfrlab.org/grafss/.
Assessments of genetic diversity have been claimed to be significantly efficient in utilising and managing resources of genetic for breeding programme. In this study, variations in genetic were observed in 65 pineapple accessions gathered from germplasm available at Malaysian Agriculture Research and Development Institute (MARDI) located in Pontian, Johor via 15 markers of simple sequence repeat (SSR). The results showed that 59 alleles appeared to range from 2.0 to 6.0 alleles with a mean of 3.9 alleles per locus, thus displaying polymorphism for all samples at a moderate level. Furthermore, the values of polymorphic information content (PIC) had been found to range between 0.104 (TsuAC035) and 0.697 (Acom_9.9), thus averaging at the value of 0.433. In addition, the expected and the observed heterozygosity of each locus seemed to vary within the ranges of 0.033 to 0.712, and from 0.033 to 0.885, along with the average values of 0.437 and 0.511, respectively. The population structure analysis via method of delta K (ΔK), along with mean of L (K) method, revealed that individuals from the germplasm could be divided into two major clusters based on genetics (K = 2), namely Group 1 and Group 2. As such, five accessions (Yankee, SRK Chalok, SCK Giant India, SC KEW5 India and SC1 Thailand) were clustered in Group 1, while the rest were clustered in Group 2. These outcomes were also supported by the dendrogram, which had been generated through the technique of unweighted pair group with arithmetic mean (UPGMA). These analyses appear to be helpful amongst breeders to maintain and to manage their collections of germplasm. Besides, the data gathered in this study can be useful for breeders to exploit the area of genetic diversity in estimating the level of heterosis.