Displaying all 11 publications

Abstract:
Sort:
  1. Szpak M, Xue Y, Ayub Q, Tyler-Smith C
    FEBS Lett, 2019 07;593(13):1431-1448.
    PMID: 31116407 DOI: 10.1002/1873-3468.13447
    Classic selective sweeps occur when positive selection increases a variant's frequency from low to high in a population, and underlie some long-studied human characteristics such as variation in skin, hair or eye colour. In such well-studied 'gold standard' examples, a known variant has been associated with a plausible phenotype and underlying selective force. Signatures of classic sweeps have more recently been detected in population-genetic data independently of any prior information about the corresponding phenotype or selective force, and usually without suggesting any insights into these. Motivated by the need to understand such candidates, we first review the gold standards and show that our understanding of them is often incomplete or unconvincing; only two of the examples we consider are compellingly explained. We assess approaches for large-scale association of classic sweep candidate variants to phenotypes and selective forces, test these on the gold standards, and discuss the standards of evidence needed to adequately understand a selective sweep.
  2. Szpak M, Mezzavilla M, Ayub Q, Chen Y, Xue Y, Tyler-Smith C
    Genome Biol, 2018 Jan 17;19(1):5.
    PMID: 29343290 DOI: 10.1186/s13059-017-1380-2
    We present a new method, Fine-Mapping of Adaptive Variation (FineMAV), which combines population differentiation, derived allele frequency, and molecular functionality to prioritize positively selected candidate variants for functional follow-up. We calibrate and test FineMAV using eight experimentally validated "gold standard" positively selected variants and simulations. FineMAV has good sensitivity and a low false discovery rate. Applying FineMAV to the 1000 Genomes Project Phase 3 SNP dataset, we report many novel selected variants, including ones in TGM3 and PRSS53 associated with hair phenotypes that we validate using available independent data. FineMAV is widely applicable to sequence data from both human and other species.
  3. Shi W, Massaia A, Louzada S, Banerjee R, Hallast P, Chen Y, et al.
    Hum Genet, 2018 Jan;137(1):73-83.
    PMID: 29209947 DOI: 10.1007/s00439-017-1857-9
    We describe the variation in copy number of a ~ 10 kb region overlapping the long intergenic noncoding RNA (lincRNA) gene, TTTY22, within the IR3 inverted repeat on the short arm of the human Y chromosome, leading to individuals with 0-3 copies of this region in the general population. Variation of this CNV is common, with 266 individuals having 0 copies, 943 (including the reference sequence) having 1, 23 having 2 copies, and two having 3 copies, and was validated by breakpoint PCR, fibre-FISH, and 10× Genomics Chromium linked-read sequencing in subsets of 1234 individuals from the 1000 Genomes Project. Mapping the changes in copy number to the phylogeny of these Y chromosomes previously established by the Project identified at least 20 mutational events, and investigation of flanking paralogous sequence variants showed that the mutations involved flanking sequences in 18 of these, and could extend over > 30 kb of DNA. While either gene conversion or double crossover between misaligned sister chromatids could formally explain the 0-2 copy events, gene conversion is the more likely mechanism, and these events include the longest non-allelic gene conversion reported thus far. Chromosomes with three copies of this CNV have arisen just once in our data set via another mechanism: duplication of 420 kb that places the third copy 230 kb proximal to the existing proximal copy. Our results establish gene conversion as a previously under-appreciated mechanism of generating copy number changes in humans and reveal the exceptionally large size of the conversion events that can occur.
  4. Sirica R, Buonaiuto M, Petrella V, Sticco L, Tramontano D, Antonini D, et al.
    Sci Rep, 2019 03 19;9(1):4843.
    PMID: 30890716 DOI: 10.1038/s41598-019-40360-9
    Natural selection acts on genetic variants by increasing the frequency of alleles responsible for a cellular function that is favorable in a certain environment. In a previous genome-wide scan for positive selection in contemporary humans, we identified a signal of positive selection in European and Asians at the genetic variant rs10180970. The variant is located in the second intron of the ABCA12 gene, which is implicated in the lipid barrier formation and down-regulated by UVB radiation. We studied the signal of selection in the genomic region surrounding rs10180970 in a larger dataset that includes DNA sequences from ancient samples. We also investigated the functional consequences of gene expression of the alleles of rs10180970 and another genetic variant in its proximity in healthy volunteers exposed to similar UV radiation. We confirmed the selection signal and refine its location that extends over 35 kb and includes the first intron, the first two exons and the transcription starting site of ABCA12. We found no obvious effect of rs10180970 alleles on ABCA12 gene expression. We reconstructed the trajectory of the T allele over the last 80,000 years to discover that it was specific to H. sapiens and present in non-Africans 45,000 years ago.
  5. Szpak M, Collins SC, Li Y, Liu X, Ayub Q, Fischer MC, et al.
    Mol Biol Evol, 2021 Dec 09;38(12):5655-5663.
    PMID: 34464968 DOI: 10.1093/molbev/msab243
    A nonsense allele at rs1343879 in human MAGEE2 on chromosome X has previously been reported as a strong candidate for positive selection in East Asia. This premature stop codon causing ∼80% protein truncation is characterized by a striking geographical pattern of high population differentiation: common in Asia and the Americas (up to 84% in the 1000 Genomes Project East Asians) but rare elsewhere. Here, we generated a Magee2 mouse knockout mimicking the human loss-of-function mutation to study its functional consequences. The Magee2 null mice did not exhibit gross abnormalities apart from enlarged brain structures (13% increased total brain area, P = 0.0022) in hemizygous males. The area of the granular retrosplenial cortex responsible for memory, navigation, and spatial information processing was the most severely affected, exhibiting an enlargement of 34% (P = 3.4×10-6). The brain size in homozygous females showed the opposite trend of reduced brain size, although this did not reach statistical significance. With these insights, we performed human association analyses between brain size measurements and rs1343879 genotypes in 141 Chinese volunteers with brain MRI scans, replicating the sexual dimorphism seen in the knockout mouse model. The derived stop gain allele was significantly associated with a larger volume of gray matter in males (P = 0.00094), and smaller volumes of gray (P = 0.00021) and white (P = 0.0015) matter in females. It is unclear whether or not the observed neuroanatomical phenotypes affect behavior or cognition, but it might have been the driving force underlying the positive selection in humans.
  6. Arciero E, Kraaijenbrink T, Asan, Haber M, Mezzavilla M, Ayub Q, et al.
    Mol Biol Evol, 2018 Aug 01;35(8):1916-1933.
    PMID: 29796643 DOI: 10.1093/molbev/msy094
    We genotyped 738 individuals belonging to 49 populations from Nepal, Bhutan, North India, or Tibet at over 500,000 SNPs, and analyzed the genotypes in the context of available worldwide population data in order to investigate the demographic history of the region and the genetic adaptations to the harsh environment. The Himalayan populations resembled other South and East Asians, but in addition displayed their own specific ancestral component and showed strong population structure and genetic drift. We also found evidence for multiple admixture events involving Himalayan populations and South/East Asians between 200 and 2,000 years ago. In comparisons with available ancient genomes, the Himalayans, like other East and South Asian populations, showed similar genetic affinity to Eurasian hunter-gatherers (a 24,000-year-old Upper Palaeolithic Siberian), and the related Bronze Age Yamnaya. The high-altitude Himalayan populations all shared a specific ancestral component, suggesting that genetic adaptation to life at high altitude originated only once in this region and subsequently spread. Combining four approaches to identifying specific positively selected loci, we confirmed that the strongest signals of high-altitude adaptation were located near the Endothelial PAS domain-containing protein 1 and Egl-9 Family Hypoxia Inducible Factor 1 loci, and discovered eight additional robust signals of high-altitude adaptation, five of which have strong biological functional links to such adaptation. In conclusion, the demographic history of Himalayan populations is complex, with strong local differentiation, reflecting both genetic and cultural factors; these populations also display evidence of multiple genetic adaptations to high-altitude environments.
  7. Bergström A, McCarthy SA, Hui R, Almarri MA, Ayub Q, Danecek P, et al.
    Science, 2020 Mar 20;367(6484).
    PMID: 32193295 DOI: 10.1126/science.aay5012
    Genome sequences from diverse human groups are needed to understand the structure of genetic variation in our species and the history of, and relationships between, different populations. We present 929 high-coverage genome sequences from 54 diverse human populations, 26 of which are physically phased using linked-read sequencing. Analyses of these genomes reveal an excess of previously undocumented common genetic variation private to southern Africa, central Africa, Oceania, and the Americas, but an absence of such variants fixed between major geographical regions. We also find deep and gradual population separations within Africa, contrasting population size histories between hunter-gatherer and agriculturalist groups in the past 10,000 years, and a contrast between single Neanderthal but multiple Denisovan source populations contributing to present-day human populations.
  8. Shi W, Louzada S, Grigorova M, Massaia A, Arciero E, Kibena L, et al.
    Hum Mol Genet, 2019 Aug 15;28(16):2785-2798.
    PMID: 31108506 DOI: 10.1093/hmg/ddz101
    Human RBMY1 genes are located in four variable-sized clusters on the Y chromosome, expressed in male germ cells and possibly associated with sperm motility. We have re-investigated the mutational background and evolutionary history of the RBMY1 copy number distribution in worldwide samples and its relevance to sperm parameters in an Estonian cohort of idiopathic male factor infertility subjects. We estimated approximate RBMY1 copy numbers in 1218 1000 Genomes Project phase 3 males from sequencing read-depth, then chose 14 for valid ation by multicolour fibre-FISH. These fibre-FISH samples provided accurate calibration standards for the entire panel and led to detailed insights into population variation and mutational mechanisms. RBMY1 copy number worldwide ranged from 3 to 13 with a mode of 8. The two larger proximal clusters were the most variable, and additional duplications, deletions and inversions were detected. Placing the copy number estimates onto the published Y-SNP-based phylogeny of the same samples suggested a minimum of 562 mutational changes, translating to a mutation rate of 2.20 × 10-3 (95% CI 1.94 × 10-3 to 2.48 × 10-3) per father-to-son Y-transmission, higher than many short tandem repeat (Y-STRs), and showed no evidence for selection for increased or decreased copy number, but possible copy number stabilizing selection. An analysis of RBMY1 copy numbers among 376 infertility subjects failed to replicate a previously reported association with sperm motility and showed no significant effect on sperm count and concentration, serum follicle stimulating hormone (FSH), luteinizing hormone (LH) and testosterone levels or testicular and semen volume. These results provide the first in-depth insights into the structural rearrangements underlying RBMY1 copy number variation across diverse human lineages.
  9. Pinotti T, Bergström A, Geppert M, Bawn M, Ohasi D, Shi W, et al.
    Curr Biol, 2019 01 07;29(1):149-157.e3.
    PMID: 30581024 DOI: 10.1016/j.cub.2018.11.029
    The Americas were the last inhabitable continents to be occupied by humans, with a growing multidisciplinary consensus for entry 15-25 thousand years ago (kya) from northeast Asia via the former Beringia land bridge [1-4]. Autosomal DNA analyses have dated the separation of Native American ancestors from the Asian gene pool to 23 kya or later [5, 6] and mtDNA analyses to ∼25 kya [7], followed by isolation ("Beringian Standstill" [8, 9]) for 2.4-9 ky and then a rapid expansion throughout the Americas. Here, we present a calibrated sequence-based analysis of 222 Native American and relevant Eurasian Y chromosomes (24 new) from haplogroups Q and C [10], with four major conclusions. First, we identify three to four independent lineages as autochthonous and likely founders: the major Q-M3 and rarer Q-CTS1780 present throughout the Americas, the very rare C3-MPB373 in South America, and possibly the C3-P39/Z30536 in North America. Second, from the divergence times and Eurasian/American distribution of lineages, we estimate a Beringian Standstill duration of 2.7 ky or 4.6 ky, according to alternative models, and entry south of the ice sheet after 19.5 kya. Third, we describe the star-like expansion of Q-M848 (within Q-M3) starting at 15 kya [11] in the Americas, followed by establishment of substantial spatial structure in South America by 12 kya. Fourth, the deep branches of the Q-CTS1780 lineage present at low frequencies throughout the Americas today [12] may reflect a separate out-of-Beringia dispersal after the melting of the glaciers at the end of the Pleistocene.
  10. Pawar H, Rymbekova A, Cuadros-Espinoza S, Huang X, de Manuel M, van der Valk T, et al.
    Nat Ecol Evol, 2023 Sep;7(9):1503-1514.
    PMID: 37500909 DOI: 10.1038/s41559-023-02145-2
    Archaic admixture has had a substantial impact on human evolution with multiple events across different clades, including from extinct hominins such as Neanderthals and Denisovans into modern humans. In great apes, archaic admixture has been identified in chimpanzees and bonobos but the possibility of such events has not been explored in other species. Here, we address this question using high-coverage whole-genome sequences from all four extant gorilla subspecies, including six newly sequenced eastern gorillas from previously unsampled geographic regions. Using approximate Bayesian computation with neural networks to model the demographic history of gorillas, we find a signature of admixture from an archaic 'ghost' lineage into the common ancestor of eastern gorillas but not western gorillas. We infer that up to 3% of the genome of these individuals is introgressed from an archaic lineage that diverged more than 3 million years ago from the common ancestor of all extant gorillas. This introgression event took place before the split of mountain and eastern lowland gorillas, probably more than 40 thousand years ago and may have influenced perception of bitter taste in eastern gorillas. When comparing the introgression landscapes of gorillas, humans and bonobos, we find a consistent depletion of introgressed fragments on the X chromosome across these species. However, depletion in protein-coding content is not detectable in eastern gorillas, possibly as a consequence of stronger genetic drift in this species.
  11. Malaspinas AS, Westaway MC, Muller C, Sousa VC, Lao O, Alves I, et al.
    Nature, 2016 Oct 13;538(7624):207-214.
    PMID: 27654914 DOI: 10.1038/nature18299
    The population history of Aboriginal Australians remains largely uncharacterized. Here we generate high-coverage genomes for 83 Aboriginal Australians (speakers of Pama-Nyungan languages) and 25 Papuans from the New Guinea Highlands. We find that Papuan and Aboriginal Australian ancestors diversified 25-40 thousand years ago (kya), suggesting pre-Holocene population structure in the ancient continent of Sahul (Australia, New Guinea and Tasmania). However, all of the studied Aboriginal Australians descend from a single founding population that differentiated ~10-32 kya. We infer a population expansion in northeast Australia during the Holocene epoch (past 10,000 years) associated with limited gene flow from this region to the rest of Australia, consistent with the spread of the Pama-Nyungan languages. We estimate that Aboriginal Australians and Papuans diverged from Eurasians 51-72 kya, following a single out-of-Africa dispersal, and subsequently admixed with archaic populations. Finally, we report evidence of selection in Aboriginal Australians potentially associated with living in the desert.
Filters
Contact Us

Please provide feedback to Administrator ([email protected])

External Links