Published March 19, 2025 | Version v1
Dataset

Data and code from: The distribution and dispersal of large haploblocks in a superspecies

  • 1. University of British Columbia
  • 2. Lund University
  • 3. Chinese Academy of Sciences
  • 4. Wildlife Institute of India
  • 5. Max-Planck-Institut für Evolutionsbiologie
  • 6. Lomonosov Moscow State University
  • 7. Biozeron Shenzhen, Inc.
  • 8. University of Colombo
  • 9. Salim Ali Centre for Ornithology and Natural History
  • 10. Sichuan University
  • 11. University of Chicago

Description

Haploblocks are regions of the genome that coalesce to an ancestor as a single unit. Differentiated haplotypes in these regions can result from the accumulation of mutational differences in low-recombination chromosomal regions, especially when selective sweeps occur within geographically structured populations. We introduce a method to identify large well-differentiated haploblock regions (LHBRs), based on the variance in standardized heterozygosity (ViSHet) of single nucleotide polymorphism (SNP) genotypes among individuals, calculated across a genomic region (500 SNPs in our case). We apply this method to the greenish warbler (Phylloscopus trochiloides) ring species, using a newly assembled reference genome and genotypes at more than 1 million SNPs among 257 individuals. Most chromosomes carry a single distinctive LHBR, containing 4-6 distinct haplotypes that are associated with geography, enabling detection of hybridization events and transition zones between taxa. LHBRs have exceptionally low within-haplotype nucleotide variation and moderately low between-haplotype nucleotide distance, suggesting their establishment through recurrent selective sweeps at varying geographic scales. Meiotic drive is potentially a powerful mechanism of producing such selective sweeps, and the LHBRs are likely to often represent centromeric regions where recombination is restricted. Links between populations enable introgression of favored haplotypes and we identify one haploblock showing a highly discordant distribution compared to the rest of the genome, being present in two distantly separated geographic regions that are at similar latitudes in both east and west Asia. Our results set the stage for detailed studies of haploblocks, including their genomic location, gene content, and contribution to reproductive isolation.

Additional details

Identifiers

DOI
10.5061/dryad.8w9ghx3xr
Other
oai:uchicago.tind.io:14752

Related works

Is cited by
10.1111/mec.17731 (DOI)
Is supplemented by
10.5061/dryad.8w9ghx3xr (DOI)

Funding

Natural Sciences and Engineering Research Council of Canada
RGPIN-2017-03919
Natural Sciences and Engineering Research Council of Canada
RGPAS-2017-507830
Natural Sciences and Engineering Research Council of Canada
RGPIN-2023-04300
National Natural Science Foundation of China
32270454
Russian Science Foundation
RSF 20-14-00058-П
Second Tibetan Plateau Scientific Expedition and Research Program
STEP 2019QZKK0501
Max Planck Institute for Evolutionary Biology

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Ecology and Evolution