- Research article
- Open Access
Inferring the demographic history of European Ficedula flycatcher populations
BMC Evolutionary Biology volume 13, Article number: 2 (2013)
Inference of population and species histories and population stratification using genetic data is important for discriminating between different speciation scenarios and for correct interpretation of genome scans for signs of adaptive evolution and trait association. Here we use data from 24 intronic loci re-sequenced in population samples of two closely related species, the pied flycatcher and the collared flycatcher.
We applied Isolation-Migration models, assignment analyses and estimated the genetic differentiation and diversity between species and between populations within species. The data indicate a divergence time between the species of <1 million years, significantly shorter than previous estimates using mtDNA, point to a scenario with unidirectional gene-flow from the pied flycatcher into the collared flycatcher and imply that barriers to hybridisation are still permeable in a recently established hybrid zone. Furthermore, we detect significant population stratification, predominantly between the Spanish population and other pied flycatcher populations.
Our results provide further evidence for a divergence process where different genomic regions may be at different stages of speciation. We also conclude that forthcoming analyses of genotype-phenotype relations in these ecological model species should be designed to take population stratification into account.
Using genetic data to infer the demographic history of a species or a population is of importance for several reasons. For instance, a central goal in evolutionary genetics is to understand which forces have contributed to the observed patterns of genetic variation in natural populations. Of particular interest is the identification of genomic regions that evolve under pressure of natural selection and characterization of functional elements underlying fitness traits . Since demographic events in the history of populations govern the distribution of alleles on a genome-wide scale, the design, analytical efficiency and interpretation of downstream population genetic analyses or genome scans to discover such regions can be enhanced if the population history is known in some detail [2–4]. For example, association analyses may be severely biased if there is population structure or recent admixture in the set of sampled individuals . Moreover, demographic inference based on genetic data supplements morphological records in the quest towards understanding the natural history of organisms on deeper time scales [eg.  and can aid in discriminating between different scenarios of population differentiation and speciation, for instance, between speciation models including or excluding post-divergence gene-flow [7, 8]. Until recently, demographic history and the factors governing genetic diversity were generally studied using limited data sets. However, since the variance in genetic diversity at a single or a few loci is unlikely to reflect the overall genomic patterns, assessment of the proportional contribution of drift, selection and demography in shaping genetic variability and population differentiation should ideally be based on multi-locus datasets [7, 9, 10]. A recently developed and powerful way of disentangling between alternative demographic hypotheses is the application of isolation migration model theory via a maximum likelihood analysis framework [eg.  of coalescence based models of population history . This type of analysis can be time consuming and computer intense and are still not suitable for data on a genome scale but can be useful for multi-locus re-sequencing data sets with a moderate number of loci.
The pied flycatcher (Ficedula hypoleuca) and the collared flycatcher (F. albicollis) are small, migratory, passerine birds (family Muscicapidae) which occur over large parts of the western Palearctic (Figure 1). Current distribution ranges likely reflect expansions from Pleistocene glacial refuges on the Iberian (pied flycatcher) and the Apennine (collared flycatcher) peninsulas . Areas of sympatric occurrence are present both in central and eastern Europe and on the Baltic Sea islands Gotland and Öland (Figure 1), and hybridization occurs at a low rate within these zones [14–16]. The hybrid zone on the islands in the Baltic Sea might have been formed as recently as 150 years (Gotland) to 50 years (Öland) ago when the collared flycatcher started colonizing the islands previously occupied by the pied flycatcher only [17, 18]. The species system has been subject to thorough studies of speciation and hybridization, and the emerging consents include the presence of powerful intrinsic post-zygotic isolation (female hybrids are thought to be completely sterile), and potential reinforcement of pre-copulatory isolation, despite limited ecological differentiation [19, 20]. Moreover, a suite of genetic mapping studies [21–25], transcriptome characterization [e.g.  and on-going efforts to sequence the flycatcher genome point towards that the species system is underway of becoming a genetic/genomic model and hold promise for downstream unravelling of important genotype-phenotype relationships. However, the knowledge about the demographic history is still sparse and likely insufficient to allow for robust interpretations of results from genome-wide selection scans or association efforts in these species.
We set out to use a multi-locus re-sequencing approach to obtain better understanding of the population history of the collared flycatcher and the pied flycatcher in Europe. Initially we focused on between-species divergence by estimating effective population sizes, species divergence time and by assessing potential post-divergence gene-flow. We followed up by investigating intra-specific patterns of genetic diversity and differentiation, our specific aims being to i) assess the level of genetic diversity in different parts of the distribution range of each species, ii) identify potential population structure within species, and iii) use allele frequency data to look for signs of demographic events in the history of the species.
Sampling and DNA extraction
Blood samples were available from previous studies [13, 28, 29] and taken during breeding season from between 11 and 40 birds at four different locations within each species distribution range, respectively (Figure 1). Pied flycatchers were sampled in central Spain (Madrid, 40° 24’ 0” N / 3° 41’ 0” W, n=15), northern Germany (Lingen, 52° 31’ 0” N / 7° 19’ 0” E, n=16), central Sweden (Uppsala, 59° 51’ 0” N / 17° 38’ 0” E, n=22) and in southeastern Sweden (Öland, 56° 40’ 0” N / 16° 22’ 0” E, n=11). Collared flycatchers were sampled in central Italy (Abruzzo, 42° 28’ 0” N / 14° 13’ 0” E, n=19), northern Hungary (Budapest, 47° 30’ 0” N / 19° 5’ 0’ E, n=35), sothern Czech Republic (Břeclav, 48° 46’ 0’ N / 16° 53’ 0” E, n=16) and in southeastern Sweden (Öland, 52° 31’ 0” N / 7° 19’ 0” E, n=40). DNA was extracted with either the DNEasy DNA extraction kit (QIAGEN), or with a standard proteinase K digestion, phenol-chloroform purification procedure .
We selected 24 autosomal, gene-based (intronic) markers from a set previously developed for amplification in a wide range of avian taxa . The markers were chosen on basis of their amplification success in Ficedula flycatchers and to represent genes widely distributed over the genome, including both macro- and microchromosomes (Table 1). PCR amplifications were set up according to Backström et al., the general recipe was a 20 μl reaction with 50–100 ng of template DNA with 50 μM of each dNTP, 2.5 mM MgCl2, 0.5 pmol of each primer, and 0.025 U of AmpliTaq or AmpliTaq Gold (Applied Biosciences). All PCRs were run on a Tetrad PTC-100 thermal cycler (MJ Research) with the general temperature profile; initial denaturation 5 min at 95°C, 20 cycles with 30 s denaturation at 95°C, 30 s annealing starting off at 65°C and decreasing the annealing temperature with 0.5°C per cycle and 1 min elongation at 72°C, 20 cycles with the same temperature profile but with fixed annealing temperature at 55°C, and finally finishing off with an extra elongation step for 5 minutes at 72°C. PCR products were purified with Exo-SAP IT (USB Corp.) according to recommendations from the manufacturer. Purified PCR products were prepared for sequencing by running a 30 cycle sequencing reaction using ≈100 ng product together with 3 pmol of either forward or reverse primer, 0.875 μl BDX64, 0.125 μl BigDye3.1, 1.5 μl 5X dilution buffer and 10 μl ddH2O (Applied Biosystems). The temperature profile was initiated by a denaturation step for 3 min at 96°C followed by 30 cycles including 10 s at 96°C, 5 s at 50°C and 2 min at 60°C as suggested by the manufacturer. Sequencing reactions were purified using the XTerminator system according to manufacturer’s protocol and sequences were run on an ABI3730xl DNA Analyzer (Applied Biosystems). Likely as a consequence of length polymorphisms in some loci in some populations, all loci could not be sequenced to full length in all individuals. The number of chromosomes sequenced for a specific population and a specific locus is given in Table 1.
Sequences were edited using Sequencher 4.7 (Gene Codes Corp.) and trimmed to include only non-coding parts (introns). Locus specific alignments were created using Clustal W  as implemented in MEGA 5.0 . We selected 10 individuals (20 chromosomes) with the highest sequence coverage as averaged over all 24 loci from each of the four populations from each species, respectively (40 birds in total per species), for analysis.
We applied a six-parameter isolation-migration model [IMa;  to the data using selections of individuals that represented two allopatric (separate analyses including Spanish pied flycatchers compared to Italian collared flycatchers and Hungarian collared flycatchers, respectively) and one sympatric (Baltic Sea islands pied flycatchers and Baltic Sea islands collared flycatchers) population pair as well as to a dataset including all 10 individuals from all four populations lumped together for each species, respectively. Since the isolation-migration model assumes no intra-locus recombination we inferred gametic phase and analysed all four datasets for signs of recombination using PHASE and the four-gamete test as implemented in DnaSP , and cut the alignments so that the longest stretch of sequence without evidence for recombination was kept for analysis. IMa simulates genealogies under different demographic scenarios using a Markov chain Monte Carlo approach and provides the estimates of six parameter values (q1 = population mutation rate for population 1, q2 = population mutation rate for population 2, qA = population mutation rate for the ancestral population, τ = time since divergence scaled by the mutation rate, m1 = migration rate from population 2 to population 1 scaled by the mutation rate and m2 = migration rate from population 1 to population 2 scaled by the mutation rate) that best fit the data. All datasets were analysed in an initial M-mode run sampling every 100 genealogies to a sum of 5*105 genealogies after a burn-in of 1 million steps with prior ranges as follows: q1 = 0–25 q2 = 0–25, qA = 0–25, τ = 0–25, m1 = 0–15, m2 = 0–15. After inspecting the posterior distributions for the parameters a second, equally long, M-mode run was performed with narrower prior intervals and a different random seed number. The prior ranges for the second run were generally q1 = 0–1 q2 = 0–1, qA = 0–1, τ = 0–5, m1 = 0–10, m2 = 0–10. ESS values varied substantially between runs, the second analysis of allopatric populations (Spanish pied flycatchers and Hungarian collared flycatchers) had lower ESS values than the other analyses, but there was good agreement in HiPt values and posterior distribution ranges between independent runs. Subsequent analysis and the interpretations were therefore restricted to the runs with narrower priors only. All parameter estimates were scaled by a mutation rate of 1.4*10-9 and a generation time of one year. We evaluated different demographic models by comparing relevant nested models (L-mode) to the full six-parameter model and assessed the significance by likelihood ratio tests implemented in the software .
Population genetic analyses including estimates of nucleotide diversity (π), Tajima’s D, and inter-population genetic differentiation (F ST ) within and between species were calculated in DnaSP version 5 . All single nucleotide polymorphisms (SNPs, 272 in collared flycatcher and 193 in pied flycatcher) were used to assess intra-specific population structure using both a model-based clustering algorithm based on allele sharing among populations [STRUCTURE v2.3; [36, 37], and a principal component analysis (PCA) based population stratification tool in the software package eigensoft [SMARTPCA; . STRUCTURE version 2.3  was run with default settings, using the admixture model and inferring α, for 400,000 steps after a burn-in period of 100,000 steps for each species, respectively. For both species, 10 independent analyses with different random seeds were run for each value of K from K = 1 to K = 4. The optimal number of populations (K) was assessed using the method suggested in the STRUCTURE 2.3 documentation (http://pritch.bsd.uchicago.edu/software/documentation.pdf). Graphical displays of individual assignment coefficients were created using Distruct version 1.1 . Nexus files exported from MEGA 5.0  including intra-specific polymorphisms (see numbers above) were transformed to eigenstrat format using an in-house developed python script (Nexus2smartpca_v2.py, Charles Chapus personal communication). SMARTPCA  was run with default settings for each of the species separately as well as for both species combined, and the three most informative principal components were selected for the plots.
We collected up to 13,164 bp of sequence data from 24 autosomal loci (Table 1) for a total of 64 pied flycatchers and 110 collared flycatchers sampled from four different locations throughout their respective breeding ranges (Figure 1, Table 1). From each population, 10 individuals with the highest yield of sequence data were selected for analysis.
The isolation-migration analyses showed relatively consistent results over different datasets including allopatric populations, sympatric populations and all populations from each species combined. Distributions of divergence time estimates from all pair-wise comparisons steadily fell in the range of a few hundred thousand to one million years and all analyses showed a biased migration rate with higher estimated gene-flow from the pied flycatcher into the collared flycatcher (eg. 0.22*10-6 gene-1 generation-1 for the analysis of all populations combined) than vice versa (0.0016*10-6 gene-1 generation-1). The rate of gene-flow from the pied flycatcher into the collared flycatcher was also considerably higher in the comparison of sympatric populations (1.70*10-6 gene-1 generation-1) than in the comparisons of allopatric populations (0.31 - 0.42*10-6 gene-1 generation-1). The analysis of samples from all populations showed that the estimated effective population size was higher for the collared flycatcher (Ne ≈ 450,000 – 750,000) than for the pied flycatcher (Ne ≈ 150,000 – 400,000) and for the ancestral population (Ne ≈ 20,000 – 400,000). Posterior probability distributions for all parameter estimates and datasets are presented in Figure 2 and HiPt (Maximum likelihood) and 90% highest posterior density boundaries in Table 2.
Since the analysis of allopatric Spanish pied flycatchers and Hungarian collared flycatchers showed low ESS values (Effective Sample Size, an indication about the number of independent estimates that have been generated for each parameter) we focused on the other three datasets for the interpretation of nested models. These analyses supported the findings from the isolation-migration analysis with only minor differences among data sets (Additional file 1 Supplementary Information, bracketed parameters are collared flycatcher Ne, pied flycatcher Ne, ancestral Ne, migration rate, migration rate from pied to collared flycatcher and migration rate from collared to pied flycatcher, e.g. the full model being (ABCDE)). In general, a model assuming equal effective population sizes between the pied flycatcher and the collared flycatcher (AABXX) or between the collared flycatcher and the ancestral flycatcher (ABAXX) was significantly rejected, especially if the gene-flow was assumed to be equal in both directions (AABDD, ABADD) or completely absent (AAB00, ABA00). A model assuming equal effective population sizes between the pied flycatcher and the ancestral flycatcher (ABBXX) was also rejected if the rate of post-divergence gene-flow was forced to be equal in both directions (ABBDD), but not otherwise (ABBDE). The models excluding gene-flow in both directions (XXX00) were the least supported nested models of all, irrespective of if they allowed for differences in effective population size or not. Specifically, the full model, allowing for unequal population sizes and gene-flow in both directions (ABCDE) was not significantly better than a model with no gene-flow from the collared flycatcher to the pied flycatcher (ABCD0) and could be rejected in all datasets. However, the corresponding model without gene-flow from the pied flycatcher to the collared flycatcher (ABC0D) was marginally, but significantly, rejected in both sympatry (p-value = 0.04) and in the comparison of all populations combined (p-value = 0.02).
Polymorphisms and allele frequency distributions
The overall genetic diversity (π) was lower in the pied flycatcher (mean 0.0043 ± S.D. 0.0024) than in the collared flycatcher (0.0051 ± 0.0028), but the difference was not significant (Wilcoxon’s test, W = 227.5, p-value = 0.22). There were only small differences in genetic diversity between populations within species, the range of diversity estimates was between 0.0044 - 0.0049 for all populations with the exception of the Spanish pied flycatcher population which had lower diversity (0.0036; Table 3). Tajima’s D was significantly higher (Wilcoxon’s test, W = 148, p-value = 0.006) in the pied flycatcher (0.18 ± 0.56) than in the collared flycatcher (−0.32 ± 0.69) but there was no significant difference between populations within species (Table 3).
The average F ST between species was 0.31 ± 0.22. Within the pied flycatcher, there was significant differentiation between population pairs including the Spanish population with F ST = 0.13 (p-value < 0.01) in the comparison to the Baltic Sea islands and the Scandinavian mainland populations and F ST = 0.09 (p-value < 0.01) in the comparison to the German population. The level of differentiation between all other population pairs was limited (F ST < 0.01, p-value >0.05; Table 3). There was no significant differentiation between any collared flycatcher population pairs (F ST in the range of 0 – 0.05, p-values > 0.05; Table 4).
The analysis of assignment of individuals to specific populations with STRUCTURE  and SMARTPCA  did not indicate any considerable population stratification in either of the two species (Figure 3). Likelihood values for different number of clusters did not vary significantly and we interpreted the most likely number of clusters to be one in both the pied and the collared flycatcher. The PCA analysis also indicated limited population differentiation in both the pied flycatcher and the collared flycatcher. However, the resolution of the PCA allowed for more detailed visual inspection and a few observations are worth bringing up. First, in agreement with the intra-specific differentiation analyses using FST-values, the Spanish pied flycatcher population grouped outside the range of the other pied flycatcher populations, in particular along the axis of principal component 2 and 3 (Figure 4A). Second, in the collared flycatcher the most differentiated population was the population from the Baltic Sea islands (Figure 4B). Both these patterns could also be observed when all individuals from both species were analyzed together (Figure 4C).
We re-sequenced 24 autosomal loci in population samples of pied flycatchers and collared flycatchers collected throughout their respective European breeding distribution ranges and used the data to infer parameters in the history of the species and to evaluate potential population stratification within species. The isolation-migration model fitting generated relatively consistent estimates for the parameter values over all data sets. The ancestral population size was generally estimated to be smaller than the current population sizes or similar to the current population size of the pied flycatcher. In agreement with the diversity estimates, the effective population size of the pied flycatcher was smaller than the effective population size of the collared flycatcher in all comparisons. Although the exact numbers for the effective population size estimates from the isolation-migration analysis should be treated with care since they are heavily dependent on the assumed generation time and mutation rate, it is of some interest to compare the ratio of census to effective population size estimates for the different species. The estimated census population size for the pied flycatcher, 5,250,000 in Europe plus >3,000,000 in Russia , is approximately one order of magnitude higher than the estimated effective population size. In the collared flycatcher, the estimated census population size is only about 10% of the pied flycatcher estimate 340,000-762,000,  and very similar to the estimated effective population size. In line with indications of a recent population expansion , this suggests a stronger bottleneck and a more dramatic population growth from the bottlenecked population in the pied flycatcher (ie. smaller refugial populations but significant recent population growth), during re-colonization of Northern Europe subsequent to the retrieval of the ice cover from the latest Pleistocene glaciations. Previous work has shown that the pied flycatcher is more opportunistic in choice of habitat than the collared flycatcher. For example, it can breed successfully also in relatively poor habitats [17, 41], and pied flycatcher hatchlings are less vulnerable to periods of low food abundance than collared flycatcher hatchlings . Hence, the pied flycatcher is likely able to re-colonize new areas more rapidly than the collared flycatcher when an ice cover retracts after a glaciation. This ability can potentially explain the current, more widespread and more northern distribution of the pied flycatchers as compared to the collared flycatcher.
There was also regularity in the estimated divergence time among datasets, all results pointed towards numbers around 0.5 million years. Previous estimates, using mitochondrial data and a fixed clock, implied a divergence time of 1–2 million years between the species (approximately 3% mtDNA divergence, ). The inconsistency between estimates could be caused by several factors [cf. . First, the estimate from the isolation-migration model is dependent on the assumed generation time and mutation rates; a halving of the mutation rate would double the divergence time estimate. Second, divergence times based on strict molecular clocks have proven to be subject to biases from rate heterogeneity both among lineages and among genomic regions and the rate of mtDNA divergence is variable among different taxa . Finally, the estimates could reflect a true difference in divergence between mtDNA and nuclear genes. During a speciation process different genomic regions may diverge at varying rates. Simulation analyses show that a higher degree of mtDNA differentiation is expected in organisms with female biased dispersal (eg. birds in general) due to smaller drift effects of introgressed alleles in regions with high intra-specific gene-flow . The rate of differentiation in a region can also be dependent on the proximity (degree of genetic linkage) to selected alleles [46–48], differences in relative fitness of male and female hybrids , or the effective population size of the locus . It is also known that female hybrids are sterile while male hybrids can produce viable offspring , albeit with severely reduced fitness . This fitness difference has potentially contributed to reduce the gene-flow on mitochondria compared to autosomes . Moreover, as mtDNA is clonally and strictly maternally inherited, recessive mutations are not masked by dominance, similar to the case for sex-chromosomes in hemizygous state [cf. Haldane’s rule; . Consequently, there is scope for stronger selection against incompatible combinations of alleles and potential for more rapid build-up of barriers to gene-flow on mitochondria than on autosomes. If so, it is possible that recurrent secondary contact events following re-colonizations during warmer interglacial epochs have resulted in less introgression on the mitochondria than on the autosomes, similar to what has recently been observed on the Z-chromosome in several avian taxa [20, 53–57]. This would result in relatively larger proportion of shared polymorphisms and, hence, a shallower divergence estimate for the autosomes. Another possible explanation for a deeper mtDNA divergence estimate is historical (but post initial divergence) introgression of mtDNA into any of our focal species from more distant relatives, ie. Atlas flycatcher (Ficedula speculigera) or semi-collared flycatcher (F. semitorquata) . However, this scenario is less plausible since mtDNA from all abovementioned species are more or less equally differentiated postulating that past introgression has to have occurred from several extinct lineages into at least three of the extant species. Our data does not provide the power to disentangle the factors underlying the discrepancy between divergence time estimates based on mtDNA and genomic DNA. However, since many of the causes are not mutually exclusive, it is not unreasonable that a combination of factors have acted in concert, or during different time periods, to finally result in the observed pattern.
The strongest evidence for post-divergence gene-flow in our data was from the recently formed hybrid zone on the Baltic Sea islands. This result is in consistency with preceding studies [28, 29] and further supports the idea that hybridization barriers are not yet complete in this young hybrid zone . The data also support a directional bias with significantly more, or even completely unidirectional, gene-flow from the pied flycatcher to the collared flycatcher. This is in agreement with genotype information in backcrossing pied and collared flycatcher families  as well as re-sequencing data  and, given that hybrid females are sterile  this implies that the exclusive option for successful back-crossing will be hybrid males mating with collared flycatcher females.
Contrary to expectations from a substantial founder effect during re-colonization of new areas after expansion from glacial refuges, there was little variation in nucleotide diversity among populations within species. For the pied flycatcher the trend was actually opposite expectations, the Spanish population had the lowest diversity and the northern and thus more recently founded populations showed on average slightly higher diversity. This could reflect a smaller effective population size in Spain resulting from a partially fragmented population and, as implied by previous microsatellite data  and by our differentiation analyses, also probably from restricted gene-flow to other pied flycatcher populations. The Pyrenees may act as a dispersal barrier and the pied flycatcher has a disjunct distribution pattern in southwestern Europe indicating that there are limited opportunities for short-range diffusion. Overall, this suggests that newly established populations inhabiting novel habitats after glacial retraction were founded by a large number of immigrants. Alternatively, subsequent gene-flow between newly established and refugial populations may have been extensive enough to maintain similar diversity levels among populations.
All pied flycatcher populations showed slightly positive Tajima’s D values while the collared flycatcher populations showed on average slightly negative Tajima’s D values in derived populations but a slightly positive value in the supposedly ancestral Italian population. These observations stand in some contrast to previous findings in both the pied flycatcher and the collared flycatcher for a large set of Z-chromosome linked genes [average Tajima’s D = −0.52 and 0.14, respectively; 24] and to another dataset including both autosomal (D = −0.31 and 0.32) and Z-chromosome data D = 1.12 and −0.04; . Admittedly, these mixed trends make interpretation difficult and speculative. It is known that historical events like bottlenecks followed by population expansions, can have varying effects on the allele frequency distribution dependent on the effective population size of the chromosomal region . Since both species may have had fluctuating population sizes during relatively recent history, a discrepancy between autosomal and Z-chromosome linked loci is not entirely startling. However, the considerable variation among loci within chromosomes and the substantial difference between estimates among studies [24, 28] suggest that the results spring from a complicated combination of random effects, demographic history and selective pressures unique to specific populations, species or genomic regions. This implies that particular caution should be taken before drawing any conclusions regarding general population history from these data.
The F ST estimates varied only slightly among population comparisons between species and averaged at 0.31. This is lower than a previous estimate using a large set of Z-chromosome linked loci [0.36; , as expected given the lower effective population size [7, 59], and reduced introgression on the Z-chromosome compared to the autosomes . The latter effect could possibly be a consequence of the importance of Z-linked loci in species recognition and female mate choice . Within species, the level of differentiation was generally low with the exception of pair-wise comparisons including the Spanish pied flycatcher population. This agrees with the analysis of genetic diversity and the principal component analyses, and implies that there is some differentiation among current European populations of the pied flycatcher that inhabit appreciably geographically separated areas.
Recent data from large-scale genotyping assays have revealed a potential to identify genetic structure on a very detailed scale . For example, derived contemporary human populations, which actually harbor relatively limited genetic variability compared to eg. the species of flycatchers in our study [22, 62], can be assigned to geographic sampling site with very high precision . This indicates that the possibility to detect fine-scale population structure depends mainly on the number of markers used. It has also proven true that genome-wide scans for genotype-phenotype associations are highly sensitive to population structure , and that a large proportion of associations might be explained by rare genetic variants and alleles private to restricted populations . Methods for handling stratification have been suggested and they apparently perform well except under some circumstances when the loss of power is substantial and when the number of markers is low . Consequently, understanding the underlying population stratification is crucial to correctly infer the proportion and effect size of specific alleles on phenotypes of interest and to accurately transfer information from population to population. We utilized a limited number of markers, 24 loci with a few hundred SNPs in total, and detected significant population stratification in the pied flycatcher and observed some stratification also in the collared flycatcher, although between population differentiation, as measured by F ST , was insignificant in this species. Extensive genomic tools are currently under development in both the pied flycatcher and the collared flycatcher. Our data indicates that forthcoming analytical efforts using these tools should be designed so that within species stratification is taken into account using e.g. Genomic Control , or performed with individuals sampled within an unstructured subpopulation.
We sequenced 24 autosomal intronic loci in population samples of the pied flycatcher and the collared flycatcher, two model species for ecology and behavior. The data indicate substantial differences in effective and census population size for the pied flycatcher, population stratification within species and shorter divergence time for autosomes than previously reported for mtDNA. We also observed unidirectional post-divergence gene-flow between the species. Our findings support a scenario where different portions of the genome can be at different stages of speciation and they provide important information about population stratification that will be useful for forthcoming analyses of the link between genotypes and phenotypes in these ecological model species.
Availability of supporting data
All sequences have been deposited in the Dryad database under doi:http://dx.doi.org10.5061/dryad.dh53m.
Ellegren H, Sheldon BC: Genetic basis of fitness differences in natural populations. Nature. 2008, 452: 169-175. 10.1038/nature06737.
Nielsen R, Hellmann I, Hubisz M, Bustamante C, Clark AG: Recent and ongoing selection in the human genome. Nat Rev Genet. 2007, 8: 857-868.
Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, et al: Finding the missing heritability of complex diseases. Nature. 2009, 461: 747-753. 10.1038/nature08494.
Thornton KR, Jensen JD, Becquet C, Andolfatto P: Progress and prospects in mapping recent selection in the genome. Heredity. 2007, 98: 340-348.
Pritchard JK, Stephens M, Rosenberg NA, Donnelly P: Association mapping in structured populations. Am J Hum Genet. 2000, 67: 170-181. 10.1086/302959.
Goebel T, Waters MR, O’Rourke DH: The late Pleistocene dispersal of modern humans in the Americas. Science. 2008, 319: 1497-1502. 10.1126/science.1153569.
Wakeley J, Hey J: Estimating ancestral population parameters. Genetics. 1997, 145: 847-855.
Gaggiotti OE: Making inferences about speciation using sophisticated statistical genetics methods - look before you leap. Mol Ecol. 2011, 20: 2229-2232. 10.1111/j.1365-294X.2011.05104.x.
Ross-Ibarra J, Wright SI, Foxe JP, Kawabe A, DeRose-Wilson L, Gos G, Charlesworth D, Gaut BS: Patterns of polymorphism and demographic history in natural populations of Arabidopsis lyrata. PLoS One. 2008, 3: e2411-10.1371/journal.pone.0002411.
Zhang D-X, Hewitt GM: Nuclear DNA analyses in genetic studies of populations: practice, problems and prospects. Mol Ecol. 2003, 12: 563-584. 10.1046/j.1365-294X.2003.01773.x.
Hey J, Nielsen R: Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis. Genetics. 2004, 167: 747-760. 10.1534/genetics.103.024182.
Wakeley J: Coalescent theory. An introduction. 2008, Greenwood Village, CO: Roberts & Co
Sætre G-P, Borge T, Lindell J, Moum T, Primmer CR, Sheldon BC, Haavie J, Johnsen A, Ellegren H: Speciation, introgressive hybridization, and nonlinear rate of molecular evolution in flycatchers. Mol Ecol. 2001, 10: 737-749.
Alatalo RV, Gustafsson L, Lundberg A: Hybridization and breeding success of collared and pied flycatchers on the island of Gotland. Auk. 1982, 99: 285-291.
Sætre GP, Král K, Bures S, Ims RA: Dynamics of a clinal hybrid zone and a comparison with island hybrid zones of flycatchers (Ficedula hypoleuca and F. albicollis). J Zool. 1999, 247: 53-64.
Veen T, Borge T, Griffith SC, Saetre GP, Bures S, Gustafsson L, Sheldon BC: Hybridization and adaptive mate choice in flycatchers. Nature. 2001, 411: 45-50. 10.1038/35075000.
Lundberg A, Alatalo RV: The pied flycatcher. 1992, London, UK: Poyser
Qvarnström A, Wiley C, Svedin N, Vallin N: Life-history divergence facilitates regional coexistence of competing Ficedula flycatchers. Ecology. 2009, 90: 1948-1957. 10.1890/08-0494.1.
Qvarnström A, Rice AM, Ellegren H: Speciation in Ficedula flycatchers. Phil Trans Roy Soc B. 2010, 365: 1841-1852. 10.1098/rstb.2009.0306.
Sætre GP, Sæther SA: Ecology and genetics of speciation in Ficedula flycatchers. Mol Ecol. 2010, 19: 1091-1106. 10.1111/j.1365-294X.2010.04568.x.
Backström N, Brandström M, Gustafsson L, Qvarnström A, Cheng H, Ellegren H: Genetic mapping in a natural population of collared flycatchers (Ficedula albicollis): conserved synteny but gene order rearrangements on the avian Z chromosome. Genetics. 2006, 174: 377-386. 10.1534/genetics.106.058917.
Backström N, Fagerberg S, Ellegren H: Genomics of natural bird populations: a gene-based set of reference markers evenly spread across the avian genome. Mol Ecol. 2008, 17: 964-980.
Backström N, Karaiskou N, Leder EH, Gustafsson L, Primmer CR, Qvarnström A, Ellegren H: A gene-based genetic linkage map of the collared flycatcher (Ficedula albicollis) reveals extensive synteny and gene-order conservation during 100 million years of avian evolution. Genetics. 2008, 179: 1479-1495. 10.1534/genetics.108.088195.
Backström N, Lindell J, Zhang Y, Palkopoulou E, Qvarnström A, Sætre GP, Ellegren H: A high-density scan of the Z-chromosome in Ficedula flycatchers reveals candidate loci for diversifying selection. Evolution. 2010, 64: 3461-3475. 10.1111/j.1558-5646.2010.01082.x.
Backström N, Palkopoulou E, Qvarnström A, Ellegren H: No evidence for Z-chromosome rearrangements between the pied flycatcher and the collared flycatcher as judged by gene-based comparative genetic maps. Mol Ecol. 2010, 19: 3394-3405. 10.1111/j.1365-294X.2010.04742.x.
Künstner A, Wolf JBW, Backström N, Whitney O, Balakrishnan CN, Day L, Edwards SV, Janes DE, Schlinger BA, Wilson RK, et al: Comparative genomics based on massive parallel transcriptome sequencing reveals patterns of substitution and selection across 11 bird species. Mol Ecol. 2010, 19: 266-276.
Cramp S, Perrins CM: The birds of the western Palearctic. 1993, Oxford: Oxford University Press
Borge T, Lindroos K, Nadvornik P, Syvanen AC, Saetre GP: Amount of introgression in flycatcher hybrid zones reflects regional differences in pre and post-zygotic barriers to gene exchange. J Evol Biol. 2005, 18: 1416-1424. 10.1111/j.1420-9101.2005.00964.x.
Hogner S, Saether SA, Borge T, Bruvik T, Johnsen A, Saetre GP: Increased divergence but reduced variation on the Z chromosome relative to autosomes in Ficedula flycatchers: differential introgression or the faster-Z effect?. Ecol Evol. 2012, 2: 379-396. 10.1002/ece3.92.
Sambrook J, Fritsch EF, Maniatis T: Molecular Cloning: A Laboratory Manual. 1989, Cold Spring Harbor New York: Cold Spring Harbor Laboratory Press, 2nd
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucl Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analuysis using maximum likelihood, evolutionary distances, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Hey J, Nielsen R: Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics. Proc Natl Acad Sci USA. 2007, 104: 2785-2790. 10.1073/pnas.0611164104.
Librado P, Rozas J: DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009, 25: 1451-1452. 10.1093/bioinformatics/btp187.
Ellegren H: Molecular evolutionary genomics of birds. Cytogenet Genome Res. 2007, 117: 120-130. 10.1159/000103172.
Pritchard JK, Stephens M, Donnelly P: Inference of population structure using multilocus genotype data. Genetics. 2000, 155: 945-959.
Hubisz MJ, Falush D, Stephens M, Pritchard JK: Inferring weak population structure with the assistance of sample group information. Mol Ecol Res. 2009, 9: 1322-1332. 10.1111/j.1755-0998.2009.02591.x.
Patterson N, Price AL, Reich D: Population structure and eigenanalysis. PLoS Genet. 2006, 2: e190-10.1371/journal.pgen.0020190.
Rosenberg NA: Distruct: a program for the graphical display of population structure. Mol Ecol Notes. 2004, 4: 137-138.
Del Hoyo J, Elliot A, Christie DA: Handbook of the Birds of the World. Old World Flycatchers to Old World Warblers. 2006, Barcelona: Lynx Edicions
Veen T, Sheldon BC, Weissing FJ, Visser ME, Qvarnström A, Sætre G-P: Temporal diVerences in food abundance promote coexistence between two congeneric passerines. Oecologia. 2010, 162: 873-884. 10.1007/s00442-009-1544-1.
Qvarnström A, Svedin N, Wiley C, Veen T, Gustafsson L: Cross-fostering reveals seasonal changes in the relative fitness of two competing species of flycatchers. Biol Lett. 2005, 22: 68-71.
Voje KL, Hemp C, Flagstad O, Saetre GP, Stenseth NC: Climatic change as an engine for speciation in flightless Orthoptera species inhabiting African mountains. Mol Ecol. 2009, 18: 93-108.
Arbogast BS, Edwards SV, Wakeley J, Beerli P, Slowinski JB: Estimating divergence times from molecular data on phylogenetic and population genetic timescales. Ann Rev Ecol Syst. 2002, 33: 707-740. 10.1146/annurev.ecolsys.33.010802.150500.
Petit RJ, Excoffier L: Gene flow and species delimitation. Trends Ecol Evol. 2009, 24 (7): 386-393. 10.1016/j.tree.2009.02.011.
Andolfatto P: Contrasting patterns of X-linked and autosomal nucleotide variation in Drosophila melanogaster and Drosophila simulans. Mol Biol Evol. 2001, 18: 279-290. 10.1093/oxfordjournals.molbev.a003804.
Hey J, Kliman RM: Interactions between natural selection, recombination and gene density in the genes of Drosophila. Genetics. 2002, 160: 595-608.
Wu C-I, Ting C-T: Genes and speciation. Nat Rev Genet. 2004, 5: 114-122. 10.1038/nrg1269.
Hey J, Wakeley J: A coalescent estimator of the population recombination rate. Genetics. 1997, 145: 833-846.
Wiley C, Qvarnström A, Andersson G, Borge TG-PS: Postzygotic isolation over multiple generations of hybrid descendents in a natural hybrid zone: how well do single-generation estimates reflect reproductive isolation?. Evolution. 2009, 63: 1731-1739. 10.1111/j.1558-5646.2009.00674.x.
Tegelström H, Gelter H: Haldane’s rule and sex-biased gene flow between two hybridizing flycatcher species (Ficedula albicollis and F. hypoleuca, Aves: Muscicapidae). Evolution. 1990, 44: 2012-2021. 10.2307/2409611.
Haldane JBS: Sex ratio and unisexual sterility in hybrid animals. J Genet. 1922, 12: 279-281.
Borge T, Webster MT, Andersson G, Sætre G-P: Contrasting patterns of polymorphism and divergence on the Z chromosome and autosomes in two Ficedula flycatcher species. Genetics. 2005, 171: 1861-1873. 10.1534/genetics.105.045120.
Storchová R, Reif J, Nachman MW: Female heterogamety and speciation: reduced introgression of the Z chromosome between two species of nightingales. Evolution. 2009, 64: 456-471.
Backström N, Väli Ü: Sex- and species-biased gene flow in a spotted eagle hybrid zone. BMC Evol Biol. 2011, 11: 100-10.1186/1471-2148-11-100.
Carling MD, Brumfield RT: Speciation in Passerina buntings: introgression patterns of sex-linked loci identify a candidate gene region for reproductive isolation. Mol Ecol. 2009, 18: 834-847. 10.1111/j.1365-294X.2008.04038.x.
Carling MD, Lovette IJ, Brumfield RT: Historical divergence and gene flow: coalescent analyses of mitochondrial, autosomal and sex-linked loci in Passerina buntings. Evolution. 2010, 64: 1762-1772. 10.1111/j.1558-5646.2010.00954.x.
Lehtonen PK, Laaksonen T, Artemyev AV, Belskii E, Both C, Bures S, Bushuev AV, Krams I, Moreno J, Mägi M, et al: Geographic patterns of genetic differentiation and plumage colour variation are different in the pied flycatcher (Ficedula hypoleuca). Mol Ecol. 2009, 18: 4463-4476. 10.1111/j.1365-294X.2009.04364.x.
Fay JC, Wu C-I: Hitchhiking under positive Darwinian selection. Genetics. 2000, 155: 1405-1413.
Sæther SA, Sætre G-P, Borge T, Wiley C, Svedin N, Andersson G, Veen T, Haavie J, Servedio MR, Bures S, et al: Sex chromosome-linked species recognition and evolution of reproductive isolation in flycatchers. Science. 2007, 318: 95-97. 10.1126/science.1141506.
Novembre J, Johnson T, Bryc K, Kutalik Z, Boyko AR, Auton A, Indap A, King KS, Bergmann S, Nelson MR, et al: Genes mirror geography within Europe. Nature. 2008, 456: 98-101. 10.1038/nature07331.
IHMC: A haplotype map of the human genome. Nature. 2005, 437: 1299-1320. 10.1038/nature04226.
Marchini J, Cardon LR, Phillips MS, Donnelly P: The effects of human population structure on large genetic association studies. Proc Natl Acad Sci USA. 2004, 36: 512-517.
NB acknowledges Post-Doctoral funding from the Swedish Research Council (VR Grant: 2009–693). GPS is supported by the Norwegian Research Council. HE is supported by a European Research Council Advanced Investigator Award, a Knut and Alice Wallenberg Scholar Award and by the Swedish Research Council.
The authors declare no financial or non-financial competing interests.
NB and HE conceived of the study. NB designed the study, did the molecular work and analysed the data. GPS contributed materials. NB and HE wrote the manuscript with input from GPS. All authors approved the final version of the manuscript.
Electronic supplementary material
Additional file 1: Supplementary information: Nested model analysis test values (2LLR) and corresponding p-values. df = degrees of freedom. * = 2LLR test statistic follows a mixed chi-squared distribution (Hey & Nielsen 2007). Allopatry 1 = Spanish pied flycatchers and Italian collared flycatchers, Allopatry 2 = Spanish pied flycatchers and Hungarian collared flycatchers, Sympatry = pied flycatchers and collared flycatchers from the Baltic Sea islands, Species = samples from all four populations combined for each species, respectively. (PDF 65 kb) (PDF 65 KB)
About this article
Cite this article
Backström, N., Sætre, G. & Ellegren, H. Inferring the demographic history of European Ficedula flycatcher populations. BMC Evol Biol 13, 2 (2013) doi:10.1186/1471-2148-13-2
- Ficedula flycatchers