A Bayesian framework to estimate diversification rates and their variation through time and space
- Daniele Silvestro†^{1, 2, 3}Email author,
- Jan Schnitzler†^{1, 2} and
- Georg Zizka^{1, 2, 3}
https://doi.org/10.1186/1471-2148-11-311
© Silvestro et al; licensee BioMed Central Ltd. 2011
Received: 23 April 2011
Accepted: 21 October 2011
Published: 21 October 2011
Abstract
Background
Patterns of species diversity are the result of speciation and extinction processes, and molecular phylogenetic data can provide valuable information to derive their variability through time and across clades. Bayesian Markov chain Monte Carlo methods offer a promising framework to incorporate phylogenetic uncertainty when estimating rates of diversification.
Results
We introduce a new approach to estimate diversification rates in a Bayesian framework over a distribution of trees under various constant and variable rate birth-death and pure-birth models, and test it on simulated phylogenies. Furthermore, speciation and extinction rates and their posterior credibility intervals can be estimated while accounting for non-random taxon sampling. The framework is particularly suitable for hypothesis testing using Bayes factors, as we demonstrate analyzing dated phylogenies of Chondrostoma (Cyprinidae) and Lupinus (Fabaceae). In addition, we develop a model that extends the rate estimation to a meta-analysis framework in which different data sets are combined in a single analysis to detect general temporal and spatial trends in diversification.
Conclusions
Our approach provides a flexible framework for the estimation of diversification parameters and hypothesis testing while simultaneously accounting for uncertainties in the divergence times and incomplete taxon sampling.
Keywords
Background
Patterns of species diversity have been shaped by both speciation and extinction throughout the history of life, and one of the key questions in evolutionary biology is to understand the temporal and spatial dynamics of these processes [1–6]. In addition to the fossil record, molecular phylogenetic data of extant lineages can provide valuable information on the process of diversification in form of branch length and the distribution of divergence times throughout the evolutionary history of a clade. Despite the omission of extinct lineages, it has been shown that differential patterns of speciation and extinction can leave a discernible signature on phylogenetic trees of extant taxa [7, 8]. Methodological advances [9–14] as well as the growing number of well sampled, dated molecular phylogenies have generated considerable interest in unraveling the temporal dynamics of species diversification. Indeed, diversification rates have been assessed for a wide range of taxa from the tree of life to address questions concerning rapid radiations [15–18], mass extinction events [19], and differences among lineages [20, 21] and geographic regions [11, 22, 23]. In particular, the identification of potential correlates of speciation and/or extinction rates, either extrinsic [e.g. climate or ecology; [24]] and/or intrinsic [e.g. key innovations; [25–28]], has received increased attention by relating rate shifts to external conditions or the evolution of species' traits.
Nee at al. [9] first applied the generalized birth-death process [29] to molecular phylogenies of extant lineages to extract information on the evolutionary process and proposed a likelihood approach to estimate both speciation and extinction rates (λ and μ, respectively). Given the need to distinguish different modes of diversification (e.g. deviations from constant rates), further approaches have been developed to incorporate rate variation through time [8, 12, 13, 30] and across clades [14, 31]. In addition, the original birth-death process was modified to correct rate estimates in case of incomplete taxon sampling [32–35].
While Bayesian Markov chain Monte Carlo (MCMC) methods are now commonly employed in phylogenetics to accommodate for uncertainties in model parameters, the temporal uncertainty of node age estimates is usually not taken into account when studying the dynamics of species diversification, resulting in an erroneous impression of precision. Here, we present a novel MCMC approach to estimate rates of speciation and extinction over the posterior distribution of trees generated in Bayesian molecular clock analyses. Several models of diversification have been implemented, including the constant rate birth-death and pure-birth processes modified to account for incomplete taxon sampling [33], a birth-death process with continuously varying rates [8], and a pure-birth process with rate shifts [12], for which the posterior distribution of λ and the temporal position of each rate shift are jointly estimated. Within the Bayesian framework, we describe a meta-analysis approach that aims at evaluating general patterns of species diversification across different taxonomic groups. In addition to the estimation of rate parameters, the approach presented here can also be used to distinguish between different modes of diversification, and test explicit hypotheses of rate variation through time and between clades using Bayes factors. We assess the power of the MCMC and of the Bayes factor test using simulated data sets, and demonstrate the application on empirical data sets: rate variation through time in the diversification of Mediterranean cyprinid genus Chondrostoma, geographic patterns in the radiation of the genus Lupinus, and a meta-analysis of four clades from the Cape flora of South Africa.
Results
Bayesian rate estimation across phylogenies
The birth-death process was implemented in a Markov chain Monte Carlo framework to estimate the parameters of species diversification (speciation and extinction rate) while accounting for phylogenetic uncertainty. Several modifications of the birth-death process originally described by Nee et al. [9] were implemented in the MCMC algorithm to describe different patterns of diversification and allow model selection and hypothesis testing. We included a modification of the birth-death process that accounts for incomplete taxon sampling based on Yang and Rannala [33] and Stadler [34] in which the fraction of the sampled species out of the total diversity (ρ) is used to correct the estimate of the diversification parameters. Although the missing species are assumed to be randomly distributed within the phylogeny, unlike in other models [e.g. [35]], we incorporate an option to assign different sampling fractions to predefined clades (see partitioned models below). In addition, Rabosky and Lovette's [8] SPVAR model was implemented to analyze the commonly observed pattern of "explosive-early" radiations, in which clades show an initial burst of diversification followed by a gradually declining speciation rate. Another model that accounts for rate variation through time is a pure-birth process in which a fixed number of shifts in diversification rate is assumed [12]. In contrast to the continuously varying birth-death process, the rate is assumed to vary only at specific times and otherwise remain constant. For a given number of rate shifts, the MCMC estimates their temporal position and the respective rates. Finally, our approach can be used to assign independent rates to predefined clades and is especially intended for hypothesis testing, complementing other approaches in which the rate constancy across-clades is relaxed [31], or in which the number and position of the rate shifts on the tree are estimated [14]. The data set is partitioned a priori by defining clades of interest, based for example on morphology or biogeography, and independent rates, models, and sampling proportions can be assigned to each clade.
F-model: A meta-analysis approach
Proposals for the parameters m are sampled from normal distributions centered on their current values, whereas new values of the F parameter are obtained from a log-normal distribution to achieve a symmetric proposal distribution in log(F). Reflection at the boundary was used to avoid proposals outside of the valid range (e.g. m ≤ 0). Uniform priors are assigned to m in range [0, 10] and to log(F) in range [-2.3, 2.3], which corresponds to an F value in range [0.1, 10]. The clade-specific F-model can be extended to a birth-death process by assigning the parameter m to the mean net diversification (r) and introducing a second parameter n for the mean extinction fraction (a). Consequently, two parameters F_{ r } and F_{ a } are defined to measure the overall variation of r and a between clades of each data set. The significance of an overall rate difference across partitions is assessed via Bayes factor between a model in which F is allowed to vary and a model with constrained F = 1 (i.e. equal rates across partitions).
Model selection using Bayes factor
Bayes factors (BF) tests to distinguish different modes of diversification.
Simulation settings | Bayes Factors (TDI) | ||||||
---|---|---|---|---|---|---|---|
no. of tips (ρ) | λ | μ | BD | PB | PB2 | PB3 | PB4 |
50 | 0.5 | 0.05 | 0 | -0.71 | 0.17 | ||
100 | 0.5 | 0.05 | 0 | -2.40 | -1.77 | ||
50 | 0.5 | 0.25 | 0 | 2.59 | 1.51 | ||
100 | 0.5 | 0.25 | 0 | 3.66 | -0.21 | ||
50 | 0.5 | 0.45 | 0 | 24.98 | 3.57 | ||
100 | 0.5 | 0.45 | 0 | 36.68 | 5.06 | ||
50 | 0.5 | 0 | 1.05 | 0 | 1.18 | ||
100 | 0.5 | 0 | 2.92 | 0 | 0.69 | ||
100 (25%) | 1 | 0 | 2.21 | 0 | |||
200 (50%) | 1 | 0 | 4.10 | 0 | |||
300 (75%) | 1 | 0 | 5.33 | 0 | |||
400 | 1 | 0 | 6.37 | 0 | |||
100 (25%) | 1 | 0.9 | 0 | 36.65 | |||
200 (50%) | 1 | 0.9 | 0 | 68.30 | |||
300 (75%) | 1 | 0.9 | 0 | 98.18 | |||
400 | 1 | 0.9 | 0 | 131.57 | |||
50 | 0.1, 0.2 | 0 | -0.59 | 1.74 | 0 | 1.76 | |
100 | 0.1, 0.2 | 0 | 0.44 | 6.78 | 0 | 0.88 | |
50 | 0.05, 0.25 | 0 | 4.64 | 23.73 | 0 | 1.22 | |
100 | 0.05, 0.25 | 0 | 5.79 | 0 | 0.73 | ||
50 | 0.02, 0.16 | 0 | 13.94 | 40.60 | 0 | 0.69 | |
100 | 0.02, 0.16 | 0 | 29.53 | 90.60 | 0 | 0.93 | |
50 | 0.2, 0.1 | 0 | 4.02 | 0.54 | 0 | 1.47 | |
100 | 0.2, 0.1 | 0 | 11.57 | 5.57 | 0 | 1.53 | |
50 | 0.5, 0.1 | 0 | 23.22 | 18.52 | 0 | 1.49 | |
100 | 0.5, 0.1 | 0 | 58.15 | 51.49 | 0 | 0.81 | |
50 | 0.16, 0.02 | 0 | 23.92 | 19.51 | 0 | 0.85 | |
100 | 0.16, 0.02 | 0 | 56.32 | 49.53 | 0 | 0.60 | |
50 | 0.1, 0.2, 0.1 | 0 | -0.55 | -2.26 | -1.28 | 0 | 2.04 |
100 | 0.1, 0.2, 0.1 | 0 | 4.81 | 0.35 | 0.69 | 0 | 1.52 |
50 | 0.1, 0.5, 0.1 | 0 | 12.93 | 12.56 | 8.26 | 0 | 0.67 |
100 | 0.1, 0.5, 0.1 | 0 | 45.38 | 40.22 | 21.47 | 0 | -0.10 |
50 | 0.02, 0.16, 0.02 | 0 | 22.34 | 21.49 | 18.18 | 0 | -1.11 |
100 | 0.02, 0.16, 0.02 | 0 | 63.95 | 64.01 | 54.67 | 0 | -1.31 |
Rate estimation on simulated phylogenies
Estimates of speciation (λ) and extinction (μ) rates from simulated phylogenies.
Simulation settings | Estimates | |||
---|---|---|---|---|
no. of tips (ρ) | λ | μ | λ_{MEAN}(rel. error) | μ_{MEAN}(rel. error) |
50 | 0.5 | 0 | 0.47 (-0.07) | - |
100 | 0.5 | 0 | 0.48 (-0.04) | - |
50 | 0.5 | 0.05 | 0.57 (0.14) | 0.23 (3.61) |
100 | 0.5 | 0.05 | 0.55 (0.10) | 0.18 (2.55) |
50 | 0.5 | 0.25 | 0.49 (-0.03) | 0.28 (0.11) |
100 | 0.5 | 0.25 | 0.52 (0.03) | 0.30 (0.20) |
50 | 0.5 | 0.45 | 0.49 (-0.02) | 0.43 (-0.04) |
100 | 0.5 | 0.45 | 0.49 (-0.02) | 0.44 (-0.03) |
100 (25%) | 1 | 0 | 1.09 (0.09) | - |
200 (50%) | 1 | 0 | 1.07 (0.07) | - |
300 (75%) | 1 | 0 | 1.06 (0.06) | - |
400 (100%) | 1 | 0 | 1.05 (0.05) | - |
100 (25%) | 1 | 0.9 | 0.99 (-0.02) | 0.82 (-0.09) |
200 (50%) | 1 | 0.9 | 1.02 (0.02) | 0.85 (-0.05) |
300 (75%) | 1 | 0.9 | 1.04 (0.03) | 0.88 (-0.03) |
400 (100%) | 1 | 0.9 | 1.04 (0.04) | 0.89 (-0.01) |
Parameter estimation for the continuously varying birth-death process.
Simulation settings | Estimates | |||||
---|---|---|---|---|---|---|
no. of tips | λ | μ | k | λ_{MEAN}(rel. error) | μ_{MEAN}(rel. error) | k_{MEAN}(rel. error) |
50 | 1 | 0.1 | 0.25 | 1.91 (0.91) | 0.50 (3.99) | 0.23 (-0.08) |
100 | 1 | 0.1 | 0.25 | 1.68 (0.68) | 0.34 (2.38) | 0.19 (-0.25) |
50 | 5 | 0 | 0.95 | 3.72 (-0.26) | 0.31 (-) | 0.75 (-0.21) |
100 | 5 | 0 | 0.95 | 5.22 (0.04) | 0.33 (-) | 0.97 (0.02) |
Rate estimates for the pure-birth process with rate shifts.
Simulation settings | Estimates | |||
---|---|---|---|---|
no. of tips | λ | s | λ_{MEAN}(rel. error) | s _{MODE} |
50 | 0.1, 0.2 | 5 | 0.11 (0.12), 0.24 (0.22) | 4.86 |
100 | 0.1, 0.2 | 5 | 0.10, 0.22 (0.12) | 4.81 |
50 | 0.05, 0.25 | 3.5 | 0.05, 0.26 | 3.59 |
100 | 0.05, 0.25 | 5 | 0.05 (0.07), 0.26 | 4.97 |
50 | 0.02, 0.16 | 5 | 0.02 (0.07), 0.17 (0.07) | 4.95 |
100 | 0.02, 0.16 | 5 | 0.02, 0.16 | 5.05 |
50 | 0.2, 0.1 | 5 | 0.19, 0.12 (0.17) | 5.09 |
100 | 0.2, 0.1 | 7 | 0.21, 0.11 | 6.97 |
50 | 0.5, 0.1 | 3 | 0.50, 0.11 (0.12) | 3.06 |
100 | 0.5, 0.1 | 5 | 0.51, 0.10 | 5.04 |
50 | 0.16, 0.02 | 5 | 0.16, 0.03 (0.31) | 5.05 |
100 | 0.16, 0.02 | 5 | 0.16, 0.02 (0.19) | 5.01 |
50 | 0.1, 0.2, 0.1 | 3, 7 | 0.12 (0.22), 0.19 (-0.06), 0.12 (0.22) | 3.01, 6.96 |
100 | 0.1, 0.2, 0.1 | 5, 10 | 0.10, 0.19 (-0.06), 0.11 (0.08) | 4.88, 10.04 |
50 | 0.1, 0.5, 0.1 | 2, 4 | 0.11 (0.07), 0.32 (-0.36), 0.12 (0.21) | 2.05, 3.95 |
100 | 0.1, 0.5, 0.1 | 2, 6 | 0.12 (0.18), 0.50, 0.11 (0.10) | 1.94, 6.05 |
50 | 0.02, 0.16, 0.02 | 15, 20 | 0.03, 0.15, 0.03 (0.41) | 15.49, 19.77 |
100 | 0.02, 0.16, 0.02 | 15, 20 | 0.02, 0.16, 0.02 (0.10) | 15.48, 20.58 |
The model with continuously decreasing diversification rates (SPVAR) yields accurate estimates of the parameter k (which determines the magnitude of the temporal decrease of the speciation rate; Table 3). On the other hand, the estimated initial speciation rate λ_{0} tends to be overestimated when k is small (with relative errors between 0.2 and 0.3), and underestimated for higher k values.
Contrasting times of rate shift: Diversification of Mediterranean cyprinids
The use of a pure-birth process with rate shift and its implementation in hypothesis testing are illustrated in an analysis of the cyprinid genus Chondrostoma (Teleostei: Cyprinidae). A recently published molecular phylogeny of the genus [39] places the origin of the present lineages in the mid-Miocene around 15 Mya. Two alternative hypotheses on the diversification of Chondrostoma have been proposed, placing its radiation in the Mediterranean region either during the Messinian salinity crisis [40] or earlier in the Miocene [41]. The comparison of different models of diversification using Bayes factor tests led to the selection of a two-rate pure-birth process and estimated a fourfold decrease in speciation rates (dropping from an initial 0.441 to 0.108), and indicating a substantial slowdown during the Miocene. We used alternative two-rate pure-birth models to specifically test the fit of a rate shift during the Messinian [5.33 - 7.25 Mya; [40]] or earlier in the Miocene [7.25 - 23.03 Mya; [41]]. The rate shift was constrained in two separate analyses to lie within those periods, and the two models were compared by approximating a Bayes factor. Robalo et al. [42] favored the latter hypothesis based on their molecular clock analysis, although without specifically testing it in a statistical framework. Our analysis suggests that the Messinian Lago Mare phase had no particular effect on the radiation of the genus Chondrostoma (BF = 3.14), as a significant decrease in the speciation rate has to be placed before that period, thus supporting Robalo et al.'s conclusion.
Clade-specific analysis: geographic patterns in the radiation of Lupinus(Fabaceae)
Model comparison in Lupinus.
no. rates | partition settings | L_{M} | BF |
---|---|---|---|
1 | λ_{I+II+III+IV} | -184.46 | 77.01 |
2 | λ_{I+II+III}, λ_{IV} | -157.46 | 23.02 |
3 | λ_{I}, λ_{II+III}, λ_{IV} | -145.96 | 0 |
4 | λ_{I}, λ_{II}, λ _{III}, λ_{IV} | -146.08 | 0.25 |
A meta-analysis approach: Diversification of the Cape flora, South Africa
Implementation
The method described has been implemented in a computer program called "BayesRate" (available at http://sourceforge.net/projects/bayesrate/ or from the authors) written in Python [45] (based on the Numpy [46] and Scipy [47] libraries) and R [48], integrating codes using the Python module rpy2 [49]. The thermodynamic integration supports multi-core computation by simultaneously running individual Markov chains on different processors. The log files can be examined with the program Tracer [50] to check for efficiency of the sampling (ESS), and convergence between independent runs.
Discussion
We presented a new Bayesian approach, which provides a powerful tool to estimate rates of speciation and extinction on dated phylogenies based on the likelihood functions of the pure-birth, and birth-death processes while accounting for phylogenetic uncertainty and incomplete taxon sampling. On empirical data, our rate estimation requires a two steps analysis: 1) sampling a posterior distribution of dated phylogenies using a Bayesian molecular clock approach, and 2) estimating posterior diversification rates on these trees. Available programs such as BEAST [51] and mcmctree [52] apply in their relaxed molecular clock implementation different birth-death processes as priors on the node ages [53, 54]. When applied to phylogenies obtained under these assumptions, our approach therefore requires that priors on the diversification parameters are specified twice, while ideally divergence times and diversification rates should be estimated jointly. The majority of the diversification processes considered here are however currently not implemented in these programs, and thus need to be estimated independently. Analyses on simulated data show that the default uniform priors on net diversification and extinction fraction in BEAST do not affect the subsequent rate estimates (relative rate variations lower than 3%; Additional file 3). Alternatively, researchers could choose to run molecular clock analyses in which the prior on the node ages is not based on a birth-death process, but modeled using uniform or Dirichlet distributions [55, 56], as implemented in e.g. Multidivtime [57] and PhyloBayes [58].
The range of models implemented can be used to detect a number of different scenarios of rate variation, including specific events of rate increase or decrease, continuous rate variation through time, and clade-specific diversification rates, while simultaneously accounting for taxon sampling. In addition, the Bayesian framework extends the use of the birth-death models beyond the simple rate estimation allowing the comparison of alternative scenarios of diversification for hypothesis testing.
The Bayes factor test computed via thermodynamic integration has shown to represent a reliable and powerful approach to choose among different models of diversification. BF can be applied to compare non-nested models and does not require to be explicitly corrected for the number of model parameters. For these properties it is particularly suitable not only to select the best model in a Bayesian framework, but also to compare specific hypotheses. The power of Bayes factors to detect the correct number of rates predominantly depends on the magnitude of the rate shifts. Similarly, different birth-death and pure-birth processes can generate diversification patterns that might be difficult to distinguish [4, 8]. For instance, an increase in the net diversification rate can be the result of an increased speciation rate in the absence of extinction or a high extinction rate in a constant rate birth-death process. Nevertheless, we found that the Bayes factors test has the power to discern between most of such scenarios.
Analyses on simulated data show that for both speciation and extinction rates the posterior estimates are accurate. However, the width of the 95% HPD intervals also highlights the sometimes considerable uncertainty in the parameter estimates, especially in case of small phylogenies. Our simulations have shown that this uncertainty is most pronounced with a low relative extinction rate, in which case extinction tends to be overestimated [see also [32]], and estimates have a wide 95% credibility interval. This corroborates previous studies that pointed out that the estimation of extinction rates from molecular phylogenies with reasonable degrees of confidence is very problematic [7, 12, 59, 60]. It should be noted, however, that the wide credibility intervals reflect not only the uncertainties of the parameter estimation, but also the uncertainty of the data (i.e. the node ages). Estimates of the speciation rate on the other hand appear to be more robust. In contrast to Paradis [60], we find that the posterior estimate of λ has a small relative error, even under high relative extinction rates, suggesting that the accuracy in the estimates of λ might be decoupled from the relative rate of extinction. The pure-birth and birth-death models with taxon sampling are found to provide accurate rate estimates, although a poor sampling yields substantially wider 95% HPDs. Finally, the pure-birth process with rate-shifts tends to slightly underestimate the true variation of λ, as a consequence of accounting for the uncertainty of the time of rate shift.
The approach provides a very flexible framework for customized analyses and hypothesis testing such as predefined times of rate shift or constrained parameter values. We have shown with the diversification of Chondrostoma, that the pure-birth model with variable rates can be easily adapted to test for specific hypotheses, running the analysis on fixed time frames defined for example on the basis of geological events or climate changes. The implementation of clade-specific rate estimation further extends the range of options for hypothesis testing, and its application on the radiation of Lupinus showed that it can be used to identify differential rates of diversification between clades. In particular, the option to account for clade-specific sampling biases provides an important feature, as complete taxon sampling is often difficult to achieve, especially for species-rich groups.
Finally, with the F-model, we introduce a new approach to test hypotheses in a meta-analysis framework, and extend the focus from the taxon-specific rate of diversification to a parameter that might be linked to the difference between e.g. geologic periods or geographic regions. The current implementation allows to compare hypotheses with two rates, assigned to either fixed time frames or clades, while accounting for clade-specific taxon sampling. Because the F parameter is constant across data sets, we assume that the magnitude of the rate variation in time or between clades is equal among all data sets. While this certainly represents a simplification of the diversification process, the F-model allows the analyses of potentially many data sets, limiting the number of parameters, and yielding an estimation of general trends across different taxonomic groups. Moreover, even relatively small rate variations can be detected if supported by a sufficient number of data sets.
Conclusions
In summary, the approach presented here shows that temporal dynamics of species diversification resulting from biologically relevant events such as key innovations or the impact of environmental change should best be studied in a Bayesian framework. The use of MCMC sampling provides an elegant way to estimate speciation and extinction rates while taking into account the often considerable uncertainty on divergence times. Furthermore, the model with taxon sampling represents an important step towards a more realistic estimation of the diversification parameters, where a non-random distribution of missing taxa can be incorporated with clade-specific sampling proportions. In addition to the models implemented in this study, recently developed modifications of the birth-death process [13, 61] could also be integrated in the algorithm. With the possibility to run customized analyses specifically designed for hypothesis testing, this method provides a useful and flexible statistical framework to investigate diversification processes. A promising future development would be to relax the F-model to incorporate more than two rates, by assigning a specific multiplier to each time frame or group of clades.
Methods
Bayesian estimation of the diversification parameters
The function reduces to a pure-birth process (PB) in the absence of extinction (μ = 0).
- 1.
Assign initial values to the model parameters (e.g. λ, μ)
- 2.
Sample new r, a values (from which new λ, μ are obtained)
- 3.
Accept or reject the proposal based on the acceptance probability
- 4.
Repeat steps 2 and 3 many times
- 5.
Repeat steps 1 to 4 over different trees sampled from their posterior distribution
- 6.
Summarize the MCMC over all sampled trees by calculating mean and credibility interval for each parameter of interest
The MCMC iteration starts with random parameter values and successive proposals for λ and μ (step 2) are based on the sampling strategy described by Bokma [32], randomly drawing values of r = λ - μ (net diversification) and a = μ/λ (extinction fraction) from normal distributions centered on their current values. To avoid proposals lying outside of the valid interval (e.g. negative values) we use reflection at the boundary. The acceptance probability is proportional to the likelihood ratio, and uniform distributions are applied as flat priors on the rates. The MCMC is run over a distribution of trees, sampling λ and μ on each tree individually after a burnin phase (step 5) and the parameters of interest are summarized over all trees to account for the uncertainty on the node ages (step 6). The means of the posterior distributions of λ and μ are used as rate estimates and the respective credibility intervals are calculated as the 95% highest posterior density (HPD) intervals.
Assuming that a number of species are missing in a phylogeny, the missing lineages can be modeled as the result of an extinction event that occurs exactly at the present time [33]. Thus, when only a subset s of the total species S is included in the phylogeny, the likelihood of a set of branching times becomes a function of the proportion of sampled species ρ = s/S. This model assumes that taxon sampling is random with respect to the phylogeny. However, in case of a non-random sampling bias, individual clades in the phylogeny are represented to different extents. Thus, pure-birth and birth-death models were implemented in the MCMC framework with the possibility to assign a different sampling proportion (ρ) to each clade.
where λ_{0} is the initial speciation rate, and μ_{0} the final extinction rate. A constant speciation rate is found with k = 0, whereas the extinction tends to be constant when z is very large. We implement Rabosky and Lovette's [8] SPVAR model (where speciation rates decrease through time while extinction rates remain constant) by applying a uniform prior in range [0, 10] for k and setting z to 10, 000. The parameters sampled by the algorithm are λ_{0}, μ, and k.
A uniform prior from 0 to the root age is assumed for the temporal position of the rate shift s. Because the temporal position of the rate-shift is not fixed but estimated through the MCMC sampling, we summarize the marginal rates as mean value and 95% credibility interval within predefined time frames (e.g. 1 Myr intervals) from the root age to the tips of the trees and use these estimates to draw rates-through-time plots (RTT). Thus, the marginal rates reflect the uncertainty on the time of rate shift. The sampling frequencies of the rate shifts through time are used to infer the temporal placement of the rate variation events. We identify the time of rate shift by finding the modal value of the frequency distribution of each parameter s_{ i } , representing the most frequently sampled value and approximating the maximum-a-posteriori estimate (MAP).
When testing for rate differences across predefined clades, the joint likelihood of all clades C is used to estimate the posterior distribution of the speciation and extinction rates (λ_{c}, μ_{c}) of each individual clade c. The parameters λ_{c} and μ_{c} can be constrained to be equal among clades (linked model) or estimated independently (unlinked model). A model comparison between linked and unlinked parameters is performed using Bayes factors (see below) to assess the significance of the rate difference between clades.
Model selection: Bayes factors via thermodynamic integration
We found that the shape of the Bézier spline described by Beerli and Palczewski [72] provided a good approximation of the curve obtained by applying many scaling factors under different models of diversification, and therefore adopted it in our computation of the marginal likelihood. After testing various numbers of scaling classes to calculate the discrete thermodynamic estimate of the marginal likelihood (not shown), six classes were found to be a good compromise between accuracy of the result and computational time. Once the log marginal likelihoods L_{ M } were obtained via TDI, the log Bayes factor (BF) between pairs of models M_{ 0 } and M_{ 1 } was computed as BF_{ 01 } = 2(M_{ 1 } - M_{ 0 } ) and its interpretation based on the values suggested by Kass and Raftery [65]. Thus BF_{ 01 } greater than 2 represent positive evidence for model M_{ 1 } , and greater than 6 provide strong evidence. To assess the power of Bayes factor in discerning between different modes of diversification, we analyzed several simulated data set under birth-death models and pure-birth assuming one to three rate shifts with a special focus on processes that generate similar patterns (e.g. birth-death and pure-birth with rate increase).
Statistical evaluation
To test the performance of our method, we analyzed simulated phylogenies generated under a range of models using the R-package TreeSim [34, 73]. A total of 38 data sets of 100 phylogenies (with 50, 100, or 400 tips) were simulated under different models of diversification (Additional file 4). We simulated constant rate birth-death models with extinction fractions ranging from low to very high (0.1, 0.5, and 0.9), and different taxon sampling proportions (25%, 50%, 75%, and 100%). Pure-birth processes were simulated with either constant rates, or including shifts in diversification rates (one or two shifts) under small (twofold), moderate (fivefold), and large (eightfold) rate variations, respectively (Additional file 4). Trees (50 and 100 tips) with approximately continuously decreasing speciation rates were obtained by imposing nine equally spaced rate shifts, under two different diversification scenarios where speciation rates follow an exponential decrease (λ_{0} = 1, μ = 0.1, and k = 0.25; λ_{0} = 5, μ = 0, and k = 0.95). Because of the limitations of the SPVAR [8] model under variable or high extinction rates [74, 75], we assumed absent or very low and constant extinction. As these simulations only approximate continuously decreasing rates, we report the parameter estimates under the SPVAR model, but do not perform model comparisons via Bayes factors.
To assess the accuracy of the rate estimates, we calculated the relative errors [cf. [12]] as (r_{est} - r_{true})/r_{ true } , where r_{est} is the estimated rate of speciation or extinction and r_{true} is the true value. A positive relative error indicates overestimation of the parameter, whereas a negative value indicates its underestimation. For the pure-birth model with rate variation, the marginal diversification rates through time were calculated for time categories of 1 million years, and their relative errors were calculated in relation to the true values between shift points. The modal values of the posterior distribution of the shift points were compared against the true shift times and their relative error was calculated as (t_{est} - t_{true})/T, where t_{est} is the estimated time of rate shift, t_{true} is its true value, and T is the average root node age of the analyzed trees.
To address the impact of estimating rates on a single tree compared to analyzing a distribution of trees, we used a tree topology simulated in Phyl-o-Gen [76] under the birth-death process (100 tips; r = 1; a = 0.9) to simulate nucleotide sequences (3978 bp, HKY+I+Γ) using the program SeqGen [77]. Phylogenetic trees were then reconstructed in BEAST [v.1.6.1; [51]]. For comparison, we also inferred the maximum likelihood estimates of λ and μ on the consensus tree (Figure 3) through a birth-death optimization as implemented in LASER [78].
To empirically assess the potential impact of specifying priors on the birth-death parameters in both the molecular clock analysis and the subsequent rate estimation, additional simulations were performed. We generated trees in Phyl-o-Gen (50 tips; extinction fraction a = 0, 0.5, and 0.9) on which nucleotide sequences (5000 bp, HKY+I+Γ) were simulated using the program SeqGen [77]. Dated phylogenies were reconstructed in BEAST using the default uniform priors on the birth-death parameters and constraining the root node to the age of the initial tree. The posterior rates were then estimated using our MCMC approach on both the initial tree (used to simulate the alignment) and the distribution of trees obtained from BEAST. The rate estimates were compared by calculating their variation in terms of relative error (Additional file 3).
Analyses on the case studies
The phylogenetic relationships of the genus Chondrostoma were reconstructed using mitochondrial cytochrome b and nuclear ß-actin gene sequences for all currently recognized taxa [39]. Phylogenetic trees and divergence times were reconstructed using BEAST [v.1.6.1; [51]] and assuming the GTR+I+Γ model of sequence evolution. A speciation model following a Yule process was selected as the tree prior, with an uncorrelated lognormal (UCLN) model for the rate variation among branches. Secondary calibration points were used, following Gante et al. [39], constraining nodes to a normal prior: the crown node of Chondrostoma was constrained with a mean of 15.1 Mya (central 95% range 12.5 - 17.6 Mya). The split between C. olisiponensis and its sister clade was constrained with a mean of 10.1 Mya (central 95% range 7.7 - 12.4 Mya). The analysis was run for 15 million generations, sampling states every 2, 000 generations. The adequacy of the sampling was assessed with Tracer [50] using the Effective Sample Size diagnostic (Additional file 2). We evaluated the temporal patterns of diversification using all diversification models implemented in our approach applied on a random sample of 100 trees from the molecular clock analysis.
We reconstructed the phylogeny and divergence times of the genus Lupinus based on a combined alignment of ITS and LEGCYC1. Following Hughes and Eastwood [16], the two markers were partitioned and analyzed under GTR+Γ and GTR+I models, respectively. A relaxed molecular clock analysis was carried out using BEAST, assuming an uncorrelated lognormal clock model, running 30 million MCMC generations. A normal distribution with a mean of 16.01 Mya and a standard deviation of 2.6 for the stem node of Lupinus was set as calibration point. We carried out the diversification analyses on a random sample of 100 trees obtained from a relaxed molecular clock analysis, applying a pure-birth process after model selection. The estimation of the diversification rates was performed assuming clade-specific taxon sampling (ρ_{I} = 0.77, ρ_{II} = 0.55, ρ_{III} = 0.18, ρ_{IV} = 0.40) under different models in which the rates were linked or unlinked among clades.
The meta-analysis on the Cape Floristic Region was based on four dated phylogenies of different Cape clades [23, 44]. The analyses using the F-model were performed on a random set of 100 trees sampled from the posterior distribution of each data set. All clades have on average only a small proportion of missing taxa, which was accounted for by means of clade-specific sampling fractions. Each data set was split into Cape and non-Cape clades based on their main geographic distribution, high degrees of endemism - particularly within the CFR - allowed a simple assignment of clades (Figure 5C). The meta-analysis was performed implementing pure-birth models, which were favored over a birth-death process with a Bayes factor value of 3.46.
Notes
Declarations
Acknowledgements
We thank Nicolas Salamin and Bob O'Hara for valuable discussions, Hugo F. Gante and Colin Hughes for providing the alignments for Chondrostoma and Lupinus, respectively, Ziheng Yang for help implementing the birth-death model with taxon sampling, Fernando Fernandez-Mendoza and Peter Beerli for aid with the thermodynamic integration. We also thank David Posada, Folmer Bokma, and Jeffrey L. Thorne for helpful comments on an earlier version on the manuscript. All simulations were run on the Frankfurt Cloud Server, we thank Kai Bosch and Matthew Forrest for support. This study was funded by Hesse's Ministry of Higher Education, Research, and the Arts (funding program "LOEWE - Landes-Offensive zur Entwicklung wissenschaftlich-ökonomischer Exzellenz") and the German Research Foundation (DFG ZI 557/7-1, SCHU 2426/1-1).
Authors’ Affiliations
References
- Nee S, Mooers AØ, Harvey PH: Tempo and mode of evolution revealed from molecular phylogenies. Proc Natl Acad Sci USA. 1992, 89 (17): 8322-8326. 10.1073/pnas.89.17.8322.View ArticlePubMedPubMed CentralGoogle Scholar
- Sanderson MJ, Donoghue MJ: Reconstructing shifts in diversification rates on phylogenetic trees. Trends Ecol Evol. 1996, 11 (1): 15-20. 10.1016/0169-5347(96)81059-7.View ArticlePubMedGoogle Scholar
- Barraclough TG, Nee S: Phylogenetics and speciation. Trends Ecol Evol. 2001, 16 (7): 391-399. 10.1016/S0169-5347(01)02161-9.View ArticlePubMedGoogle Scholar
- Ricklefs RE: Estimating diversification rates from phylogenetic information. Trends Ecol Evol. 2007, 22 (11): 601-610. 10.1016/j.tree.2007.06.013.View ArticlePubMedGoogle Scholar
- Jablonski D, Roy K, Valentine JW, Price RM, Anderson PS: The impact of the pull of the recent on the history of marine diversity. Science. 2003, 300 (5622): 1133-1135. 10.1126/science.1083246.View ArticlePubMedGoogle Scholar
- Jaramillo C, Rueda MJ, Mora G: Cenozoic plant diversity in the Neotropics. Science. 2006, 311 (5769): 1893-1896. 10.1126/science.1121380.View ArticlePubMedGoogle Scholar
- Nee S, Holmes EC, May RM, Harvey PH: Extinction rates can be estimated from molecular phylogenies. Phil Trans R Soc B. 1994, 344 (1307): 77-82. 10.1098/rstb.1994.0054.View ArticlePubMedGoogle Scholar
- Rabosky DL, Lovette IJ: Explosive evolutionary radiations: Decreasing speciation or increasing extinction through time?. Evolution. 2008, 62 (8): 1866-1875. 10.1111/j.1558-5646.2008.00409.x.View ArticlePubMedGoogle Scholar
- Nee S, May RM, Harvey PH: The reconstructed evolutionary process. Phil Trans R Soc B. 1994, 344: 305-311. 10.1098/rstb.1994.0068.View ArticlePubMedGoogle Scholar
- Magallón S, Sanderson MJ: Absolute diversification rates in angiosperm clades. Evolution. 2001, 55 (9): 1762-1780.View ArticlePubMedGoogle Scholar
- Ricklefs RE: Global variation in the diversification rate of passerine birds. Ecology. 2006, 87 (10): 2468-2478. 10.1890/0012-9658(2006)87[2468:GVITDR]2.0.CO;2.View ArticlePubMedGoogle Scholar
- Rabosky DL: Likelihood methods for detecting temporal shifts in diversification rates. Evolution. 2006, 60 (6): 1152-1164.View ArticlePubMedGoogle Scholar
- Morlon H, Potts MD, Plotkin JB: Inferring the dynamics of diversification: a coalescent approach. PLoS Biol. 2010, 8 (9): e1000493-10.1371/journal.pbio.1000493.View ArticlePubMedPubMed CentralGoogle Scholar
- Alfaro ME, Santini F, Brock C, Alamillo H, Dornburg A, Rabosky DL, Carnevale G, Harmon LJ: Nine exceptional radiations plus high turnover explain species diversity in jawed vertebrates. Proc Natl Acad Sci USA. 2009, 106 (32): 13410-13414. 10.1073/pnas.0811087106.View ArticlePubMedPubMed CentralGoogle Scholar
- Baldwin BG, Sanderson MJ: Age and rate of diversification of the Hawaiian silversword alliance (Compositae). Proc Natl Acad Sci USA. 1998, 95 (16): 9402-9406. 10.1073/pnas.95.16.9402.View ArticlePubMedPubMed CentralGoogle Scholar
- Hughes C, Eastwood R: Island radiation on a continental scale: Exceptional rates of plant diversification after uplift of the Andes. Proc Natl Acad Sci USA. 2006, 103 (27): 10334-10339. 10.1073/pnas.0601928103.View ArticlePubMedPubMed CentralGoogle Scholar
- Valente LM, Savolainen V, Vargas P: Unparalleled rates of species diversification in Europe. Proc R Soc Lond B. 2010, 277 (1687): 1489-1496. 10.1098/rspb.2009.2163.View ArticleGoogle Scholar
- Day JJ, Cotton JA, Barraclough TG: Tempo and mode of diversification of lake Tanganyika cichlid fishes. PLoS ONE. 2008, 3 (3): e1730-10.1371/journal.pone.0001730.View ArticlePubMedPubMed CentralGoogle Scholar
- Bininda-Emonds ORP, Cardillo M, Jones KE, MacPhee RDE, Beck RMD, Grenyer R, Price SA, Vos RA, Gittleman JL, Purvis A: The delayed rise of present-day mammals. Nature. 2007, 446: 507-512. 10.1038/nature05634.View ArticlePubMedGoogle Scholar
- Magallón S, Castillo A: Angiosperm diversification through time. Am J Bot. 2009, 96 (1): 349-365. 10.3732/ajb.0800060.View ArticlePubMedGoogle Scholar
- Phillimore AB, Freckleton RP, Orme CDL, Owens IPF: Ecology predicts large-scale patterns of phylogenetic diversification in birds. Am Nat. 2006, 168 (2): 220-229. 10.1086/505763.View ArticlePubMedGoogle Scholar
- Linder HP: Plant species radiations: where, when, why?. Phil Trans R Soc B. 2008, 363: 3097-3105. 10.1098/rstb.2008.0075.View ArticlePubMedPubMed CentralGoogle Scholar
- Valente L, Reeves G, Schnitzler J, Pizer Mazon I, Fay M, Rebelo T, Chase M, Barraclough T: Diversification of the African genus Protea in the Cape biodiversity hotspot and beyond: equal rates but different spatial scales. Evolution. 2010, 64 (3): 745-760. 10.1111/j.1558-5646.2009.00856.x.View ArticlePubMedGoogle Scholar
- Rabosky DL: Ecological limits and diversification rate: alternative paradigms to explain the variation in species richness among clades and regions. Ecol Lett. 2009, 12 (8): 735-743. 10.1111/j.1461-0248.2009.01333.x.View ArticlePubMedGoogle Scholar
- Ree RH: Detecting the historical signature of key innovations using stochastic models of character evolution and cladogenesis. Evolution. 2005, 59 (2): 257-265.View ArticlePubMedGoogle Scholar
- Hodges SA, Arnold ML: Spurring plant diversification: Are floral nectar spurs a key innovation?. Proc R Soc Lond B. 1995, 262: 343-348. 10.1098/rspb.1995.0215.View ArticleGoogle Scholar
- Maddison WP, Midford PE, Otto SP: Estimating a binary character's effect on speciation and extinction. Syst Biol. 2007, 56 (5): 701-710. 10.1080/10635150701607033.View ArticlePubMedGoogle Scholar
- Moore BR, Donoghue MJ: A Bayesian approach for evaluating the impact of historical events on rates of diversification. Proc Natl Acad Sci USA. 2009, 106 (11): 4307-4312. 10.1073/pnas.0807230106.View ArticlePubMedPubMed CentralGoogle Scholar
- Kendall DG: On the generalized birth-and-death process. Ann Math Stat. 1948, 19 (1): 1-15. 10.1214/aoms/1177730285.View ArticleGoogle Scholar
- Stadler T: Mammalian phylogeny reveals recent diversification rate shifts. Proc Natl Acad Sci USA. 2011, 108 (15): 6187-6192. 10.1073/pnas.1016876108.View ArticlePubMedPubMed CentralGoogle Scholar
- Rabosky DL: Extinction rates should not be estimated from molecular phylogenies. Evolution. 2010, 64 (6): 1816-1824. 10.1111/j.1558-5646.2009.00926.x.View ArticlePubMedGoogle Scholar
- Bokma F: Bayesian estimation of speciation and extinction probabilities from (in)complete phylogenies. Evolution. 2008, 62 (9): 2441-2445. 10.1111/j.1558-5646.2008.00455.x.View ArticlePubMedGoogle Scholar
- Yang Z, Rannala B: Bayesian phylogenetic inference using DNA sequences: A Markov chain Monte Carlo method. Mol Biol Evol. 1997, 14 (7): 717-724.View ArticlePubMedGoogle Scholar
- Stadler T: On incomplete sampling under birth-death models and connections to the sampling-based coalescent. J Theor Biol. 2009, 261 (1): 58-66. 10.1016/j.jtbi.2009.07.018.View ArticlePubMedGoogle Scholar
- Höhna S, Stadler T, Ronquist F, Britton T: Inferring speciation and extinction rates under different species sampling schemes. Mol Biol Evol. 2011, 28 (9): 2577-2589. 10.1093/molbev/msr095.View ArticlePubMedGoogle Scholar
- Gelman A, Meng X-L: Simulating normalizing constants: from importance sampling to bridge sampling to path sampling. Statistical Science. 1998, 13 (2): 163-185.View ArticleGoogle Scholar
- Lartillot N, Philippe H: Computing Bayes factors using thermodynamic integration. Syst Biol. 2006, 55 (2): 195-207. 10.1080/10635150500433722.View ArticlePubMedGoogle Scholar
- Ogata Y: A Monte-Carlo method for high dimensional integration. Numer Math. 1989, 55 (2): 137-157. 10.1007/BF01406511.View ArticleGoogle Scholar
- Gante HF, Santos CD, Alves MJ: Phylogenetic relationships of the newly described species Chondrostoma olisiponensis (Teleostei: Cyprinidae). J Fish Biol. 2010, 76 (4): 965-974. 10.1111/j.1095-8649.2010.02536.x.View ArticleGoogle Scholar
- Bianco PG: Potential role of the palaeohistory of the Mediterranean and Paratethis basins on the early dispersal of Euro-Mediterranean freshwater Wshes. Ichthyol Explor Freshwaters. 1990, 1: 167-184.Google Scholar
- Bănărescu P: Zoogeography of fresh waters. Volume 2: distribution and dispersal of freshwater animals in North America and Eurasia. 1991, Wiesbaden: AULA-VerlagGoogle Scholar
- Robalo JI, Almada VC, Levy A, Doadrio I: Re-examination and phylogeny of the genus Chondrostoma based on mitochondrial and nuclear data and the definition of 5 new genera. Mol Phylogenet Evol. 2007, 42 (2): 362-372. 10.1016/j.ympev.2006.07.003.View ArticlePubMedGoogle Scholar
- Linder HP: The radiation of the Cape flora, southern Africa. Biol Rev (Camb). 2003, 78: 597-638.View ArticleGoogle Scholar
- Schnitzler J, Barraclough TG, Boatwright JS, Goldblatt P, Manning JC, Powell MP, Rebelo T, Savolainen V: Causes of plant diversification in the Cape biodiversity hotspot of South Africa. Syst Biol. 2011, 60 (3): 343-357. 10.1093/sysbio/syr006.View ArticlePubMedGoogle Scholar
- Python Core Development Team: Python programming language v.2.6.4. 2010, [http://www.python.org/]Google Scholar
- Ascher D, Dubois PF, Hinsen K, Hugunin J, Oliphant T: Numerical Python. UCRL-MA-128569. 2001, Livermore, CA 94566: Lawrence Livermore National LaboratoryGoogle Scholar
- Jones E, Oliphant T, Peterson P: SciPy: Open source scientific tools for Python. 2001, [http://www.scipy.org]Google Scholar
- R Development Core Team: R: A language and environment for statistical computing v.2.13.1. 2011, R Foundation for Statistical Computing, [http://www.r-project.org/]Google Scholar
- Gautier L: rpy2: A simple and efficient access to R from Python. 2008, [http://rpy.sourceforge.net/]Google Scholar
- Rambaut A, Drummond AJ: Tracer v.1.5. 2007, [http://beast.bio.ed.ac.uk/Tracer]Google Scholar
- Drummond AJ, Rambaut A: BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007, 7: 214-10.1186/1471-2148-7-214.View ArticlePubMedPubMed CentralGoogle Scholar
- Yang ZH: PAML 4: Phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007, 24 (8): 1586-1591. 10.1093/molbev/msm088.View ArticlePubMedGoogle Scholar
- Gernhard T: The conditioned reconstructed process. J Theor Biol. 2008, 253 (4): 769-778. 10.1016/j.jtbi.2008.04.005.View ArticlePubMedGoogle Scholar
- Yang ZH, Rannala B: Bayesian estimation of species divergence times under a molecular clock using multiple fossil calibrations with soft bounds. Mol Biol Evol. 2006, 23 (1): 212-226.View ArticlePubMedGoogle Scholar
- Thorne JL, Kishino H: Divergence time and evolutionary rate estimation with multilocus data. Syst Biol. 2002, 51 (5): 689-702. 10.1080/10635150290102456.View ArticlePubMedGoogle Scholar
- Lepage T, Bryant D, Philippe H, Lartillot N: A general comparison of relaxed molecular clock models. Mol Biol Evol. 2007, 24 (12): 2669-2680. 10.1093/molbev/msm193.View ArticlePubMedGoogle Scholar
- Thorne JL, Kishino H, Painter IS: Estimating the rate of evolution of the rate of molecular evolution. Mol Biol Evol. 1998, 15 (12): 1647-1657.View ArticlePubMedGoogle Scholar
- Lartillot N, Philippe H: A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol Biol Evol. 2004, 21 (6): 1095-1109. 10.1093/molbev/msh112.View ArticlePubMedGoogle Scholar
- Kubo T, Iwasa Y: Inferring the rates of branching and extinction from molecular phylogenies. Evolution. 1995, 49 (4): 694-704. 10.2307/2410323.View ArticleGoogle Scholar
- Paradis E: Can extinction rates be estimated without fossils?. J Theor Biol. 2004, 229 (1): 19-30. 10.1016/j.jtbi.2004.02.018.View ArticlePubMedGoogle Scholar
- Rabosky DL: Primary controls on species richness in higher taxa. Syst Biol. 2010, 59 (6): 634-645. 10.1093/sysbio/syq060.View ArticlePubMedGoogle Scholar
- Hastings WK: Monte Carlo sampling methods using Markov chains and their applications. Biometrika. 1970, 57 (1): 97-109. 10.1093/biomet/57.1.97.View ArticleGoogle Scholar
- Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AW, Teller E: Equations of state calculations by fast computing machines. J Chem Phys. 1953, 21 (6): 1087-1091. 10.1063/1.1699114.View ArticleGoogle Scholar
- Jeffreys H: Some tests of significance, treated by the theory of probability. P Camb Philos Soc. 1935, 31 (2): 203-222. 10.1017/S030500410001330X.View ArticleGoogle Scholar
- Kass RE, Raftery AE: Bayes Factors. J Amer Stat Assoc. 1995, 90 (430): 773-795. 10.2307/2291091.View ArticleGoogle Scholar
- Newton MA, Raftery AE: Approximate Bayesian inference with weighted likelihood bootstrap. J Roy Stat Soc B. 1994, 56 (1): 3-48.Google Scholar
- Nylander JAA, Ronquist F, Huelsenbeck J, Nieves-Aldrey JL: Bayesian phylogenetic analysis of combined data. Syst Biol. 2004, 53 (1): 47-67. 10.1080/10635150490264699.View ArticlePubMedGoogle Scholar
- Pagel M, Meade A: Bayesian analysis of correlated evolution of discrete characters by reversible-jump Markov chain Monte Carlo. Am Nat. 2006, 167 (6): 808-825. 10.1086/503444.View ArticlePubMedGoogle Scholar
- Rodrigue N, Aris-Brosou S: Fast Bayesian choice of phylogenetic models: Prospecting data augmentation-based thermodynamic integration. Syst Biol. 2011Google Scholar
- Fan Y, Wu R, Chen M, Kuo L, Lewis P: Choosing among partition models in Bayesian phylogenetics. Mol Biol Evol. 2011, 28 (1): 523-532. 10.1093/molbev/msq224.View ArticlePubMedPubMed CentralGoogle Scholar
- Xie W, Lewis PO, Fan Y, Kuo L, Chen M-H: Improving marginal likelihood estimation for Bayesian phylogenetic model selection. Syst Biol. 2011, 60 (2): 150-160. 10.1093/sysbio/syq085.View ArticlePubMedPubMed CentralGoogle Scholar
- Beerli P, Palczewski M: Unified framework to evaluate panmixia and migration direction among multiple sampling locations. Genetics. 2010, 185 (1): 313-326. 10.1534/genetics.109.112532.View ArticlePubMedPubMed CentralGoogle Scholar
- Hartmann K, Wong D, Stadler T: Sampling trees from evolutionary models. Syst Biol. 2010, 59 (4): 465-476. 10.1093/sysbio/syq026.View ArticlePubMedGoogle Scholar
- Bokma F: Problems detecting density-dependent diversification on phylogenies. Proc R Soc Lond B. 2009, 276 (1659): 993-994. 10.1098/rspb.2008.1249.View ArticleGoogle Scholar
- Rabosky DL, Lovette IJ: Problems detecting density-dependent diversification on phylogenies: reply to Bokma. Proc R Soc Lond B. 2009, 276 (1659): 995-997. 10.1098/rspb.2008.1584.View ArticleGoogle Scholar
- Rambaut A: Phyl-O-Gen. Phylogenetic Tree Simulator Package v.1.1. 2002, University of Oxford, [http://tree.bio.ed.ac.uk]Google Scholar
- Rambaut A, Grassly NC: Seq-Gen: An application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic frees. Comput Appl Biosci. 1997, 13 (3): 235-238.PubMedGoogle Scholar
- Rabosky DL: LASER: A maximum likelihood toolkit for detecting temporal shifts in diversification rates from molecular phylogenies. Evol Bioinform Online. 2006, 2: 257-260.Google Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.