- Research article
- Open Access
Phylogenomic analysis of Copepoda (Arthropoda, Crustacea) reveals unexpected similarities with earlier proposed morphological phylogenies
BMC Evolutionary Biology volume 17, Article number: 23 (2017)
Copepods play a critical role in marine ecosystems but have been poorly investigated in phylogenetic studies. Morphological evidence supports the monophyly of copepods, whereas interordinal relationships continue to be debated. In particular, the phylogenetic position of the order Harpacticoida is still ambiguous and inconsistent among studies. Until now, a small number of molecular studies have been done using only a limited number or even partial genes and thus there is so far no consensus at the order-level.
This study attempted to resolve phylogenetic relationships among and within four major copepod orders including Harpacticoida and the phylogenetic position of Copepoda among five other crustacean groups (Anostraca, Cladocera, Sessilia, Amphipoda, and Decapoda) using 24 nuclear protein-coding genes. Phylogenomics has confirmed the monophyly of Copepoda and Podoplea. However, this study reveals surprising differences with the majority of the copepod phylogenies and unexpected similarities with postembryonic characters and earlier proposed morphological phylogenies; More precisely, Cyclopoida is more closely related to Siphonostomatoida than to Harpacticoida which is likely the most basally-branching group of Podoplea. Divergence time estimation suggests that the origin of Harpacticoida can be traced back to the Devonian, corresponding well with recently discovered fossil evidence. Copepoda has a close affinity to the clade of Malacostraca and Thecostraca but not to Branchiopoda. This result supports the hypothesis of the newly proposed clades, Communostraca, Multicrustacea, and Allotriocarida but further challenges the validity of Hexanauplia and Vericrustacea.
The first phylogenomic study of Copepoda provides new insights into taxonomic relationships and represents a valuable resource that improves our understanding of copepod evolution and their wide range of ecological adaptations.
Copepods represent the largest biomass of all animals on earth [1–3]. They are aquatic animals, primarily marine, and make up the dominant zooplankton assemblages in nearshore environments [2, 3]. In spite of their critical ecological roles, the taxonomic classification has received poor attention. Copepods exhibit extreme morphological diversity and occupy an enormous range of habitats in the aquatic realm, from freshwater to hypersaline, shallow pool, and cave to deep sea environments [4–6]. Humes  described that there are 11,302 species (198 families, 1633 genera; as of the end of 1993) and estimated that a hypothetical total of 75,347 species may exist on the planet . Copepods are also particularly notorious for cryptic speciation [7–10].
Traditionally, there are ten orders of the subclass Copepoda Milne-Edwards, 1840 containing a large different number of families, genera, and species . The morphological phylogenetic analyses of Copepoda have been extensively investigated and there are general agreements such as the monophyletic status of Copepoda [5, 11–14]. Furthermore, copepods can be divided into two infraclasses, Progymnoplea and Neocopepoda . Progymnoplea contains only one order (Platycopioida) and Neocopepoda can be further classified into two superorders, Gymnoplea and Podoplea [5, 12]. For several decades, however, the phylogenetic relationships among the copepod orders have been a matter of controversy [5, 11–17]. Due to an extreme diversity of body forms, the phylogenetic relationships based on traditional morphological data have led to much controversy (see Fig. 1). For example, Ho  and Huys and Boxshall  analyzed 21 and 54 morphological characters across ten copepod orders [5, 11]. They agreed that Platycopioida and Calanoida were the most basal groups (Fig. 1ab). However, the cladogram from Ho  depicted Harpacticoida and Gelyelloida were closely related, but this group was a distinct cluster to the group of Siphonostomatoida, while that of Huys and Boxshall  appeared that Harpacticoida had a close affinity to a sister-group of Siphonostomatoida but a discrete to Gelyelloida. Later, some modifications for the morphological phylogenetic models have been proposed [12, 13]. However, as Ho et al.  pointed out, the inconsistent position of Harpacticoida that represents an important ecological group in aquatic environments has been still problematical .
Furthermore, some molecular-based studies were not congruent with morphological evidence (Fig. 2). Braga et al.  focused on the phylogenetic relationships within the copepod family Euchaetidae and also showed the three copepod orders (Harpacticoida, Calanoida, and Poecilostomatoida with a barnacle, Semibalanus balanoides as an outgroup) using the large subunit ribosomal RNA (28S rRNA) gene (a total aligned sequence length of 484 bp) . The tree appeared to be markedly inconsistent with morphological phylogenies; Harpacticoida was closer to Calanoida than to Poecilostomatoida, which was in conflict to the superorder Podoplea (Fig. 2a). Later, other molecular studies recovered and supported the monophyletic podoplean group using the 18S small subunit ribosomal RNA gene (18S rRNA), but still unresolved the phylogenetic position of Harpacticoida (Fig. 2) [19–22]. Recent study using concatenated twelve mitochondrial genes showed that Harpacticoida (Tigriopus californicus) was more closely related to Siphonostomatoida (Lepeophtheirus salmonis and Caligus rogercresseyi) than Calanoida (Calanus sinicus) (Fig. 2d) . This mitochondrial phylogenetic hypothesis was generally congruent with the majority of the morphological phylogenies [5, 12, 13] except for the phylogenetic position of Poecilostomatoida (Fig. 2d). Moreover, in the 18S rRNA gene trees of Poecilostomatoida, the Clausidiiform complex and the remaining poecilostomatoid taxa appeared to be paraphyletic (Fig. 2e) [21, 24]. Harpacticoida also may be a paraphyletic taxon with Polyarthra (consisting of the families Canuellidae and Longipediidae) and Oligoarthra (all remaining harpacticoid families) [17, 22, 25]. From the 28S rRNA gene tree (505 bp from the v-x region), two Polyarthra taxa (Canuella perplexa and Longipedia gonzalezi) were more closely related to other copepods than to Oligoarthra (Fig. 2F) . All these molecular phylogenetic studies used a relatively short length of the sequences (<2,000 bp) or fast evolving genes that were not acceptable for interordinal relationships (Fig. 2; see details in Discussion).
The purpose of the present study was therefore to clarify the phylogenetic relationships among four major orders of copepods using phylogenomics, the inference of phylogenetic relationships using genome-scale data which has increasingly become a powerful tool to resolve difficult phylogenetic questions [26–30]. In particular, the aim was to include the following: 1) an extensive analysis of the phylogenetic position of Harpacticoida and to evaluate all possible phylogenetic hypotheses; 2) the phylogenetic relationships of copepods among other crustacean groups; and 3) the divergence times of the major copepod orders. Accordingly, the orthologous sequences of 24 nuclear protein-coding genes were retrieved from 18 arthropod species representing four copepod orders (nine species), five other crustaceans (Anostraca, Cladocera, Thecostraca, Amphipoda, and Decapoda), two insects, and two closely related outgroups (Myriapoda and Chelicerata). This study was the first report that provides a rich taxon sampling with genomics-based evidence focusing on the evolution of copepods and their divergence time. Thus, for an ecological perspective, understanding the phylogenetic relationships of copepods would have provided a first step toward elucidating an ecological interaction, habitat colonization, and speciation in Copepoda.
Taxonomic sampling and identification of orthologous genes
The genome and transcriptome assemblies for 18 arthropod species were obtained from multiple sources (see below). For eight copepod species, five transcriptome (Caligus rogercresseyi, Lernaea cyprinacea, Tigriopus californicus, Calanus sinicus, and Acartia fossae) and three genome sequences (Lepeophtheirus salmonis, Mesocyclops edax, and Calanus finmarchicus) were downloaded from the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) database (http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/sra) [31, 32]. Three additional crustacean species were also included: the giant tiger prawn Penaeus monodon (Malacostraca: Penaeidae), the purple barnacle Amphibalanus amphitrite (Thecostraca: Balanidae), and the brine shrimp Artemia franciscana (Branchiopoda: Artemiidae) from NCBI SRA. Three other crustacean genome sequences (two copepods and an amphipod; Eurytemora affinis, Tigriopus californicus, and Hyalella azteca) were downloaded from Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), as a part of the pilot project for the i5K arthropod genomes project . Among these crustacean species examined, none of the orthologous sequences for the 24 nuclear protein-coding sequences (see below) was identified in Calanus finmarchicus which was excluded from further analysis. Also, the orthologous sequences were further retrieved from the non-redundant (NR) protein database at NCBI (http://www.ncbi.nlm.nih.gov). All orthologous sequences from the copepod Acanthocyclops vernalis were obtained from NCBI NR database. All orthologous sequences identified in this study and the GenBank accession numbers were summarized in Additional file 1: Table S1. In addition to the crustacean species mentioned above, five publicly released genomes were added in this study. These sequences of the water flea Daphnia pulex (Branchiopoda), the fruit fly (Drosophila melanogaster), the red flour beetle (Tribolium castaneum), the centipede Strigamia maritima (Myriapoda), and blacklegged tick Ixodes scapularis (Chelicerata) were downloaded from the wFleaBase (http://wfleabase.org), FlyBase (http://flybase.org), BeetleBase (http://beetlebase.org), BCM-HGSC (https://www.hgsc.bcm.edu), and VectorBase (https://www.vectorbase.org), respectively [34–37].
The previously reported nuclear protein-coding genes that were used for the phylogenetic analysis were retrieved as search queries. These sequences were obtained from Regier et al.  and Wiegmann et al. . The orthologous genes were defined by the Basic Local Alignment Search Tool (BLAST, ver. 2.2.30+) programs [39, 40]. The E-value threshold of 1 × 10−30 with the database size 1.4 × 1010 was used to identify orthologous candidates against the genome and transcriptome assemblies. The putative orthologous genes were verified by searches using tblastn against NCBI NR database. After partial sequences or no apparent orthologs were excluded from the analysis, 24 nuclear protein-coding genes were then determined in more than half of the copepod species (Additional file 1: Table S1). All identified copepod protein and nucleotide sequences are provided in Additional files 2 and 3.
Multiple sequence alignments
Multiple alignments of each of the protein gene families were generated using mafft (ver. 7.245)  with the L-INS-i algorithm (1,000 maxiterate and 100 retree) which uses a consistency-based objective function and local pairwise alignment with affine gap costs. Alignments were adjusted manually when necessary. Poorly aligned regions with more than 70% of gaps were removed using trimAl (ver. 1.2) . The corresponding coding nucleotide alignments were generated using PAL2NAL . The single gene sequence alignments are available in: http://bioinformatics.unl.edu/eyun/Copepoda_Phylogenomics. All sequences were concatenated using a custom Perl script (ConCat_seq.pl). This Perl script is available upon request from the author. The concatenated dataset used in this study is available in Additional file 4.
Phylogenetic analysis and alternative topology tests
Phylogenetic relationship using the concatenated sequences was reconstructed by the maximum-likelihood (ML) method with the Le and Gascuel (LG) matrix, gamma distributed rates, invariant sites, and the observed amino acid frequencies using PhyML (ver. 3.1) [44–46]. The best-fit model for the concatenated dataset was selected using the Akaike Information Criterion (AIC) as a statistical tool in ProtTest (ver. 3.2) . Non-parametric bootstrapping with 1000 pseudo-replicates was used to estimate the confidence of branching patterns for the ML phylogeny . Bayesian inferences (BI) of phylogeny were performed using MrBayes (ver. 3.2.6)  with the LG substitution model, gamma-distributed rate variation, and invariant sites. The Markov chain Monte Carlo search was run for 5 × 106 generations, with a sampling frequency of 103, using three heated and one cold chain and with a burn-in of 103 trees.
The phylogenetic trees were also reconstructed using the “degen-1” coding sequences, in which nucleotides at any codon position that have the potential of synonymous substitutions were degenerated . To produce the degenerated synonymous matrices (the “degen-1” coding sequences) , the Perl script (Degen_v1_4.pl) written by Andreas Zwick and April Hussey was used (http://www.phylotools.com). For the morphological reanalysis, the data matrix of 54 morphological characters was obtained from Ho et al. . This morphological data matrix in Nexus format is available in Additional file 5. Phylogenetic inference of the morphological data was conducted with MrBayes (ver. 3.2.6) , using the Mk (Markov K) model , a variable rate among characters (“rates = gamma”), and 5 × 108 generations. The Mk model assumes equal state frequencies. In this analysis, trees were sampled every 103 generations with the first 25% discarded as burn-in and summarized using a 50% majority rule consensus tree. Presentation of the phylogenies was done with FigTree (ver. 1.4.2) (http://tree.bio.ed.ac.uk/software/figtree).
The Kishino-Hasegawa (KH) , the Shimodaira-Hasegawa (SH) , and the Approximately Unbiased (AU)  tests were used to statistically assess the phylogenetic hypotheses. The site log-likelihood of each tree was calculated in TREE-PUZZLE (ver. 5.3.rc16) , and KH, SH, and AU tests were performed in CONSEL (ver. 0.20) with default options .
Divergence time estimation
The divergence times of lineages were estimated using BEAST2 (ver. 2.4.3)  with Bayesian inference using the calibrated Yule model for the tree prior and the uncorrelated relaxed clock model proposed by Drummond et al. . BEAST2 was using a random tree with 5 × 107 generations and a sample frequency of 5 × 103 generations. Four fossil-based minimum ages were applied for the major splits; 497 MYA for the Diptera-Cladocera divergence, 405 MYA for the Cladocera-Anostraca divergence, 313.7 MYA for the Diptera-Coleoptera divergence, and 358.5 MYA for the Amphipoda-Decapoda divergence . The fossil record of Wujicaris muelleri Zhang et al.  was also used as the minimum constraint on the crown group of Pancrustacea [58–60].
Monophyly of copepods and their interordinal relationships
24 nuclear protein-coding genes were obtained from 18 arthropod species including nine copepod species (four major orders of copepods) (Additional file 1: Table S1). The common names of species examined with the current taxonomic classification were listed in Table 1. Among 18 arthropods, two non-pancrustacean taxa, the centipede Strigamia maritima and blacklegged tick Ixodes scapularis, were used as the outgroups [28, 61]. All 24 nuclear protein-coding sequences were concatenated for further phylogenetic analyses (see details in Methods). The data set of the concatenated sequences consisted of 16,710 amino acid sequences (50,106 bp). The phylogenetic relationships obtained from the concatenated sequences were reconstructed by the maximum-likelihood (ML) and Bayesian inferences (BI). The two algorithms confirmed the monophyly of copepods with 100% bootstrap values (Fig. 3). The nine copepod species examined can be classified into two superorders, Gymnoplea (Calanoida) and Podoplea (Siphonostomatoida, Cyclopoida, and Harpacticoida) (Fig. 1; see Discussion). The phylogenomic analyses with ML and BI generated the same topologies supporting the monophyly of the podoplean group. The superorder Podoplea was strongly supported with the high maximum likelihood bootstrap value (MLB = 100%) and Bayesian posterior probability (BPP = 1.00) (Fig. 3).
Most notably, within Podoplea, the interordinal relationships inferred from the phylogenomic analysis differed from that of the widely accepted hypothesis presented in the majority of the morphological and molecular phylogenetic studies that Harpacticoida was generally affiliated with Siphonostomatoida rather than with Cyclopoida [5, 12, 19, 21, 23] (Figs. 1 and 2; see Discussion). In addition to order level relationships in this study, all family and genus level relationships were also clearly resolved by high bootstrap values (MLB = 100% and BPP = 1.00) (Fig. 3). This study included three families for Calanoida: ((Acartiidae, Temoridae), Calanidae) and three genera in Cyclopoida: ((Acanthocyclops, Mesocyclops), Lernaea).
Phylogenetic position of Harpacticoida
To confirm the phylogenetic position of Harpacticoida (Tigriopus californicus, Oligoarthra), four different phylogenetic analyses were attempted; 1) Bayesian reestimation of morphological characters, 2) the “degen-1” coding sequences of Regier et al. , 3) small-scale phylogeny dealing with only nine copepod species with two closely related outgroups, and 4) statistical analyses were performed for all possible trees (three topologies here). First, the morphological phylogeny was reconstructed by Bayesian inferences (Additional file 6: Figure S1). Bayesian analysis using 54 morphological characters showed the same topology with that of Ho et al.  which was reconstructed by maximum parsimony. This tree yielded mostly congruent results with the majority of the copepod phylogeny above. However, all posterior probabilities with the morphology-only data set showed a general lack (BPP < 0.80) of support except for three nodes (indicated by boldfaces in Additional file 6: Figure S1). This suggested that these morphological data might not have sufficient phylogenetic signal. For the second and third attempts, the phylogenetic trees were reconstructed using the “degen-1” coding sequences and from only nine copepod species with Amphibalanus amphitrite (Sessilia) and Penaeus monodon (Decapoda) as the outgroups. Both the phylogenetic approaches returned the same topology as that obtained from Fig. 3 (Additional files 7 and 8: Figures S2 and S3). The phylogenetic relationships were less resolved (>64% MLB for the clade of Siphonostomatoida and Cyclopoida) using the “degen-1” nucleotide dataset than that of Fig. 3, but better resolved in the small-scale phylogeny by high bootstrap values (>76% MLB for that clade) (Additional files 7 and 8: Figures S2 and S3). Lastly, to evaluate those previously proposed hypotheses shown in Figs. 1 and 2, statistical analyses were performed using TREE-PUZZLE (ver. 5.3.rc16) and CONSEL (ver. 0.20) [54, 55] (see in Methods). In Table 2, the first hypothesis, as mentioned earlier, was obtained from the majority of the copepod phylogeny. The second hypothesis was the best maximum likelihood tree obtained from the concatenated PhyML tree in this study. The third hypothesis was a theoretical tree, in which Harpacticoida was closely related to Cyclopoida. All statistical tests rejected the third hypothesis. Although the KH test (P = 0.109) and the SH test (P = 0.479) were unable to reject the first hypothesis, the AU topology test was marginally rejected (P = 0.085) at the 0.10 level of significance. This was most likely due to the conservative nature of the KH test and the SH test. The KH test was invalid in this case because the second hypothetical tree was the best ML tree . The SH test is the most conservative estimate and is sensitive to the unlikely tree (i.e., the third hypothesis in Table 2) . Among the three tests, the AU test is known as the best approach to overcome these problems . Thus, the results of the statistical test supported that the most likely phylogenetic scenario is the second hypothesis. Taken together, these results strongly suggested that Siphonostomatoida was closer to Cyclopoida than Harpacticoida.
Copepoda is a sister group to Communostraca
According to the present phylogenomic analysis, the resulting trees revealed that Copepoda was a sister lineage to a group of Thecostraca and Malacostraca but distinct to Branchiopoda (Fig. 3 and Additional file 7: Figure S2), consistent with results from Regier et al.  and Oakley et al. . Both ML and BI inferred the following interclass relationships: ((Insecta, Branchiopoda), (Copepoda, (Thecostraca, Malacostraca))). The purple barnacle A. amphitrite (Sessilia: Balanidae) was considered to be a sister group to copepods, namely Maxillopoda. In this study, however, this species appeared to be a sister group to the group (Malacostraca) of H. azteca (Amphipoda) and P. monodon (Decapoda), but distinctly related to copepods (Fig. 3 and Additional file 7: Figure S2). The MLB and BPP values for the clade of Sessilia, Amphipoda, and Decapoda were highly supported (MLB > 87% and BPP = 1.00) (Fig. 3 and Additional file 7: Figure S2). Therefore, this study supported the newly proposed clade, Communostraca (common shelled ones) that includes Malacostraca (e.g., crabs or shrimp) and Thecostraca (e.g., barnacles) and the newly proposed clade, Multicrustacea (Copepoda, Malacostraca, and Thecostraca) with high support values (MLB > 71% and BPP = 1.00)  (See Discussion).
This study also supported a proposed clade of Insecta and Branchiopoda, consistent with results from Oakley et al.  representing the Allotriocarida (Hexapoda/Branchiopoda/Remipedia) clade . A very recent study also supported the monophyly of Allotriocarida . Branchiopoda (a group of Cladocera and Anostraca) was considered to belong to the subphylum Crustacea. However, this group was more closely related to Insecta but distinct to all other crustaceans examined in this study. Although the MLB value for the clade of Allotriocarida (Insecta and Branchiopoda) was not very strong (>73% in Fig. 3 and > 62% in Additional file 7: Figure S2), this hypothesis is often congruent with those obtained from recent studies [27, 30, 64, 65]. Therefore, the phylogenetic trees in this study supported the hypotheses of the three newly proposed clades, Communostraca, Multicrustacea, and Allotriocarida, but challenged the validity of Hexanauplia and Vericrustacea (See Discussion).
Estimation of divergence time in Copepoda
Divergence times were estimated using BEAST2 (ver. 2.4.3)  with Bayesian inference. The tree topology was the same as the PhyML tree shown in Fig. 3. Divergence between the groups of podopleans and gymnopleans was estimated to have occurred during the period from the late Cambrian to the Devonian (446.2 ± 47.3 MYA). The origin of T. californicus appeared to have occurred in the Devonian (between the late Silurian and the early Carboniferous, 381.4 ± 51.1 MYA) (Fig. 4). The divergence time between the two orders Siphonostomatoida and Cyclopoida occurred in the Carboniferous (351.8 ± 58.1 MYA) which predated approximately the origin of Harpacticoida (Fig. 4). Seven extant families in this analysis arose before the Cenozoic era and possibly prior to the early stage of breakup of Gondwana .
The present study provides the first phylogenomic evidence to support the monophyletic origin of four major orders of copepods and the group of podopleans. The monophyletic status of Copepoda has been broadly accepted by both morphological [5, 14] and large-scale phylogenomic analyses [28–30]. Although this study does not include all copepod orders, there can be no doubt of the monophyly of copepods. The subclass Copepoda consists of two infraclasses, Progymnoplea and Neocopepoda, suggested by Huys and Boxshall . The infraclass Neocopepoda can be further divided into two superorder groups, Gymnoplea and Podoplea (Fig. 1). The concept of this classification was proposed by Giesbrecht  and became generally accepted [5, 12, 68]. However, the naupliar musculature and the molecular phylogeny using partial nuclear 28S rRNA gene (a total aligned sequence length of 484 bp from the D9/D10 region) (Fig. 2A) showed conflicting results and suggested a possible paraphyletic origin of podopleans [15, 18]. Later, morphological [13, 14] and molecular [19, 20, 23] phylogenetic analyses recovered the monophyly of podopleans. In this study, the phylogenomic analysis shows that three podoplean copepod orders are clearly clustered as a monophyletic clade (supported by high bootstrap values, MLB > 99% and BPP = 1.00) (Fig. 3 and Additional files 7 and 8: Figures S2 and S3).
Unexpectedly, the current phylogenomic evidence is in conflict to the majority of the copepod phylogenies (Figs. 1 and 2; see Results). The present schematic phylogeny resemble those found in the earlier phylogenies and postembryonic data [11, 14, 68] which show that Calanoida represents the most basal split among the four copepod orders and that Harpacticoida is the basally-branching group of Podoplea. On the basis on postembryonic apomorphies, naupliar characters can be represented by plesiomorphic states because postembryonic stages (both early and later) provide a valuable resource for evolutionary history . His study implied that Harpacticoida is the more basally-branching group than Misophrioida within podopleans, which is hardly reported in previous studies [5, 11–13, 15, 17]. Interestingly, our preliminary survey based on weighted morphological characters after removing the convergent characters appears that Harpacticoida is the most basally-branching podoplean group (Eyun et al., unpublished data). For example, some morphological characters support the current phylogenomic phylogeny; following the characters from Huys and Boxshall , character 11 (male antennulary segment XXIII), character 21 (outer seta on basis of maxillule), and character 54 (seta b on exopod of male fifth leg). These morphological characters can be the candidates to investigate the order-level relationships of copepods and morphological transitions (e.g., character 21). Based on character 54 which is absent of in Harpacticoida but is present in Misophrioida and many other podopleans, Harpacticoida seems to be the most basally-branching group within Podoplea. Furthermore, as keenly pointed out by Ho , some characters such as character 13 (male antennulary segments XXIV and XXV), character 29 (praecoxal seta on maxilliped), and character 39 (number of setae on inner margin of second endopodal segment of first swimming leg) are confirmed as convergent characters in this study. These implies that differential weighting criteria for the morphological phylogeny  and the removal of convergent characters can reduce the phylogenetic noise. In fact, from the preliminary survey removing the convergent characters, the posterior probabilities in Bayesian phylogenetic inference are increased (Eyun et al., unpublished data).
Recent studies have given rise to a new taxonomic classification of Copepoda. Although many progresses have been made toward unraveling the phylogeny and taxonomy of Copepoda, there is so far no consensus of their order-level classification. This should be due to their extreme morphological diversity and a lack of genetic information. Huys and Boxshall  summarized ten copepod orders . Ho et al.  proposed a new order, Thaumatopsylloida because the family Thaumatopsyllidae was a distinct group from the order Cyclopoida and differed from Monstrilloida and Siphonostomatoida . Boxshall and Halsey  suggested that Poecilostomatoida was merged into Cyclopoida . Huys et al. , Minxiao et al. , and Huys et al.  supported this view (but as a sister group) using 18S rRNA and the concatenated twelve mitochondrial genes [19, 23, 24]. Another molecular sequence study using 18S rRNA (a total aligned sequence length of about 1,941 bp) suggested that the order Monstrilloida (indicated by grey in Figs. 1 and 2) was nested within a fish-parasitic clade of the order Siphonostomatoida and thus was considered as the subgroup of Siphonostomatoida . The 18S rRNA gene and 28S rRNA gene trees showed that Poecilostomatoida and Harpacticoida were paraphyletic, respectively (Fig. 2EF) [21, 22, 24].
Some studies have argued that adding more sequences is more important than adding taxa for improved phylogenetic accuracy [70, 71] (but see  for the benefits of adding taxa). Indeed, in copepods, insufficient and only partial sequences have been used and showed a limitation for certain order-level [21, 73, 74]. Blanco-Bercial et al.  discussed that the use of a single gene at the family or superfamily level of copepods contributed to the disparate results, and the relationships in the superfamily Centropagoidea (Order Calanoida) were still unresolved using the four concatenated genes (18S rRNA, 28S rRNA, cytochrome c oxidase subunit I, and cytochrome b) . Therefore, the phylogenomic approach will make notable contributions to a better resolution of copepod evolution and then can be anchored to certain taxonomic clades. Furthermore, the resulting phylogenomic tree can provide an independent test of morphological character homology and can help to determine the assumptions of plesiomorphic or apomorphic characters and the convergent or homoplastic characters, which are considered as the most difficult issue for copepod taxonomy [5, 11, 12].
The class Maxillopoda (Phylum Arthropoda) is one of the most diverse groups of crustaceans including copepods, barnacles, and a number of related animals (such as a branchiuran fish louse and tongue worms) . However, the monophyly of Maxillopoda seemed increasingly doubtful and the maxillopodan concept became obsolete due to the phylogenetic studies of the Arthropoda [27–30, 61, 76, 77]. These studies appear in the polyphyly of Maxillopoda. In addition, the phylogenetic position of copepods in relationship to other crustacean groups has been controversial, resulting a particularly ambiguous resolution of Copepoda, Thecostraca, Malacostraca, and Branchiopoda. Therefore, the phylogenetic relationships among crustaceans are still far from being resolved [78, 79]. Recent phylogenomic studies advocate a new taxonomic nomenclature for the crustacean groups. Regier et al.  and Oakley et al.  proposed several crustacean classifications; Communostraca (Malacostraca, Thecostraca), Multicrustacea (Copepoda, Malacostraca, and Thecostraca), and Vericrustacea (Copepoda, Malacostraca, Thecostraca, and Branchiopoda)  and Allotriocarida (Hexapoda, Remipedia, Cephalocarida, and Branchiopoda) and Hexanauplia (Copepoda and Thecostraca) . From the currently inferred phylogenies including six crustacean groups (Cladocera, Anostraca, Copepoda, Sessilia, Amphipoda, and Decapoda), the tree supports well the hypothesis of the three newly proposed clades, Communostraca, Multicrustacea, and Allotriocarida. However, this study challenges the validity of Hexanauplia and Vericrustacea, corroborating those obtained from other phylogenomic analyses (Fig. 3 and Additional file 7: Figure S2) [27, 29, 30].
This study confirms that the rapidly evolving genes tend to generate the phylogenetic noise  and that the slower evolving genes contain more informative positions . For instance, the phylogenies using a single gene tree from 6-phosphogluconate dehydrogenase, carbamoylphosphate synthetase, and alanyl-tRNA synthetase show the non-monophyly of Copepoda (Additional file 9: Figure S4). This may be due to incomplete sequences of genes which are not identified to cover the intact region in this study but also to a relatively high level of sequence variation. Regier et al.  also categorizes these genes as the fast evolving genes (the gene numbers: 11, 19, and 23) . Therefore, the phylogenetic signals from the fast evolving genes could generate misleading effects in evolutionary studies . Note that, however, the copepod topology after excluding these genes is same as the one shown above (data not shown).
Divergence between the groups of podopleans and gymnopleans is estimated to have occurred in the very late Ordovician. This implies that the origin of copepods may be earlier (probably Cambrian age) than this period [82, 83]. It is because all copepod taxa in this study belong to the Infraclass Neocopepoda, and Platycopioida (the other infraclass Progymnoplea) is known to be the most primitive group of copepods and possibly closer to the ancestral form [5, 12]. Only few fossil records of copepods are available because of their fragile nature and thus having a very low level of potential fossilization. Divergence time estimations in this study are in good agreement with these known fossil records [84–87]. Recently, a new fossil of freshwater harpacticoids (most likely Canthocamptidae) has been found in carboniferous bitumen, dating back to at least 303 MYA . Interestingly, the origin of T. californicus assumed in this study is almost congruent with this fossil record (Fig. 4). The family Canthocamptidae is the largest group (>600 species) of harpacticoids and predominately inhabit fresh water . Boxshall and Jaume  speculated that harpacticoids invaded fresh waters on Pangaea based on the pattern of colonization of continental waters. This study supports this hypothesis by molecular sequence analysis. To study the adaptation on the different types of environments (e.g., cave or groundwater) and the timing of colonization events, a strong phylogenetic hypothesis must be established. For the future, comparative genomics of copepod species will help us understanding their evolutionary history and shed light on a wide range of ecological adaptations.
A series of molecular phylogenetic analyses of nine copepod species with five other crustacean groups, two hexapods, and two outgroups (myriapod and spider) is presented using the 24 orthologous nuclear protein-coding genes. Given the phylogeny, this hypothesis provides an overview of the useful directions for future studies and thus will shed a light into new taxonomic investigations. As more sequences become available in the near future, further studies with more comprehensive taxa are essential to evaluate the various hypotheses as well as fully resolve the evolutionary history and taxonomy of Copepoda. Also, some copepod orders (e.g., Thaumatopsylloida, Monstrilloida, and some groups of Poecilostomatoida and Harpacticoida) need to be refined by further phylogenomic studies. The large scale of molecular data such as genomes and transcriptomes of copepods provides us a valuable resource for understanding copepod evolution and a wide range of ecological adaptations.
Bayesian posterior probability
Le and Gascuel
Maximum likelihood bootstrap value
Humes AG. How many copepods? Hydrobiologia. 1994;292/293:1–7.
Mauchline J. The Biology of Calanoid Copepods. Adv Mar Biol. 1998;33:1–710.
Verity P, Smetacek V. Organism life cycles, predation, and the structure of marine pelagic ecosystems. Mar Ecol Prog Ser. 1996;130:277–93.
Hardy A: The Open Sea. It's Natural History: The World of Plankton: Collins. London: Houghton Mifflin Company; 1956.
Huys R, Boxshall GA. Copepod Evolution. London: The Ray Society; 1991.
Martin JW, Davis GE. An updated classification of the recent Crustacea. Nat Hist Mus Los Angel Cty Sci Ser. 2001;39:1–124.
Lee CE. Global phylogeography of a cryptic copepod species complex and reproductive isolation between genetically proximate "populations". Evolution. 2000;54:2014–27.
Goetze E. Cryptic speciation on the high seas; global phylogenetics of the copepod family Eucalanidae. Proc R Soc Lond B Biol Sci. 2003;270:2321–31.
Eyun S, Lee Y-H, Suh H-L, Kim S, Soh HY. Genetic Identification and Molecular Phylogeny of Pseudodiaptomus Species (Calanoida, Pseudodiaptomidae) in Korean Waters. Zoolog Sci. 2007;24:265–71.
Chen G, Hare MP. Cryptic diversity and comparative phylogeography of the estuarine copepod Acartia tonsa on the US Atlantic coast. Mol Ecol. 2011;20:2425–41.
Ho J-S. Phylogenetic Analysis of Copepod Orders. J Crustac Biol. 1990;10:528–36.
Ho J-S. Copepod phylogeny: a reconsideration of Huys & Boxshall's 'parsimony versus homology'. Hydrobiologia. 1994;292/293:31–9.
Ho J-S, Dojiri M, Gordon H, Deets GB. A New Species of Copepoda (Thaumatopsyllidae) Symbiotic with a Brittle star from California, U.S.A., and Designation of a New Order Thaumatopsylloida. J Crustac Biol. 2003;23:582–94.
Dahms H-U. Postembryonic Apomorphies Proving the Monophyletic Status of the Copepoda. Zool Stud. 2004;43:446–53.
Dussart BH: A propos du répertoire mondial des Calanoïdes des eaux continentales. Crustaceana 1984;(Suppl 7):25–31.
Boxshall GA, Halsey SH. An Introduction to Copepod Diversity. London: The Ray Society; 2004.
Por FD: Canuellidae Lang (Harpacticoida, Polyarthra) and the Ancestry of the Copepoda. Crustaceana 1984;(Suppl 7):1–24.
Braga E, Zardoya R, Meyer A, Yen J. Mitochondrial and nuclear rRNA based copepod phylogeny with emphasis on the Euchaetidae (Calanoida). Mar Biol. 1999;133:79–90.
Huys R, Llewellyn-Hughes J, Olson PD, Nagasawa K. Small subunit rDNA and Bayesian inference reveal Pectenophilus ornatus (Copepoda incertae sedis) as highly transformed Mytilicolidae, and support assignment of Chondracanthidae and Xarifiidae to Lichomolgoidea (Cyclopoida). Biol J Linn Soc. 2006;87:403–25.
Huys R, Llewellyn-Hughes J, Conroy-Dalton S, Olson PD, Spinks JN, Johnston DA. Extraordinary host switching in siphonostomatoid copepods and the demise of the Monstrilloida: integrating molecular data, ontogeny and antennulary morphology. Mol Phylogenet Evol. 2007;43:368–78.
Tung C-H, Cheng Y-R, Lin C-Y, Ho J-S, Kuo C-H, Yu J-K, Su Y-H. A New Copepod With Transformed Body Plan and Unique Phylogenetic Position Parasitic in the Acorn Worm Ptychodera flava. Biol Bull. 2014;226:69–80.
Schizas NV, Dahms H-U, Kangtia P, Corgosinho PHC, Galindo Estronza AM. A new species of Longipedia Claus, (Copepoda: Harpacticoida: Longipediidae) from Caribbean mesophotic reefs with remarks on the phylogenetic affinities of Polyarthra. Mar Biol Res. 1863;2015(11):789–803.
Minxiao W, Song S, Chaolun L, Xin S. Distinctive mitochondrial genome of Calanoid copepod Calanus sinicus with multiple large non-coding regions and reshuffled gene order: Useful molecular markers for phylogenetic and population studies. BMC Genomics. 2011;12:73.
Huys R, Fatih F, Ohtsuka S, Llewellyn-Hughes J. Evolution of the bomolochiform superfamily complex (Copepoda: Cyclopoida): New insights from ssrDNA and morphology, and origin of umazuracolids from polychaete-infesting ancestors rejected. Int J Parasitol. 2012;42:71–92.
Dahms H-U. Exclusion of the Polyarthra from Harpacticoida and its reallocation as an underived branch of the Copepoda (Arthropoda, Crustacea). Invertebr Zool. 2004;1:29–51.
Regier JC, Shultz JW, Ganley ARD, Hussey A, Shi D, Ball B, Zwick A, Stajich JE, Cummings MP, Martin JW, et al. Resolving Arthropod Phylogeny: Exploring Phylogenetic Signal within 41 kb of Protein-Coding Nuclear Gene Sequence. Syst Biol. 2008;57:920–38.
Meusemann K, von Reumont BM, Simon S, Roeding F, Strauss S, Kück P, Ebersberger I, Walzl M, Pass G, Breuers S, et al. A Phylogenomic Approach to Resolve the Arthropod Tree of Life. Mol Biol Evol. 2010;27:2451–64.
Regier JC, Shultz JW, Zwick A, Hussey A, Ball B, Wetzer R, Martin JW, Cunningham CW. Arthropod relationships revealed by phylogenomic analysis of nuclear protein-coding sequences. Nature. 2010;463:1079–83.
von Reumont BM, Jenner RA, Wills MA, Dell’Ampio E, Pass G, Ebersberger I, Meyer B, Koenemann S, Iliffe TM, Stamatakis A, et al. Pancrustacean Phylogeny in the Light of New Phylogenomic Data: Support for Remipedia as the Possible Sister Group of Hexapoda. Mol Biol Evol. 2012;29:1031–45.
Oakley TH, Wolfe JM, Lindgren AR, Zaharoff AK. Phylotranscriptomics to Bring the Understudied into the Fold: Monophyletic Ostracoda, Fossil Placement, and Pancrustacean Phylogeny. Mol Biol Evol. 2013;30:215–33.
Mojib N, Amad M, Thimma M, Aldanondo N, Kumaran M, Irigoien X. Carotenoid metabolic profiling and transcriptome-genome mining reveal functional equivalence among blue-pigmented copepods and appendicularia. Mol Ecol. 2014;23:2740–56.
Eyun S, Soh HY, Posavi M, Munro J, Hughes DST, Murali SC, Qu J, Dugan S, Lee SL, Chao H, et al. Evolutionary history of chemosensory-related gene families across the Arthropoda. Mol Biol Evol. Accepted pending major revision.
i5K Consortium. The i5K Initiative: Advancing Arthropod Genomics for Knowledge, Human Health, Agriculture, and the Environment. J Hered. 2013;104:595–600.
Chipman AD, Ferrier DEK, Brena C, Qu J, Hughes DST, Schröder R, Torres-Oliva M, Znassi N, Jiang H, Almeida FC, et al. The First Myriapod Genome Sequence Reveals Conservative Arthropod Gene Content and Genome Organisation in the Centipede Strigamia maritima. PLoS Biol. 2014;12:e1002005.
Colbourne JK, Pfrender ME, Gilbert D, Thomas WK, Tucker A, Oakley TH, Tokishita S, Aerts A, Arnold GJ, Basu MK, et al. The Ecoresponsive Genome of Daphnia pulex. Science. 2011;331:555–61.
Tribolium Genome Sequencing Consortium. The genome of the model beetle and pest Tribolium castaneum. Nature. 2008;452:949–55.
Adams MD, Celniker SE, Holt RA, Evans CA, Gocayne JD, Amanatides PG, Scherer SE, Li PW, Hoskins RA, Galle RF, et al. The Genome Sequence of Drosophila melanogaster. Science. 2000;287:2185–95.
Wiegmann B, Trautwein M, Kim J-W, Cassel B, Bertone M, Winterton S, Yeates D. Single-copy nuclear genes resolve the phylogeny of the holometabolous insects. BMC Biol. 2009;7:34.
Altschul SF. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinf. 2009;10:1–9.
Katoh K, Standley DM. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol Biol Evol. 2013;30:772–80.
Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–3.
Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34:W609–12.
Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, Gascuel O. New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0. Syst Biol. 2010;59:307–21.
Yang Z. Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods. J Mol Evol. 1994;39:306–14.
Le SQ, Gascuel O. An Improved General Amino Acid Replacement Matrix. Mol Biol Evol. 2008;25:1307–20.
Abascal F, Zardoya R, Posada D. ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005;21:2104–5.
Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985;39:783–91.
Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Höhna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP. MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice across a Large Model Space. Syst Biol. 2012;61:539–42.
Lewis PO. A likelihood approach to estimating phylogeny from discrete morphological character data. Syst Biol. 2001;50:913–25.
Kishino H, Hasegawa M. Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in hominoidea. J Mol Evol. 1989;29:170–9.
Shimodaira H, Hasegawa M. Multiple Comparisons of Log-Likelihoods with Applications to Phylogenetic Inference. Mol Biol Evol. 1999;16:1114–6.
Shimodaira H. An Approximately Unbiased Test of Phylogenetic Tree Selection. Syst Biol. 2002;51:492–508.
Schmidt HA, Strimmer K, Vingron M, von Haeseler A. TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics. 2002;18:502–4.
Shimodaira H, Hasegawa M. CONSEL: for assessing the confidence of phylogenetic tree selection. Bioinformatics. 2001;17:1246–7.
Bouckaert R, Heled J, Kühnert D, Vaughan T, Wu C-H, Xie D, Suchard MA, Rambaut A, Drummond AJ. BEAST 2: A Software Platform for Bayesian Evolutionary Analysis. PLoS Comput Biol. 2014;10:e1003537.
Drummond AJ, Ho SYW, Phillips MJ, Rambaut A. Relaxed Phylogenetics and Dating with Confidence. PLoS Biol. 2006;4:e88.
Wolfe JM, Daley AC, Legg DA, Edgecombe GD. Fossil calibrations for the arthropod Tree of Life. Earth-Sci Rev. 2016;160:43–110.
Zhang X-g, Maas A, Haug JT, Siveter DJ, Waloszek D. A Eucrustacean Metanauplius from the Lower Cambrian. Curr Biol. 2010;20:1075–9.
Lee Michael SY, Soubrier J, Edgecombe Gregory D. Rates of Phenotypic and Genomic Evolution during the Cambrian Explosion. Curr Biol. 2013;23:1889–95.
Giribet G, Edgecombe GD, Wheeler WC. Arthropod phylogeny based on eight molecular loci and morphology. Nature. 2001;413:157–61.
Strimmer K, Rambaut A. Inferring confidence sets of possibly misspecified gene trees. Proc R Soc Lond B Biol Sci. 2002;269:137–42.
Lozano-Fernandez J, Carton R, Tanner AR, Puttick MN, Blaxter M, Vinther J, Olesen J, Giribet G, Edgecombe GD, Pisani D: A molecular palaeobiological exploration of arthropod terrestrialization. Philos Trans R Soc Lond B Biol Sci 2016;371(1699). doi:10.1098/rstb.2015.0133.
Kashiyama K, Seki T, Numata H, Goto SG. Molecular Characterization of Visual Pigments in Branchiopoda and the Evolution of Opsins in Arthropoda. Mol Biol Evol. 2009;26:299–311.
Andrew DR, Brown SM, Strausfeld NJ. The minute brain of the copepod Tigriopus californicus supports a complex ancestral ground pattern of the tetraconate cerebral nervous systems. J Comp Neurol. 2012;520:3446–70.
Krause DW, O'Connor PM, Rogers KC, Sampson SD, Buckley GA, Rogers RR. Late Cretaceous terrestrial vertebrates from Madagascar: implications for Latin American biogeography. Ann Mo Bot Gard. 2006;93:178–208.
Giesbrecht W. Systematik und Faunistik der pelagischen Copepoden des Golfes von Neapel und der angrenzenden Meeres-abschnitte. Fauna Flora Golfes Neapel. 1892;19:1–831.
Kabata Z. Parasitic Copepoda of British Fishes. Ray Society: London, England; 1979.
Farris JS. The retention index and rescaled consistency index. Cladistics. 1989;5:417–9.
Rosenberg MS, Kumar S. Incomplete taxon sampling is not a problem for phylogenetic inference. Proc Natl Acad Sci U S A. 2001;98:10751–6.
Rosenberg MS, Kumar S. Taxon sampling, bioinformatics, and phylogenomics. Syst Biol. 2003;52:119–24.
Wiens JJ, Tiu J. Highly Incomplete Taxa Can Rescue Phylogenetic Analyses from the Negative Impacts of Limited Taxon Sampling. PLoS ONE. 2012;7:e42925.
Wu S, Xiong J, Yu Y. Taxonomic Resolutions Based on 18S rRNA Genes: A Case Study of Subclass Copepoda. PLoS ONE. 2015;10:e0131498.
Baek SY, Jang KH, Choi EH, Ryu SH, Kim SK, Lee JH, Lim YJ, Lee J, Jun J, Kwak M, et al. DNA Barcoding of Metazoan Zooplankton Copepods from South Korea. PLoS ONE. 2016;11:e0157307.
Blanco-Bercial L, Bradford-Grieve J, Bucklin A. Molecular phylogeny of the Calanoida (Crustacea: Copepoda). Mol Phylogenet Evol. 2011;59:103–13.
Rota-Stabelli O, Campbell L, Brinkmann H, Edgecombe GD, Longhorn SJ, Peterson KJ, Pisani D, Philippe H, Telford MJ. A congruent solution to arthropod phylogeny: phylogenomics, microRNAs and morphology support monophyletic Mandibulata. Proc R Soc Lond B Biol Sci. 2011;278:298–306.
Jenner RA. Higher-level crustacean phylogeny: Consensus and conflicting hypotheses. Arthropod Struct Dev. 2010;39:143–53.
Koenemann S, Jenner RA, Hoenemann M, Stemme T, von Reumont BM. Arthropod phylogeny revisited, with a focus on crustacean relationships. Arthropod Struct Dev. 2010;39:88–110.
Stollewerk A. The water flea Daphnia - a 'new' model system for ecology and evolution? J Biol. 2010;9:21.
Regier JC, Zwick A. Sources of Signal in 62 Protein-Coding Nuclear Genes for Higher-Level Phylogenetics of Arthropods. PLoS ONE. 2011;6:e23408.
Philippe H, Brinkmann H, Lavrov DV, Littlewood DTJ, Manuel M, Wörheide G, Baurain D. Resolving Difficult Phylogenetic Questions: Why More Sequences Are Not Enough. PLoS Biol. 2011;9:e1000602.
Harvey THP, Vélez MI, Butterfield NJ. Exceptionally preserved crustaceans from western Canada reveal a cryptic Cambrian radiation. Proc Natl Acad Sci U S A. 2012;109:1589–94.
Harvey THP, Pedder BE. Copepod mandible palynomorphs from the Nolichucky Shale (Cambrian, Tennessee): Implications for the taphonomy and recovery of small carbonaceous fossils. Palaios. 2013;28:278–84.
Cressey R, Boxshall G. Kabatarina pattersoni, a Fossil Parasitic Copepod (Dichelesthiidae) from a Lower Cretaceous Fish. Micropaleontol. 1989;35:150–67.
Cressey R, Patterson C. Fossil Parasitic Copepods from a Lower Cretaceous Fish. Science. 1973;180:1283–5.
Selden PA, Huys R, Stephenson MH, Heward AP, Taylor PN. Crustaceans from bitumen clast in Carboniferous glacial diamictite extend fossil record of copepods. Nat Commun. 2010;1:50.
Palmer AR. Miocene Copepods from the Mojave Desert, California. J Paleo. 1960;34:447–52.
Boxshall GA, Jaume D. Making waves: The repeated colonization of fresh water by copepod crustaceans. Adv Ecol Res. 2000;31:61–79.
The author sincerely thanks to Drs. Hae-Lip Suh and Ho Young Soh (Chonnam National University, Korea) for providing the initial inspiration. Dr. Ju-shey Ho (California State University at Long Beach, USA) provided helpful comments and suggesions on an earlier draft of this manuscript. The author also thanks Susumu Ohtsuka (Hiroshima University, Japan) for critical reading of the manuscript.
This work was supported by the Nebraska Research Initiative (to SE).
Availability of data and materials
SE carried out the data analysis and wrote the manuscript.
The author declares that he has no competing interests.
Consent for publication
Ethics and consent to participate
All identified copepod protein and nucleotide sequences can be found in Additional files 2 and 3 respectively. These sequences are also available from the local server: http://bioinformatics.unl.edu/eyun/Copepoda_Phylogenomics.
A custom Perl script, ConCat_seq.pl, is available upon request from the author.
Table S1. Orthologous sequences used and identified in this study. (XLSX 18 kb)
This file contains the copepod amino acid sequences in FASTA format. (TXT 71 kb)
This file contains the copepod nucleotide sequences in FASTA format. (TXT 207 kb)
This file contains the aligned and concatenated dataset used in this study. (DOCX 102 kb)
This file contains the data matrix of 54 morphological characters from Ho et al.  in Nexus format. (DOCX 17 kb)
Figure S1. Bayesian phylogenetic analysis of copepod orders with morphological characters taken from Ho et al. . (DOCX 63 kb)
Figure S2. Bayesian phylogeny using the “degen-1” nucleotide coding sequences. (DOCX 172 kb)
Figure S3. Bayesian phylogeny of nine copepod species with two outgroups. (DOCX 107 kb)
Figure S4. Maximum-likelihood phylogenies of arthropods focused on copepod species, based on a single gene region from A) 6-phosphogluconate dehydrogenase, B) carbamoylphosphate synthetase, and C) alanyl-tRNA synthetase. (DOCX 185 kb)
About this article
Cite this article
Eyun, S. Phylogenomic analysis of Copepoda (Arthropoda, Crustacea) reveals unexpected similarities with earlier proposed morphological phylogenies. BMC Evol Biol 17, 23 (2017) doi:10.1186/s12862-017-0883-5
- Divergence time