Research Article |
Corresponding author: Xiao Zhang ( xzhang_cn@163.com ) Academic editor: Brian Wiegmann
© 2023 Zehui Kang, Yuanyuan Xu, Guoquan Wang, Ding Yang, Xiao Zhang.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
|
Limoniidae, the most speciose family in the superfamily Tipuloidea, consists of four subfamilies and more than 11,000 species. However, mitochondrial (mt) genome sequences, which have been widely used for phylogenetic study, are available for only 11 species across three subfamilies. Thus, a larger variety of mt genome sequences in Limoniidae are required to improve our understanding of tipuloid phylogeny and genomic evolution. Here we present mt genomes of Elephantomyia (Elephantomyia) inulta Alexander, 1938 and Helius (Helius) pluto Alexander, 1932, representing the first mt genomes of the tribe Elephantomyiini (Limoniidae). The two mt genomes are typical circular DNA molecules and show similar gene order, nucleotide composition and codon usage. Standard ATN start and TAR stop codons are present in most protein-coding genes. All transfer RNA (tRNA) genes exhibited the cloverleaf secondary structure typical for metazoans except in tRNASer(AGN), which lacks the dihydrouridine arm. Phylogenetic analyses were performed based on four nucleotide matrixes for the currently sequenced species of Tipuloidea using Bayesian inference and maximum likelihood methods. Four-cluster likelihood mapping was used to study incongruent signals between different topologies. Pediciidae is supported as the earliest lineage in Tipuloidea, and the sister-group relationship between Cylindrotomidae and Tipulidae is also supported, but the monophyly of Limoniidae is not supported. Our study also supports the monophyly of Elephantomyiini (Elephantomyia + Helius), as one of origins of flower-visiting in Limoniidae. Although Elephantomyiini is sister to Limoniinae + Epiphragma (Limnophilinae) in our study, a more precise understanding of its phylogenetic position in Tipuloidea will require additional studies that include a broader species sample.
Elephantomyiinae, Elephantomyia, flower-visiting, Helius, mitogenome, phylogeny
Crane flies are one of the most taxonomically diverse groups of flies with more than 15,000 described species in about 500 genera and subgenera (
Limoniidae is the most speciose family in Tipuloidea and consists of about 150 genera and more than 11,000 species around the world (
Previous hypotheses for the relationships among major Tipuloidea groups proposed by A
Elephantomyiini is a tribe within Limoniidae and includes three genera: Elephantomyia Osten Sacken, 1860, Helius Lepeletier and Serville, 1828 and Protohelius Alexander, 1928 (
General morphology of limoniid crane flies with elongate mouthparts, represented by A Elephantomyia (Elephantomyodes) tianmushana Zhang, Li and Yang, 2015, B Elephantomyia (E.) laohegouensis Zhang, Li and Yang, 2015, C Toxorhina (Ceratocheilus) omnifusca Zhang, Li and Yang, 2015, D Helius (H.) pluto Alexander, 1932, E Helius (H.) pallidissimus Alexander, 1930 and F Geranomyia subablusa Qian and Zhang, 2020. Scale bars = 2.0 mm.
In the past three decades, a large number of taxonomic studies have been carried out on the tribe Elephantomyiini, mainly focusing on the species in Asia (
In addition, the monophyly, taxonomic status and position of Elephantomyiini have been subject to debate (Fig.
The mitochondrial (mt) genome is a double strand molecule of 15–16 kb in size that typically contains 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes and a noncoding A + T-rich region (control region). It has become the most extensively studied genomic system in insects. It is now widely used in the study of insect phylogenetics and molecular evolution due to its maternal inheritance, fast evolutionary rate, and highly conserved gene content (
Adult specimens of Elephantomyia (Elephantomyia) inulta Alexander, 1938 were collected from Motuo, Linzhi, Tibet, China, and adult specimens of Helius (Helius) pluto Alexander, 1932 were collected from Mount Daming, Nanning, Guangxi, China. Specimens were identified based on
An Illumina TruSeq library was prepared with 450 bp average insert size and sequenced on the Illumina Hiseq 2500 platform with 250 bp paired-end reads. The genomes of the two species were sequenced on one lane. About 4 Gb of clean data was obtained from the library after trimming using Trimmomatic (
The protein-coding, rRNA and tRNA genes were identified using the MITOS2 Webserver (http://mitos2.bioinf.uni-leipzig.de/index.py). Protein-coding genes that could not be predicted by that program were annotated by alignment with the homologous genes reported in other crane flies. Nucleotide composition of mt genomes and PCG codon usage was analyzed in MEGA 7.0 (
A total of 31 mt genomes were used for molecular analyses, including the two newly sequenced mt genomes of Elephantomyiini, 26 complete or nearly complete mt genomes of Tipuloidea available in GenBank, and three mt genomes of Trichoceridae used as outgroup (Table
Information of species used in molecular analysis with GenBank accession numbers of mitochondrial genome sequences.
Family | Subfamily | Species | Accession Number |
Pediciidae | Pediciinae | Pedicia sp. | KT970062 |
Limoniidae | Chioneinae | Chionea crassipes gracilistyla Alexander, 1936 | MK941181 |
Symplecta (Symplecta) hybrida (Meigen, 1804) | NC_030519 | ||
Limnophilinae | Conosia irrorata (Wiedemann, 1828) | NC_057072 | |
Epiphragma (Epiphragma) mediale Mao and Yang, 2009 | NC_057085 | ||
Euphylidorea (Euphylidorea) dispar (Meigen, 1818) | MT410841 | ||
Paradelphomyia sp. | KT970061 | ||
Pseudolimnophila (Pseudolimnophila) brunneinota Alexander, 1933 | MN398932 | ||
Limoniinae | Dicranomyia (Dicranomyia) modesta (Meigen, 1818) | MT628560 | |
Elephantomyia (Elephantomyia) inulta Alexander, 1938 | This study | ||
Helius (Helius) pluto Alexander, 1932 | This study | ||
Limonia phragmitidis (Schrank, 1781) | NC_044484 | ||
Metalimnobia (Metalimnobia) quadrinotata (Meigen, 1818) | MT584154 | ||
Rhipidia (Rhipidia) chenwenyoungi Zhang, Li and Yang, 2012 | KT970063 | ||
Cylindrotomidae | Cylindrotominae | Cylindrotoma sp. | KT970060 |
Tipulidae | Ctenophorinae | Tanyptera (Tanyptera) hebeiensis Yang and Yang, 1988 | NC_053795 |
Tipulinae | Nephrotoma flavescens (Linnaeus, 1758) | MT628586 | |
Nephrotoma quadrifaria quadrifaria (Meigen, 1804) | MT872674 | ||
Nephrotoma tenuipes (Riedel, 1910) | MN053900 | ||
Nigrotipula nigra nigra (Linnaeus, 1758) | MT483653 | ||
Tipula (Acutipula) cockerelliana Alexander, 1925 | NC_030520 | ||
Tipula (Dendrotipula) flavolineata Meigen, 1804 | MT410828 | ||
Tipula (Formotipula) melanomera gracilispina Savchenko, 1960 | MK864102 | ||
Tipula (Lunatipula) fascipennis Meigen, 1818 | NC_050319 | ||
Tipula (Nippotipula) abdominalis (Say, 1823) | JN861743 | ||
Tipula (Tipula) paludosa Meigen, 1830 | MT483696 | ||
Tipula (Vestiplex) aestiva Savchenko, 1960 | NC_063751 | ||
Tipula (Yamatotipula) nova Walker, 1848 | NC_057055 | ||
Trichoceridae | Paracladurinae | Paracladura trichoptera (Osten Sacken, 1877) | JN861751 |
Trichocerinae | Trichocera bimacula Walker, 1848 | JN861750 | |
Trichocera sp. | MW263048 |
The protein-coding and RNA genes were aligned individually with the MAFFT 7.0 online server with the algorithm G-INS-i strategy (
Heterogeneous sequence divergence can lead to strong biases in tree reconstructions, such as long branch effects or the misplacement of rogue taxa (
The mt genomes of two crane fly species in the tribe Elephantomyiini, E. (E.) inulta and H. (H.) pluto, are sequenced and analyzed for the first time. The nearly complete mt genomes of E. (E.) inulta (GenBank accession no. OP556661) and H. (H.) pluto (GenBank accession no. OP556662) are 14,551 bp and 14,358 bp in length, respectively. The control regions and short stretches on either side of the control regions are not obtained for either species. In the mt genome of E. (E.) inulta, 35 genes are detected (tRNAIle, tRNAGln and partial small rRNA (srRNA) are not detected), while in the mt genome of H. (H.) pluto, 34 genes are detected (tRNAIle, tRNAGln, tRNAMet, partial srRNA and partial ND2 are not detected) (Fig.
Gene maps of the mitochondrial genomes of two Elephantomyiini species sequenced in this study. The circular maps were drawn with OGDRAW (https://chlorobox.mpimp-golm.mpg.de/OGDraw.html). The transcriptional direction is indicated by arrows.
Organization of the mitochondrial genomes of Elephantomyia (Elephantomyia) inulta and Helius (Helius) pluto.
Gene | Direction | Location | Size (bp) | Anticodon | Codon | Intergenic nucleotide* | |
Start | Stop | ||||||
tRNA Met | J | 1-70/- | 70/- | CAT/- | |||
ND2 | J | 71-1096/1-912 | 1026/912 | ATT/- | TAA | 0/– | |
tRNA Trp | J | 1106-1174/911-980 | 69/70 | TCA | 9/–2 | ||
tRNA Cys | N | 1167-1236/973-1042 | 70/70 | GCA | –8/18 | ||
tRNA Tyr | N | 1251-1315/1065-1134 | 65/70 | GTA | 14/22 | ||
COI | J | 1314-2849/1150-2685 | 1536/1536 | TCG | TAA | –2/15 | |
tRNA Leu(UUR) | J | 2852-2918/2689-2756 | 67/68 | TAA | 2/3 | ||
COII | J | 2920-3601/2766-3450 | 682/685 | ATG | T-tRNA | 1/9 | |
tRNA Lys | J | 3602-3672/3451-3521 | 71/71 | CTT | 0/0 | ||
tRNA Asp | J | 3672-3736/3525-3591 | 65/67 | GTC | –1/3 | ||
ATP8 | J | 3737-3898/3592-3756 | 162/165 | ATT | TAA/TAG | 0/0 | |
ATP6 | J | 3892-4566/3750-4427 | 675/678 | ATG | TAA | –7/–7 | |
COIII | J | 4572-5360/4427-5215 | 789/789 | ATG | TAA | 5/–1 | |
tRNA Gly | J | 5361-5424/5219-5283 | 64/65 | TCC | 0/3 | ||
ND3 | J | 5425-5778/5284-5637 | 354/354 | ATG/ATT | TAA | 0/0 | |
tRNA Ala | J | 5780-5843/5642-5709 | 64/68 | TGC | 1/4 | ||
tRNA Arg | J | 5844-5910/5712-5777 | 67/66 | TCG | 0/2 | ||
tRNA Asn | J | 5911-5976/5779-5844 | 66/66 | GTT | 0/1 | ||
tRNA Ser(AGN) | J | 5977-6043/5845-5911 | 67/67 | GCT | 0/0 | ||
tRNA Glu | J | 6049-6117/5914-5979 | 69/66 | TTC | 5/2 | ||
tRNA Phe | N | 6138-6204/5999-6064 | 67/66 | GAA | 10/19 | ||
ND5 | N | 6205-7939/6072-7808 | 1735/1737 | ATG | T-tRNA/TAA | 0/7 | |
tRNA His | N | 7940-8005/7809-7875 | 66/67 | GTG | 0/0 | ||
ND4 | N | 8013-9353/7877-9217 | 1341/1341 | ATG | TAA | 7/1 | |
ND4L | N | 9347-9643/9211-9507 | 297/297 | ATG | TAA | –7/–7 | |
tRNA Thr | J | 9646-9712/9510-9575 | 67/66 | TGT | 2/2 | ||
tRNA Pro | N | 9713-9776/9576-9640 | 64/65 | TGG | 0/0 | ||
ND6 | J | 9779-10303/9642-10166 | 525/525 | ATT | TAA | 2/1 | |
CytB | J | 10303-11439/10170-11306 | 1137/1137 | ATG | TAA | –1/3 | |
tRNA Ser(UCN) | J | 11445-11512/11313-11380 | 68/68 | TGA | 5/6 | ||
ND1 | N | 11531-12472/11397-12341 | 942/945 | ATG/TTG | TAG | 18/16 | |
tRNA Leu(CUN) | N | 12474-12538/12343-12407 | 65/65 | TAG | 1/1 | ||
lrRNA | N | 12539-13864/12408-13728 | 1326/1321 | 0/0 | |||
tRNA Val | N | 13865-13936/13729-13800 | 72/72 | TAC | 0/0 | ||
srRNA | N | 13937-14551/13801-14358 | 615/558 | 0/0 | |||
* Intergenic nucleotide: minus indicates overlapping between genes. |
The mt genomes of both Elephantomyiini species are biased to high A+T% across their four major genome partitions (i.e. PCGs, tRNA genes, lrRNA gene and srRNA gene). The AT contents of whole mt genome, PCGs, tRNA genes and lrRNA gene in E. (E.) inulta (76.4%, 75.2%, 78.8% and 81.5%) are lower than those in H. (H.) pluto (76.8%, 75.8%, 79.4% and 82.1%), but the AT content of the srRNA gene in E. (E.) inulta (78.9%) is higher than that in H. (H.) pluto (76.3%). Both species show slightly positive AT-skew (0.01, 0.02) and negative GC-skew (–0.18, –0.21) for the whole mt genome, but show negative AT-skew (–0.16, –0.16; –0.04, –0.05) and positive GC-skew (0.04, 0.03; 0.33, 0.33) for PCGs and the lrRNA gene. For tRNAs, both species show insignificant or no AT-skew (0.01, 0.00) and positive GC-skew (0.11, 0.14). For the srRNA gene, E. (E.) inulta shows positive AT (0.02) and GC-skews (0.32), while H. (H.) pluto shows negative AT-skew (–0.03) and positive GC-skew (0.27) (Table
Nucleotide composition of the mitochondrial genomes of two Elephantomyiini species.
Region | E. (E.) inulta | H. (H.) pluto | |
Whole mt genome | A+T% | 76.4 | 76.8 |
G+C % | 23.7 | 23.2 | |
AT-skew | 0.01 | 0.02 | |
GC-skew | –0.18 | –0.21 | |
PCGs | A+T% | 75.2 | 75.8 |
G+C % | 24.7 | 24.3 | |
AT-skew | –0.16 | –0.16 | |
GC-skew | 0.04 | 0.03 | |
PCGs(J) | A+T% | 74.3 | 74.5 |
G+C % | 25.7 | 25.5 | |
AT-skew | –0.13 | –0.12 | |
GC-skew | –0.11 | –0.14 | |
PCGs(N) | A+T% | 76.8 | 77.7 |
G+C % | 23.1 | 22.3 | |
AT-skew | –0.22 | –0.22 | |
GC-skew | 0.28 | 0.32 | |
tRNAs | A+T% | 78.8 | 79.4 |
G+C % | 21.2 | 20.7 | |
AT-skew | 0.01 | 0.00 | |
GC-skew | 0.11 | 0.14 | |
lrRNA | A+T% | 81.5 | 82.1 |
G+C % | 18.5 | 17.9 | |
AT-skew | –0.04 | –0.05 | |
GC-skew | 0.33 | 0.33 | |
srRNA | A+T% | 78.9 | 76.3 |
G+C % | 21.2 | 23.7 | |
AT-skew | 0.02 | –0.03 | |
GC-skew | 0.32 | 0.27 | |
AT-skew = (A-T)/(A+T); GC-skew = (G-C)/(G +C) |
Each of the two newly sequenced mt genomes has 13 PCGs, of which COI, COII, COIII, CytB, ATP6, ATP8, ND2, ND3 and ND6 are coded on the majority strand, and ND4, ND4L, ND5 and ND1 are coded on the minority strand (Fig.
The total number of codons of mt genomes are 3,733 in E. (E.) inulta, and 3,700 in H. (H.) pluto but with incomplete ND2 (Tables S1, S2). Codon usage values are described by relative synonymous codon usage (RSCU), which reflects how often each codon is used relative to the expected number in the absence of usage bias. All RSCU values for each amino acid are similar between the two mt genomes, with Leu (UUR) and Ser (UCN) being the two most frequently used amino acids, and Leu (CUN), Met and Trp being the least. The most frequently used codon in each amino acids solely comprises A or T, reflecting the high AT content of PCGs (Fig. S1). These phenomena were also recorded from other mt genomes of lower Diptera (
To further investigate evolutionary patterns across the PCGs, the ratio of Ka (rates of nonsynonymous mutations)/Ks (rates of synonymous mutations) are calculated for each (Fig. S2). The Ka/Ks values for all 13 PCGs are lower than 1 (<0.70), implying purifying selection on all these genes. The Ka/Ks ratio of ND2 is obviously higher than other PCGs, which indicates that ND2 has a relatively higher evolutionary rate. In contrast, COI has the lowest Ka/Ks ratio, indicating that this gene has been subjected to the highest purifying selection.
Twenty tRNA genes are detected in E. (E.) inulta and 19 tRNAs in H. (H.) pluto. The tRNA genes lengths range from 64 bp to 72 bp (Table
As in the ancestral insect (
AliGROOVE analysis indicates that Chionea crassipes gracilistyla Alexander, 1936 has the strongest heterogeneity relative to other Tipuloidea species in all four datasets (Fig. S4), which may cause bias tree reconstructions and node support in phylogenetic analysis (
In all BI and ML trees, Pediciidae is sister to all other Tipuloidea, and a sister relationship between Cylindrotomidae and Tipulidae is strongly supported. These arrangements are consistent with the phylogeny by
Limoniidae is not supported as monophyletic clade in any phylogenetic trees. Symplecta (Symplecta) hybrida (Meigen, 1804) (Chioneinae) is sister to all non-pediciid crane flies in trees inferred from the PCG12RNA and PCG12 datasets under BI and ML methods (96% PP, 45% BV; 64% PP, 42% BV) (Figs
Limoniinae (including Elephantomyiini) + Epiphragma (Epiphragma) mediale Mao and Yang, 2009 (Limnophilinae) forms a clade in all phylogenetic trees (100% or 99% PPs for all BI trees; 65%, 56%, 43% and 37% BVs for ML trees). Elephantomyiini (Elephantomyia + Helius) (100% PP for all BI trees; 94%, 92%, 92% and 89% BV for ML trees) and Limoniinae (100% PP/BV for all trees) are two well-supported clades, which to some extent supports the suggestion of
Limnophilinae is a controversial group with respect to both its monophyly and relationships with other Limoniidae. The main clade of Limnophilinae (including four species) is sister to the clade containing Elephantomyiini, Epiphragma and Limoniinae in the trees inferred from the PCG12RNA dataset under BI and ML methods (95% PP, 27% BV) (Figs
Topologies I and II (Fig.
Questions | Groups | Number of Species |
Is Symplecta sister to all non-pediciid crane flies, or to Cylindrotomidae + Tipulidae? | G1: Pediciidae | 1 |
G2: Symplecta | 1 | |
G3: remaining Limoniidae | 11 | |
G4: Cylindrotomidae + Tipulidae | 14 | |
Is Limnophilinae part of Limoniinae, or does Limnophilinae have a closer relationship with Cylindrotomidae + Tipulidae? | G1: Pediciidae | 1 |
G2: Limnophilinae (except E. (E.) mediale) | 4 | |
G3: Limoniidae (except Chioneinae and Limnophilinae) | 7 | |
G4: Cylindrotomidae + Tipulidae | 14 |
Our FcLM analysis shows a support for the sister-group relationship between Symplecta and all non-pediciid crane flies (51.0%/43.9%/81.4%/81.3%) (Fig. S11). FcLM results for the placement of Symplecta are concordant with topology I (Fig.
Here, we present the first two mt genomes for the tribe Elephantomyiini, which are typical circular DNA molecules with lengths of 14,551 bp and 14,358 bp. Like the mt genomes of other crane flies, these two mt genomes show similar gene order, nucleotide composition and codon usage. Phylogenetic results support both new and traditional arrangements. The traditional views, that Pediciidae is sister to all remaining Tipuloidea, while Cylindrotomidae and Tipulidae are sister groups, are reconfirmed in this study. The four-family system of Tipuloidea and four-subfamily system of Limoniidae are found to be unstable classification systems. The monophyly of Limoniidae is not supported in our study, which indicates that Limoniidae may not be a natural group. In addition, two limoniid subfamilies (i.e. Limoniinae and Limnophilinae) may be para- or polyphyletic, as Epiphragma (Limnophilinae) has a closer relationship with Limoniinae. Our study supports the monophyly of Elephantomyiini, as Elephantomyia and Helius form a strongly supported clade, which represents a significant origin of flower-visiting in Limoniidae. However, the more precise phylogenetic position of Elephantomyiini in Tipuloidea, as well as other phylogenetic arrangements within Limoniidae, needs to be further revealed through additional studies with more species.
The authors have declared that no competing interests exist.
We express our sincere thanks to Fan Song (Beijing) for his great help in sequencing. This work was funded by the National Natural Science Foundation of China (32100356) and the High-level Talents Funds of Qingdao Agricultural University, China (663-1118015).
Tables S1, S2
Data type: .docx
Explanation note: Table S1. Codon usage of the mitochondrial genome of Elephantomyia (Elephantomyia) inulta. — Table S2. Codon usage of the mitochondrial genome of Helius (Helius) pluto.
Figures S1–S12
Data type: .pdf
Explanation note: Figure S1. Relative synonymous codon usage (RSCU) of the protein-coding genes in two newly sequenced mitochondrial genomes of Elephantomyiini. Leu1 = Leu (CUN); Leu2 = Leu (UUR); Ser1 = Ser (AGN); Ser2 = Ser (UCN). — Figure S2. The ratio of Ka/Ks of 13 PCGs in two newly sequenced mitochondrial genomes of Elephantomyiini. — Figure S3. Secondary structures of tRNAs of two Elephantomyiini species. All tRNAs are labeled with the abbreviations of their corresponding amino acids. Dashes indicate Watson-Crick base pairing and dots indicate G-U base pairing. — Figure S4. AliGROOVE analysis for four datasets. The mean similarity score between sequences is represented by a colored square, based on AliGROOVE scores ranging from minus one, indicating a large difference in sequence composition from the remainder of the dataset (red coloration), to plus one, indicating similarity to all other comparisons (blue coloration). — Figure S5. Phylogenetic tree of Tipuloidea inferred from the dataset PCG12RNA under ML method. Numbers at the nodes are bootstrap values. The family Trichoceridae was set as the outgroup. — Figure S6. Phylogenetic tree of Tipuloidea inferred from the dataset PCG under ML method. Numbers at the nodes are bootstrap values. The family Trichoceridae was set as the outgroup. — Figure S7. Phylogenetic tree of Tipuloidea inferred from the dataset PCG12 under BI method. Numbers at the nodes are posterior probabilities. The family Trichoceridae was set as the outgroup. — Figure S8. Phylogenetic tree of Tipuloidea inferred from the dataset PCG12 under ML method. Numbers at the nodes are bootstrap values. The family Trichoceridae was set as the outgroup. — Figure S9. Phylogenetic tree of Tipuloidea inferred from the dataset PCGRNA under BI method. Numbers at the nodes are posterior probabilities. The family Trichoceridae was set as the outgroup. — Figure S10. Phylogenetic tree of Tipuloidea inferred from the dataset PCGRNA under ML method. Numbers at the nodes are bootstrap values. The family Trichoceridae was set as the outgroup. — Figure S11. Results of four-cluster likelihood mapping of the first question as 2D simplex graphs based on four datasets. — Figure S12. Results of four-cluster likelihood mapping of the second question as 2D simplex graphs based on four datasets.