Research Article |
Corresponding author: Carmelo Andújar ( candujar@um.es ) Academic editor: Martin Fikácek
© 2024 Carmelo Andújar, Peter Hlaváč, Vasily V. Grebennikov.
This is an open access article distributed under the terms of the CC0 Public Domain Dedication.
Citation:
|
The tribe Raymondionymini has long been neglected in phylogenetic studies. The tribe is characterized by uncertain monophyly, fluctuating taxonomic status, and a composition prone to instability. All raymondionymine weevils are wingless and have eyes either completely absent or, rarely, consisting of a single ommatidium. With body lengths predominantly below three millimeters, they inhabit deep soil environments and are infrequently collected. The core of this tribe comprises nine genera distributed in Europe and around the Mediterranean region and encompassing 76 species, while six additional genera include 17 species distributed in USA (California), Mexico, Ecuador, Venezuela, Russian Far East, and Madagascar. Here, we present eight new mitogenomes, complemented by one publicly available, encompassing all but two Mediterranean genera of raymondionymine weevils. We used publicly available Curculionoidea mitogenomes to compile an all-inclusive dataset with 391 terminals and a reduced dataset with 61 terminals representing main families of Curculionoidea and subfamilies within Curculionidae. Our maximum likelihood and Bayesian phylogenetic analyses, employing both DNA and amino acids datasets under alternative partition schemes, consistently produced congruent phylogenies. Our results show that the Mediterranean raymondionymines form a strongly supported clade, and their easternmost and morphologically distinct genus Ubychia is sister to the rest of them. Most notably, our results consistently recover a sister relationship between the clade of Mediterranean raymondionymine weevils and a clade encompassing all remaining Curculionidae. Consequently, we propose a revision of weevil taxonomy: (i) Our target group is removed from the non-monophyletic subfamily Brachycerinae; (ii) this clade is resurrected to its former subfamily level within Curculionidae, as the subfamily Raymondionyminae stat. rev; (iii) the nine Mediterranean genera Alaocephala, Alaocyba, Coiffaitiella, Derosasius, Ferreria, Raymondiellus, Raymondionymus, Tarattostichus, and Ubychia compose Raymondionyminae stat. rev; (iv) and non-Mediterranean genera Alaocybites, Bordoniola, Gilbertiola, Homosomus, Neoubychia, and Schizomicrus are considered as “incertae sedis” pending further phylogenetic corroboration. We hypothesize that the remaining Brachycerinae and the non-Mediterranean representatives within Raymondionyminae constitute a series of species-poor early-diverging lineages representing currently unrecognized subfamilies of Curculionidae.
Shot-gun sequencing, mitochondrial metagenomics, Brachycerinae, Raymondionymini, Raymondionyminae, endogean, deep soil
The limits of the superfamily Curculionoidea have not been disputed given the easily observable possession of the adult rostrum that defines the clade. Similarly, the monophyly of the so called “true” weevils, classified as the family Curculionidae, is well supported based on both molecular and morphological data (
Given the phylogenetic uncertainties in the early evolution of Curculionidae, subfamily Brachycerinae has been defined as “the evolutionary twilight zone of true weevils” (
These sampling difficulties in obtaining deep soil beetles, which are often known only by the typical series or from only the type localities widely scattered across the Globe (
Three phylogenetic studies tangentially addressed the monophyly and/or sister group relationship of raymondionymine weevils. All of them, however, were limited in their design and, therefore, remained inconclusive in their findings.
Our study was triggered by the availability of difficult-to-obtain DNA-grade specimens of European raymondionymine weevils, our technical expertise in assembling mitochondrial genomes and phylogenetics, and the availability of a mitogenome dataset for weevils that was demonstrated highly informative (
A total of 16 specimens representing six of the nine described genera of European Raymondionymini are here firstly DNA extracted and barcoded (Table
DNA extraction was conducted from whole specimens (excepting the large-bodied N. scirpi for which a leg was used) using non-destructive procedures and Omega Mag-Bind® Blood and Tissue DNA Kit (Omega Bio-tek) in the KingFisher robotic system (Thermo Fisher Scientific inc.). PCR amplification was done for the 5’ end COI gene (standard barcode region for Metazoa;
One representative per species was selected for mitogenome sequencing and assembly following the mitochondrial metagenomics approach (
A first and preliminary analysis was designed to construct a guide tree including our nine newly generated mitogenomes plus available Curculionoidea mitogenomes within the NCBI nucleotide database. This guide tree was to serve two purposes. Firstly, we wanted to preliminarily replicate the basal weevil branching events reported in earlier studies and summarized in
We designed our restricted phylogenetic analysis based on congruence between trees obtained for the preliminary dataset (391 terminals) and the well-resolved weevil topology of
Individual gene alignments from the reduced dataset were concatenated, yielding (i) a dataset of 15 genes and 12,603 bp (Reduced Dataset 1; RD1); (ii) a dataset with exclusively the 13 PCGs and a length of 10,191 bp (RD2); and (iii) a dataset with amino acids sequences obtained from the 13 PCGs (invertebrate mitochondrial code) with a length of 3,396 AAs (RD3). These three datasets were used for maximum likelihood (ML) and Bayesian phylogenetic analyses. ML trees were obtained using RAxML v.8 (
Reassembly within Geneious of contigs generated with IDBA, SPADES, and RAY showed wide overlap and a perfect match with barcode cox1 sequences generated using Sanger sequencing, allowing to unambiguously identify newly generated mitogenomes. The two pairs of specimens each representing the genera Alaocyba and Raymondiellus that were included in two independent libraries (Table
Specimens of Curculionidae: Brachycerinae (including those of the tribe Raymondionymini herein re-classified as the subfamily Raymondionyminae) used in our DNA analyses. An asterisk (*) indicates sequences retrieved from GenBank. Two and three asterisks (** and ***) indicate two mitogenomes, each obtained twice from independent libraries, corresponding to specimens CNCCOLVG000010827 and CNCCOLVG000010826 in
Taxa | Voucher code | Barcode GB accession | Mitogenome GB accession | Country | Latitude | Longitude | Tribe |
Ubychia sp1. | sci3153 | PP949483 | PP889716 | Croatia | 45.356 | 14.766 | Raymondionymini |
Ubychia sp2. | sci3141 | PP949471 | PP889720 | Georgia | 41.6514 | 41.764 | Raymondionymini |
Ubychia sp2. | sci3142 | PP949472 | Georgia | 41.6514 | 41.764 | Raymondionymini | |
Raymondiellus sp. | sci3592** | PP949484 | PP889721 | Italy | 39.26 | 8.65 | Raymondionymini |
Raymondiellus sp. | sci3148 | PP949478 | Italy | 39.234 | 8.599 | Raymondionymini | |
Raymondiellus sp. | sci3149 | PP949479 | Italy | 39.234 | 8.599 | Raymondionymini | |
Derosasius damryi | sci3150 | PP949480 | PP889719 | Italy | 40.534 | 9.584 | Raymondionymini |
Coiffaitiella sp. | BMNH 1041911 | n.a. | MK692586* | Spain | 36.772637 | –5.423984 | Raymondionymini |
Ferreria marqueti | sci3621 | PP949486 | PP889715 | Spain: Canary Islands | 28.497137 | –16.345822 | Raymondionymini |
Ferreria marqueti | sci3143 | PP949473 | England | 52.034 | –2.423 | Raymondionymini | |
Ferreria marqueti | sci3144 | PP949474 | England | 52.034 | –2.423 | Raymondionymini | |
Ferreria marqueti | sci3145 | PP949475 | England | 52.034 | –2.423 | Raymondionymini | |
Alaocyba sp. | sci3595*** | PP949485 | PP889722 | Italy | 39.26 | 8.65 | Raymondionymini |
Alaocyba sp. | sci3146 | PP949476 | Italy | 39.234 | 8.599 | Raymondionymini | |
Alaocyba sp. | sci3147 | PP949477 | Italy | 39.234 | 8.599 | Raymondionymini | |
Raymondionymus laneyriei | sci3151 | PP949481 | PP889718 | France | 43.191 | 6.371 | Raymondionymini |
Raymondionymus lavagnei | sci3152 | PP949482 | PP889717 | France | 43.954 | 3.647 | Raymondionymini |
Notaris scirpi | CNCCOLVG00008489 | n.a. | PP889723 | Poland | 51.54 | 17.86 | Brachycerini |
Brachicerus muricatus | BMNH 696973 | n.a. | JN163970* | France | n.a. | n.a. | Brachycerini |
Lissorhopthus oryzophilus | n.a. | n.a. | MW732716* | China: Ningxia | n.a. | n.a. | Erirhinini |
Echinocnemus sp. | CG210 | n.a. | MH404139* | Australia | n.a. | n.a. | Erirhinini |
Ocladius sp. | CG288 | n.a. | MH404142* | RSA | n.a. | n.a. | Erirhinini |
Mitogenomic guide weevil phylogenetic trees obtained with the preliminary DNA and AA datasets (Supplementary Material 1) were highly congruent among themselves and with the basal weevil dichotomies found by
The reduced dataset included 61 terminals selected to represent all main lineages described above. Completeness score for the alignment as estimated in AliStat (
Results of 21 phylogenetic analyses of 61 Curculionoidea mitogenomes focussing on the monophyly and phylogenetic position of Raymondionyminae. Columns two to seven define various analytical parameters (DNA or proteins, software used, number of genes, partition scheme, number of replicates, and the representative replicate shown in Table S2). Column taxonomic abbreviations: BRE: Brentidae; CUR: Curculionidae; Dry: Dryophthorinae; Pla: Platypodinae; Ray: Raymondionyminae; Bag: Bagous, Bra: Brachycerus; Ech: Echinocnemus; Lis: Lissorhoptrus; CEGH: the CEGH clade (Cyclominae, Entiminae, Gonipterini, Hyperinae); CCCSM: the CCCMS clade (Conoderinae, Cossoninae, Curculioninae, Molytinae, Scolytinae). Cell values and colour represent statistical support for respective branches: >94 (dark grey), 90-94 (gray), 70-89 (light gray), <70 (white).
DNA/ proteins | Software | Genes | Partitions | Maxdiff * (Phylobayes) | Meandiff * (Phylobayes) | Total reps | Rep n | BRE | CUR | BRE + CUR | Ray | Cur– Ray | Cur-(Ray, Lis, Ech) | Dry | Pla | Dry + Pla | (Pla + Bra) + Dry | CEGH | CCCMS | CEGH + CCCMS | (CEGH + CCCMS) + Bag |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
proteins | IqTree | 13 | Bygene | 2 | 1 | 100 | 100 | 100 | 100 | 100 | 94 | 93 | 100 | none | 91 | 100 | 100 | 72 | 100 | ||
proteins | IqTree | 13 | Bygene | 2 | 2 | 100 | 100 | 100 | 100 | 100 | 94 | 90 | 100 | none | 90 | 100 | 100 | 71 | 100 | ||
proteins | Phylobayes | 13 | N.a. | 0.228 | 0.00929 | 2 | 1 | 100 | 100 | 97 | 100 | 96 | 72 | 59 | 100 | none | 72 | 100 | 100 | 50 | 95 |
proteins | Phylobayes | 13 | N.a. | 0.212 | 0.01116 | 2 | 2 | 100 | 100 | 100 | 100 | 93 | 72 | 63 | 100 | 94 | none | 100 | 100 | 74 | 100 |
proteins | RAxML | 13 | Bygene | 1 (1000 searches) | 1 | 100 | 100 | 100 | 100 | 90 | 68 | 60 | 100 | 93 | none | 100 | 100 | 74 | 99 | ||
DNA | IqTree | 13 | Bygene | 2 | 1 | 100 | 100 | 100 | 100 | 99 | 96 | 93 | 100 | none | 95 | 100 | 100 | 92 | 97 | ||
DNA | IqTree | 13 | Bygene | 2 | 2 | 100 | 100 | 100 | 100 | 99 | 97 | 96 | 100 | none | 97 | 100 | 100 | 93 | 98 | ||
DNA | IqTree | 13 | Bygeneand bycodon | 2 | 1 | 100 | 100 | 100 | 100 | 100 | 95 | 79 | 100 | none | 89 | 100 | 100 | 90 | 97 | ||
DNA | IqTree | 13 | Bygeneand bycodon | 2 | 2 | 100 | 100 | 100 | 100 | 100 | 96 | 84 | 100 | none | 89 | 100 | 100 | 88 | 97 | ||
DNA | RAxML | 13 | Bygene | 1 (1000 searches) | 1 | 100 | 100 | 100 | 100 | 85 | 73 | 67 | 100 | none | 60 | 100 | 100 | 72 | 67 | ||
DNA | RAxML | 13 | Bygene | 1 (1000 searches) | 1 | 100 | 100 | 100 | 100 | 94 | none | 52 | 100 | none | none | 100 | 100 | 69 | 54 | ||
DNA | Phylobayes | 13 | N.a. | 0.290 | 0.01496 | 2 | 1 | 100 | 100 | 100 | 100 | 91 | 69 | 58 | 100 | 93 | none | 100 | 100 | 72 | 99 |
DNA | Phylobayes | 13 | N.a. | 0.312 | 0.01718 | 2 | 2 | 100 | 100 | 100 | 100 | 90 | 70 | 57 | 100 | 94 | none | 100 | 100 | 72 | 99 |
DNA | IqTree | 15 | Bygene | 5 | 1 | 100 | 100 | 100 | 100 | 100 | 95 | 99 | 100 | none | 83 | 100 | 100 | 88 | 97 | ||
DNA | IqTree | 15 | Bygene | 2 | 2 | 100 | 100 | 100 | 100 | 100 | 96 | 100 | 100 | none | 82 | 100 | 100 | 89 | 95 | ||
DNA | IqTree | 15 | Bygeneand bycodon | 2 | 1 | 100 | 100 | 100 | 100 | 100 | 96 | 99 | 100 | none | 65 | 100 | 100 | 80 | 98 | ||
DNA | IqTree | 15 | Bygeneand bycodon | 2 | 2 | 100 | 100 | 100 | 100 | 100 | 96 | 99 | 100 | none | 64 | 100 | 100 | 85 | 98 | ||
DNA | RAxML | 15 | Bygene | 1 (1000 searches) | 1 | 100 | 100 | 100 | 100 | 88 | 61 | 88 | 100 | none | 39 | 100 | 100 | 70 | 56 | ||
DNA | RAxML | 15 | Bygene | 1 (1000 searches) | 1 | 100 | 100 | 100 | 100 | 92 | none | 86 | 100 | none | none | 100 | 100 | 72 | 44 | ||
DNA | Phylobayes | 15 | N.a. | 0.368 | 0.01648 | 2 | 1 | 100 | 100 | 100 | 100 | 96 | none | none | 100 | 99 | none | 100 | 100 | 99 | 99 |
DNA | Phylobayes | 15 | N.a. | 0.0998 | 0.00484 | 2 | 2 | 100 | 100 | 100 | 100 | 95 | none | none | 100 | 99 | none | 100 | 100 | 99 | 99 |
* Largest (maxdiff) and mean (meandiff) discrepancy observed across all bipartitions after burn-in in Phylobayes analyses: maxdiff < 0.1: good run; maxdiff < 0.3: acceptable; 0.3 < maxdiff < 1: the sample is not yet sufficiently large) |
Maximum likelihood phylogenetic tree obtained with PhyloBayes AA dataset including nine Raymondionyminae (branches in red) and 51 other weevils. Circles indicate support values. Habitus images of congeneric (not necessarily conspecific) specimens were taken by us (Alaocyba sp.), by Udo Schmidt (Ferreria marqueti (Aubé)), by Ilya Zabaluev (Sitophilus zeamais (Motschulsky) and Bagous meregallii Caldara et O’Brien), and by Kirill Makarov (Rhynchites bacchus (Linnaeus), Nanophyes marmoratus (Goeze), Notaris scirpi (Fabricius), Ocladius salicorniae (Olivier), Platypus sp., Sitona obsoletus (Gmelin), Curculio aino Kono, Scolytus ratzeburgi Janson, and Ceutorhynchinae sinicus Voss); not to scale; used with permission.
All 21 restricted analyses recovered the non-raymondionymine rest of the family Curculionidae as a clade. In 13 of these analyses this clade was strongly supported (Bootstrap ≥ 95; pp ≥ 0.95; Table
Considering these results, we implement the following taxonomic acts within the family Curculionidae: (i) the tribe Raymondionymini is removed from the non-monophyletic subfamily Brachycerinae and resurrected to its former subfamily level as Raymondionyminae stat. rev.; (ii) the Mediterranean genera Alaocephala, Alaocyba, Coiffaitiella, Derosasius, Ferreria, Raymondiellus, Raymondionymus, Tarattostichus, and Ubychia are retained within Raymondionyminae stat. rev.; and (iii) all non-Mediterranean genera previously placed within Raymondionymini (genera Alaocybites, Bordoniola, Gilbertiola, Homosomus Richard, Neoubychia Gilbert and Howden, and Schizomicrus) are considered as “incertae sedis” within the subfamily Raymondionyminae stat. rev. pending further phylogenetic corroboration.
Our results are remarkably consistent with the gradually emerging phylogenetic framework of weevil (see Introduction). Our topologies display the composition and arrangement of the main weevil clades (Figs
Four of the five non-raymondionymine Brachycerinae genera here studied (i.e., Ocladius, Brachycerus, Lissorhoptrus and Echinocnemus) were among the seven Brachycerinae analyzed by
Within the lineage formed by the Mediterranean raymondionymines, we have found that the genus Ubychia forms the sister clade to the rest of the subfamily in all our analyses. This result is consistent with (i) Ubychia being the easternmost representative of the subfamily and the only genus inhabiting the Caucasus and Elburs mountains (
Mediterranean raymondionymine weevils, here represented by seven of the nine known genera, form a strongly supported lineage sister to the remaining Curculionidae, in agreement with the phylogenetic position reported by
The absence within our analyses of representatives from the six non-Mediterranean genera (19 species) prevents any conclusion about the global monophyly and the limits of the group. The relationship between the non-Mediterranean genera and the Mediterranean lineage has been previously questioned (
Consequently, the phylogenetic position of all non-Mediterranean genera previously placed within Raymondionymini, including Alaocybites (two species endemic to California and two species endemic to the Russian Far East), Bordoniola (seven species in Ecuador and Venezuela), Gilbertiola (two species in California), Homosomus (three species in Madagascar), Neoubychia (monotypic in Mexico), and Schizomicrus (monotypic in California), can not be established according to current phylogenetic or morphological evidence and are left as incertae sedis within Raymondionyminae stat. rev., requiring further phylogenetic corroboration. Given the results of the present study, we hypothesize that additional currently undetected species-poor early offshoots of the true weevil radiation might await taxonomic recognition as subfamilies of Curculionidae. Most likely these lineages are hidden in what we consider the evolutionary twilight zone of true weevils. The latter is formed by the remaining members of the herein circumscribed subfamily Brachycerinae and the six non-Mediterranean genera of the subfamily Raymondionyminae. Bringing these intuitively classified organisms under a phylogenetic spotlight will significantly improve our knowledge on the early evolution and allow to fine-tune the systematics of the charismatic and megadiverse clade of true weevils.
Our mitogenomic phylogenetic analyses using maximum likelihood and Bayesian inferences recovered congruent topologies, which show that the Mediterranean raymondionymines form a strongly supported clade with a sister relationship with a clade encompassing all remaining Curculionidae. Remarkably, our findings align closely with a recent phylogenetic reconstruction by
Author contributions. Carmelo Andújar: Methodology, Formal analysis, Investigation, Resources, Writing – Original Draft, Writing – Review and Editing. Peter Hlaváč: Investigation, Resources, Writing – Review and Editing. Vasily Grebennikov: Conceptualization, Methodology, Investigation, Resources, Writing – Original Draft, Writing – Review and Editing, Project administration.
Conflict of interest. The authors declare that there is no conflict of interest.
Data availability statement. The molecular data newly generated for this study is available in GenBank. Accession numbers PP889715–PP889723 for mitogenomes and PP949471–PP949486 for cox1 barcode sequences.
Leo Fancello (Cagliari, Italy), Jon Cooter (Hereforg, UK), Volker Brachat (Geretsried, Germany), Heinrich Meybohm (Grosshansdorf, Germany), Christian Perez (Istres, France) and late Volker Assing (Germany) all are highly acknowledged for the collecting of DNA grade raymondionymine weevils used in this project. Marek Wanat (Wrocław, Poland) collected, identified, and made available the sequenced specimen of Notaris scirpi. Kirill V. Makarov (Moscow, Russia), Udo Schmidt (Selbitz, Germany), and Ilya A. Zabaluev (Tver, Russia) took and made available habitus photographs used in Fig.
Figures S1–S25
Data type: .pdf
Explanation notes: Figure S1. ML tree obtained in Geneious using FastTree for the 16 newly generated barcode sequences. — Figure S2. Consensus tree obtained from PhyloBayes for the Preliminary Dataset 1 (DNA; 15 genes; 391 terminals). — Figure S3. Consensus tree obtained from PhyloBayes for the Preliminary Dataset 1 (DNA; 13 genes; 391 terminals). — Figure S4. Consensus tree obtained from PhyloBayes for the Preliminary Dataset 1 (AA; 13 genes; 391 terminals). — Figure S5. Tree estimated with IqTree with the Reduced Dataset 3 (AA, 13 genes, 61 terminals). Replicate 1. — Figure S6. Tree estimated with IqTree with the Reduced Dataset 3 (AA, 13 genes, 61 terminals). Replicate 2. — Figure S7. Tree estimated with RAxML with the Reduced Dataset 3 (AA, 13 genes, 61 terminals). — Figure S8. Consensus tree obtained from PhyloBayes with the Reduced Dataset 3 (AA, 13 genes, 61 terminals). Replicate 1. — Figure S9. Consensus tree obtained from PhyloBayes with the Reduced Dataset 3 (AA, 13 genes, 61 terminals). Replicate 2. — Figure S10. Tree estimated with IqTree with the Reduced Dataset 2 (DNA, 13 genes, 61 terminals) partitioning by gene. Replicate 1. — Figure S11. Tree estimated with IqTree with the Reduced Dataset 2 (DNA, 13 genes, 61 terminals) partitioning by gene. Replicate 2. — Figure S12. Tree estimated with IqTree with the Reduced Dataset 2 (DNA, 13 genes, 61 terminals) partitioning by gene and by codon. Replicate 1. — Figure S13. Tree estimated with IqTree with the Reduced Dataset 2 (DNA, 13 genes, 61 terminals) partitioning by gene and by codon. Replicate 2. — Figure S14. Tree estimated with RAxML with the Reduced Dataset 2 (DNA, 13 genes, 61 terminals) partitioning by gene. — Figure S15. Tree estimated with RAxML with the Reduced Dataset 2 (DNA, 13 genes, 61 terminals) partitioning by gene and by codon. — Figure S16. Consensus tree obtained from PhyloBayes with the Reduced Dataset 2 (DNA, 13 genes, 61 terminals). Replicate 1. — Figure S17. Consensus tree obtained from PhyloBayes with the Reduced Dataset 2 (DNA, 13 genes, 61 terminals). Replicate 2. — Figure S18. Tree estimated with IqTree with the Reduced Dataset 1 (DNA, 15 genes, 61 terminals) partitioning by gene. Replicate 1. — Figure S19. Tree estimated with IqTree with the Reduced Dataset 1 (DNA, 15 genes, 61 terminals) partitioning by gene. Replicate 2. — Figure S20. Tree estimated with IqTree with the Reduced Dataset 1 (DNA, 15 genes, 61 terminals) partitioning by gene and by codon. Replicate 1. — Figure S21. Tree estimated with IqTree with the Reduced Dataset 1 (DNA, 15 genes, 61 terminals) partitioning by gene and by codon. Replicate 2. — Figure S22. Tree estimated with RAxML with the Reduced Dataset 1 (DNA, 15 genes, 61 terminals) partitioning by gene. Fig. S23. Tree estimated with RAxML with the Reduced Dataset 1 (DNA, 15 genes, 61 terminals) partitioning by gene and by codon. — Figure S24. Consensus tree obtained from PhyloBayes with the Reduced Dataset 1 (DNA, 15 genes, 61 terminals). Replicate 1. — Figure S25. Consensus tree obtained from PhyloBayes with the Reduced Dataset 1 (DNA, 15 genes, 61 terminals). Replicate 2.
Table S1
Data type: pdf
Explanation notes: Summary statistics for the alignments used estimated with AliStat (