Research Article |
Corresponding author: Mattia Ragazzini ( mattia.ragazzini@uzh.ch ) Academic editor: Vinicius S. Ferreira
© 2025 Mattia Ragazzini, Roy M. J. C. Kleukers, Luc Willemse, Baudewijn Ode, Lara-Sophie Dey, Oliver Hawlitschek.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
|
The Mediterranean Basin, recognized as a global biodiversity hotspot, harbors a remarkable diversity of grasshoppers, katydids, and crickets, many of which are endemic and potentially contain cryptic lineages. In this study, we generated a comprehensive dataset comprising 1,441 barcodes from 270 identified species within the Ensifera and Caelifera suborders. These were combined with existing data to form a dataset of 2,606 barcodes representing 351 species. We employed Maximum Likelihood (ML) topology reconstruction and applied five species delimitation methods (BIN, ABGD, ASAP, GMYC, and PTP) to detect potential incongruences between Operational Taxonomic Units (OTUs) and existing taxonomic classifications. Our analysis revealed that OTUs delimited by these methods corresponded to 71.39% of the evaluated species, with a notably higher congruence in Ensifera (88.53%) compared to Caelifera (52.15%). Across the dataset, we identified 54 lineages comprising cryptic species, indicating significant unrecognized diversity within these groups. Additionally, 21 instances of species being merged into consensus OTUs were observed, suggesting either the need for taxonomic revision or highlighting the limitations of current genetic markers. Among the methods tested, ABGD, particularly with the Kimura two-parameter model, was the most consistent with traditional taxonomy, yielding the highest consensus rates. In contrast, the PTP method exhibited the lowest consensus, often leading to an oversplitting of lineages. These findings underscore the complexity of species delimitation in recently radiated taxa and emphasize the importance of using multiple methodologies to accurately capture biodiversity, especially in regions characterized by a high prevalence of cryptic species.
BOLD (Barcoding Of Life Data), COI (Cytochrome C Oxidase subunit 1), endemism, HGT (Horizontal Gene Transfer), HTS (High-Throughput Sequencing), ILS (Incomplete Lineage Sorting), Mediterranean area, MOTU (Molecularly defined Operational Taxonomic Unit)
Over the last two decades, DNA barcoding has developed into an invaluable tool for taxonomists, enhancing the ability to diagnose species across all life history stages, particularly when traditional morphology-based taxonomy is challenging or when taxa are morphologically ambiguous (
Orthoptera are prime candidates for DNA barcoding due to their significant diversity and ecological importance (
So far, comparatively few dedicated barcoding studies worldwide have targeted Orthoptera (
Shortcomings still affect DNA barcode efficiency, especially in caeliferans (
Nabholz et al. (2023), analysing data from
Species delimitation requires integrating multiple independent lines of evidence, in order to obtain accurate results (
Several European orthopteran species were barcoded for the first time in this study, contributing to the expanding dataset of COI sequences available for Orthoptera. Given the challenges posed by incomplete lineage sorting (ILS), mtDNA introgression, and nuclear mitochondrial pseudogenes, this study highlights the importance of integrating multiple complementary species delimitation methods to overcome these limitations. By employing a multifaceted approach, we aim to enhance species identification accuracy, uncover cryptic diversity, and define more reliable Molecular Operational Taxonomic Units (MOTUs). Through the integration of molecular and morphological evidence, this study reveals hidden biodiversity and establishes a foundation for future taxonomic revisions and conservation efforts. Our findings will help refine species delimitation in Orthoptera, supporting biodiversity assessments and guiding targeted conservation strategies, particularly in the Mediterranean, a region renowned for its high endemism and ecological significance.
Since 1999, the entomologists and orthopterologists Baudewijn Odé, Rob Felix, Luc Willemse, and Roy Kleukers have been organizing Orthoptera collection expeditions on the field, in southern Europe, focusing primarily on Spain, Portugal, Italy, Greece, and Romania. Voucher specimens were killed using ethyl acetate, dried, mounted, and identified utilizing literature and identification keys specifically designed for European orthopterans (e.g.,
The DNA extraction, amplification and sequencing were performed in two parts. 630 samples were processed at the Leibniz Institute for the Analysis of Biodiversity Change (LIB) in Hamburg (Germany), and 1,068 samples at the Naturalis Biodiversity Center in Leiden (Netherlands).
At LIB, DNA was extracted according to the methods of Chelex® 100 resin-based protocol (
The Sanger sequencing results (ab1 files) underwent a quality check. Sequences chromatograms were analysed using Geneious Prime 2023.1.1, trimming poor-quality edges and manually correcting bases with low-quality peak maps. Sequences with unreliable peaks were subjected to PCR and sequencing repetition, adjusting the annealing temperature to 49°C, which improved outcomes in several instances. Sanger sequencing data were filtered, selecting sequences longer than 500 bp (base pairs), except for some species where only few longer sequences were available.
At Naturalis, DNA extraction was performed adhering to the ARISE (Authoritative and Rapid Identification System for Essential biodiversity information) protocol (van Ommen Kloeke 2022; protocols.io; bomb.bio). Following DNA extraction and cleanup using KingFisher, PCR was conducted on all samples. This step employed the same primers (COBL and COBU).
PCR products were verified on a 2% agarose E-Gel, with nearly all samples showing positive results, prompting continuation with all samples (detailed PCR protocol in File S1 [protocol b]). Subsequently, the Nanopore dual barcoding protocol with kit 14 was applied for the sequencing. In this protocol, samples with no bands in the elektrophoresis were also used for library preparation.
All sequences were aligned using MAFFT V 7.505 (
The newly generated sequence data and metadata were then uploaded to the Barcode of Life Data Systems (BOLD PROJECT “MEDOR Barcoding of Mediterranean Orthoptera”). Thanks to our preliminary quality assessments, all barcode sequences met the automatic quality criteria of BOLD upon upload.
The initial dataset was augmented with two additional datasets of European Orthoptera sequences available on the BOLD system, named GBORT-GBOL (
DNA barcoding shows variable efficiency between Caelifera and Ensifera due to factors such as numts frequency (
Two preliminary Maximum Likelihood (ML) trees (
Specimens not identified at the species level were retained for further analysis, provided their most specifically identified clade was accurately placed within the ML tree, aligning with currently accepted Orthoptera taxonomy (
After quality measures, each definitive subset was re-aligned and trees reconstructed using the same parameters as for the preliminary analyses described above.
All phylogenetic analyses were executed with XSEDE (eXtreme Science and Engineering Discovery Environment) through the CIPRES Science Gateway (www.phylo.org;
The effectiveness of DNA barcoding in species identification was assessed by calculating the ratio between the number of species not exhibiting BIN sharing and the total number of species assigned a BIN code, following the methodology described by
Species delimitation analyses were conducted on the definitive datasets using various species delimitation approaches to detect and compare MOTUs, to explore species diversity and deepen the understanding of species boundary delimitation issues. For this purpose, both similarity (BIN, ABGD, ASAP) and clustering-based (GMYC, PTP) approaches designed for a single-locus strategy were utilized. In all methods we employed, the term “partition” refers to the grouping of sequences into distinct clusters or units, typically representing putative species. In methods such as ABGD and ASAP, partitions emerge from the identification of genetic gaps or distance thresholds, while in GMYC and PTP, partitions are determined by tree-based criteria, such as branch length distributions or coalescent processes (
BIN method: the Cluster Sequences tool, implemented by BOLD, generates MOTUs through the REfined Single Linkage (RESL) algorithm, grounded on uncorrected pairwise distances (p-distance). The Barcode Index Number (BIN) informatics system provides a unique alphanumeric code for MOTUs (BOLD: 3 letters, 4 numbers (
ABGD method: the Automatic Barcode Gap Discovery was adopted to divide samples based on genetic distance, detecting the so-called “barcode gap” (
ASAP method: the Assemble Species by Automatic Partitioning creates new partitions by amalgamating sequences at equal pairwise distances into progressively larger groups until the final partition encompasses all records (
For both ABGD and ASAP, the initial partition refers to the first grouping of sequences based on broader genetic thresholds, while the recursive partition further refines these clusters by progressively splitting them to detect finer genetic distinctions, potentially identifying cryptic species. Both ABGD and ASAP analyses were run on the respective online platforms (available at https://bioinfo.mnhn.fr/abi/public/abgd and https://bioinfo.mnhn.fr/abi/public/asap/asapweb.html, respectively).
GMYC method: the Generalized Mixed Yule Coalescent discerns the most likely putative species clusters basing on an ultrametric tree (
PTP method: this method delineates hypothetical species clusters by analysing branch length distributions in a rooted non-ultrametric gene tree. The PTP model, enhanced by the bPTP version with Bayesian support for delimited species on the input tree (
The Python-based script SPdel v.2.0 (
A comprehensive description of the command, input files, and options used for the SPdel.py analysis can be found in File S8.
The final MEDOR (Barcoding of Mediterranean Orthoptera) alignment comprises 1,441 newly generated, quality-checked barcodes, each 726 bp in length (accession numbers for newly generated sequences on BOLD (
Incorporating data from
The analysis of the merged dataset revealed significant variation in barcoding effectiveness between ensiferans and caeliferans. Overall, 76.72% of all studied Orthoptera species were accurately identified through DNA barcoding. However, the success rate differs markedly between the suborders: 63.64% for Caelifera, primarily due to BIN sharing among certain species, compared to 88.53% for Ensifera. The datasets for ensiferans and caeliferans also highlight differences in the species richness of the respective genera, with a weighted mean of 3.78 and 8.29 species per genus, respectively.
The ML trees of Caelifera and Ensifera are shown in Figs
Maximum Likelihood tree of Caelifera from Central and Southern Europe. The tree was created with IQ-Tree v.2.2.2.7 on XSEDE (eXtreme Science and Engineering Discovery Environment) through the CIPRES Science Gateway (www.phylo.org). The tree displays 1,443 barcodes of identified (164) and unidentified species. Higher taxonomic levels (all families and the subfamilies of Acrididae) are coloured in the tree. The species depicted inside the tree is Arcyptera (Arcyptera) tornosi Bolívar, 1884.
Maximum Likelihood tree of Ensifera from Central and Southern Europe. It was created with IQ-Tree v.2.2.2.7 on XSEDE (eXtreme Science and Engineering Discovery Environment) through the CIPRES Science Gateway (www.phylo.org). The tree displays 1,111 barcodes of identified (185) and unidentified species. Higher taxonomic levels (all families and the subfamilies of Tettigoniidae) are coloured in the tree. The species depicted inside the tree is Pseudomogoplistes vicentae Gorochov, 1996.
The various species delimitation approaches employed yielded hypothetical OTU counts ranging from 355 to 505, compared to 351 previously identified species (Table
Barcoding data and putative species clusters, according to all combinations of species delimitation approaches. — * displayed in SPdel summary graph (I. lemnotica, T. caudata, and G. obtusus delimitation features were discussed but not included in the graph).
Quantity | Caelifera | Ensifera | Total |
Barcordes count | |||
MEDOR dataset | 821 | 620 | 1441 |
|
482 | 263 | 745 |
|
175 | 245 | 720 |
Merged dataset | 1478 | 1128 | 2606 |
Identified species count | 166 | 185 | 351 |
Records without BIN | 35 | 17 | 52 |
MOTU count* | |||
BINs count | 162 | 283 | 445 |
ABGD Jukes-Cantor Initial partition | 130 | 237 | 367 |
ABGD Jukes-Cantor Recursive partition | 145 | 250 | 395 |
ABGD Kimura two-parameter Initial partition | 116 | 239 | 355 |
ABGD Kimura two-parameter Recursive partition | 123 | 250 | 373 |
ASAP Jukes-Cantor Recursive partition | 138 | 218 | 356 |
ASAP Kimura two-parameter Recursive partition | 130 | 275 | 405 |
GMYC Yule process and strict clock model | 160 | 304 | 464 |
GMYC Yule process and relaxed clock model | 160 | 265 | 425 |
GMYC Coalescent Constant Pop. process and strict clock | 160 | 318 | 478 |
PTP | 234 | 271 | 505 |
The ABGD method, especially its initial partition using the Kimura two-parameter model, emerged as the most conservative, contrasting with the PTP method, which identified the highest number of hypothetical species. Parameter variations within ABGD resulted in partitions of varying sizes, with the Jukes-Cantor model generally less conservative than the Kimura two-parameter, leading to a higher MOTUs count across both suborders, apart from the initial partition for caeliferans. Recursive partitions consistently indicated more species than initial partitions. Contrary to ABGD findings, the ASAP method with Kimura two-parameter resulted in larger partitions compared to those from Jukes-Cantor.
The GMYC method, applied with four different parameter combinations, produced a consistent number of species clusters for both ensiferans and caeliferans. However, the application of the Coalescent Constant Population process combined with a relaxed clock model resulted in an anomalously high species count for this dataset, leading to its results being set aside due to poor fit. The outcome of this specific combination, indicating an unexpectedly large number of species, suggests it may not be suitable for this particular dataset analysis. The PTP output in caeliferans surpassed the average species count across BIN, ABGD, ASAP, and GMYC methods, while in ensiferans, the 271 clusters of PTP aligned closely with the averages from other methods. The bPTP analysis, despite extensive generation counts, failed to converge, leading to the exclusion of its results in favour of maximum likelihood outcomes, as recommended by
The graphic outputs from SPdel are depicted in File S3 [graphs a, b], showcasing delimitation bars in the order of 11 method and parameter combinations, culminating in a final consensus.
In the maximum likelihood tree, while barcodes are correctly assigned in the absence of BIN sharing, higher taxonomic groups are often not retrieved as monophyletic. This suborder’s species delimitation methods generally concur on the number of putative species clusters, except for PTP, which delineates a higher species count. The hypothetical species range from 118 to 251. For further details see Table
Delimitation pattern of the caeliferan taxa. This only includes cases for which less than seven methods out of eleven reflect the traditional taxonomy. Unidentified MOTUs are included too. Unidentified records clustering within species-level identified MOTUs are not included. The first column indicates the taxon name, while the second one represents the relative number of specimens represented in the current database. The third column describes the pattern of consensus delimitation observed, while the fourth one delineates the name and number of methods not supporting the OTUs proposed in the previous field. In the fourth column, in square brackets, the method names are coded as follows: 1 = BIN; 2 = ABGD_JC_INIT; 3 = ABGD_JC_REC; 4 = ABGD_K2_INIT; 5 = ABGD_K2_REC; 6 = ASAP_JC; 7 = ASAP_K2; 8 = GMYC_STRICT_YULE; 9 = GMYC_REL_YULE; 10 = GMYC_STRICT_COAL; 11 = PTP. — (*): taxa for which the delimitation pattern is not clear, due to methods splitting and merging the same clusters simultaneously.
Taxa | Number of specimens | Consenus delimitation | Methods not supporting the consensus delimitation |
---|---|---|---|
Tetrigidae | |||
Tetrix bipunctata | 8 | Merged with T. kraussii | — |
Tetrix ceperoi | 11 | Merged in a single MOTU | 5/11 [3,5,6,7,11] |
Tetrix depressa | 13 | Split into two MOTUs | 1/11 [11] |
Tetrix kraussii | 9 | Merged with T. bipunctata | — |
Tetrix nodulosa | 2 | Split into two MOTUs | — |
Pamphagidae | |||
Acinipe segurensis | 5 | Split into two MOTUs | 1/11 [11] |
Eumigus monticola | 12 | Split into three MOTUs | 3/11 [1,3,5] |
Eumigus sp. 1 (MEDOR863-23) | 1 | Independent unidentified MOTU | — |
Eumigus sp. 2 | 3 | Independent unidentified MOTU | — |
Ocnerodes sp. (MEDOR392-23) | 1 | Independent unidentified MOTU | — |
Orchamus sp. (MEDOR081-23) | 1 | Independent unidentified MOTU | — |
Pamphagidae sp. 1 (MEDOR1128-23) | 1 | Merged with Glauia sp. | — |
Pamphagidae sp. 2 (MEDOR348-23, MEDOR349-23) | 2 | Independent unidentified MOTU | — |
Acrididae – Dericorythinae | |||
Dericorys sp. | 3 | Independent unidentified MOTU | — |
Acrididae – Pezotettiginae | |||
Pezotettix giornae | 6 | Split into three MOTUs | 3/11 [4,5,11] |
Acrididae – Melanoplinae | |||
Miramella alpina | 4 | Forming an independent MOTU, but one record fits with M. irena | 2/11 [4,11] |
Odontopodisma schmidtii | 3 | Split into two MOTUs | 5/11 [2,4,5,6,7] |
Odontopodisma sp. | 6 | Independent unidentified MOTU | 6/11 [1,2,3,4,5,11] * |
Peripodisma sp. | 1 | Independent unidentified MOTU | — |
Acrididae – Calliptaminae | |||
Calliptamus barbarus | 28 | Split into two MOTUs | — |
Calliptamus siciliae | 13 | Split into two MOTUs | 1/11 [11] |
Acrididae – Acridinae | |||
Acrida sp. (GBORT781-15) | 1 | Independent unidentified MOTU | — |
Acrididae – Oedipodinae | |||
Acrotylus fischeri | 5 | Merged with A. insubricus and part of A. patruelis | — |
Acrotylus insubricus | 11 | Merged with A. fischeri and part of A. patruelis | — |
Acrotylus patruelis | 11 | Forming an independent MOTU, but two records fit with A. insubricus and A. fischeri | — |
Aiolopus puissanti | 2 | Merged with A. thalassinus | — |
Aiolopus thalassinus | 9 | Forming an independent MOTU, but eight records fit with A. puissanti | 1/11 [4] |
Bryodemella tuberculata | 8 | Split into two MOTUs | 3/11 [2,4,11] |
Oedaleus decorus | 11 | Split into two MOTUs | 1/11 [4] |
Oedipoda coerulea | 7 | Merged with O. fuscocincta and O. germanica | — |
Oedipoda fuscocincta | 2 | Merged with O. germanica and O. coerulea | — |
Oedipoda germanica | 3 | Merged with O. fuscocincta and O. coerulea | — |
Psophus stridulus | 5 | Split into two MOTUs | 5/11 [1,2,4,6,11] |
Sphingonotus spp. (except Sphingonotus guanchus) | 94 | Divided into three main MOTUs, from different species: S. azurescens, S. almeriense, S. sublaevis, S. caerulans, S. morini, S. nodulosus; S. caerulans, S. rubescens, S. lluciapomaresi, S. lusitanicus, S. azurescens and S. azurescens, S. almeriense, S. imitans | 10/11 [1-10] * |
Thalpomena sp. | 2 | Independent unidentified MOTU | — |
Acrididae – Gomphocerinae | |||
Arcyptera spp. | 32 | Merged in a single MOTU | 5/11 [1,8,9,10,11] |
Chorthippus dorsatus, Chorthippus loratus and Chorthippus dichrous | 27 | Divided into two main MOTUs, one including just Chorthippus dorsatus and one including all three species* | 4/10 [2,4,5,7] |
Chorthippus jucundus | 2 | Split into two MOTUs | — |
Chorthippus sp. (MEDOR421-23) | 222 | Merged in a single MOTU | 10/11 [1,2,3,4,5,7,8,9,10,11] * |
Chorthippus vagans | 16 | Merged in a single MOTU, but C. v. dissimilis specimens cluster into a different MOTU | 6/11 [2,3,4,5,7,11] * |
Euchorthippus spp. (except Euchorthippus albolineatus) | 47 | Merged in a single MOTU | 5/11 [1,8,9,10,11] |
Omocestus panteli, O. viridulus, O. rufipes, O. haemorrhoidalis, O. femoralis, Myrmeleotettix maculatus, Stenobothrus festivus, S. bolivarii, S. stigmaticus, S. grammicus, S. sp. | 116 | Merged in a single MOTU | 8/11 [1,4,5,7,8,9,10,11] * |
Pseudochorthippus parallelus and Pseudochorthippus montanus | 27 | Divided into five main MOTUs, three ones including just P. parallelus and two ones including both species | 10/11 [1,2,3,4,5,7,8,9,10,11] * |
Stauroderus scalaris, Gomphocerus sibiricus, Stenobothrus sp., Gomphocerippus rufus, Chorthippus yersini, Ch. apricarius, C. brunneus, C. mollis, C. jacobsi, C. binotatus, C. messinai, C. nevadensis, C. biroi, C. acroleucus, C. macrocerus, C. mollis ignifer, C. biguttulus, C. sp., C. vagans dissimilis, C. maritimus maritimus | 137 | Merged in a single MOTU | 10/11 [1,2,3,4,5,7,8,9,10,11] * |
Stenobothrus eurasius, S. lineatus, S. fischeri, S. stigmaticus, S. nigromaculatus, S. rubicundulus, S. sp. | 43 | Merged in a single MOTU | 4/11 [4,5,7,11] |
Stenobothrus grammicus, S. lineatus, S. festivus, S. bolivarii, S. stigmaticus, S. crassipes, S. sp., Omocestus bolivari, O. minutissimus, O. uhagonii, O. femoralis, O. antigai antigai, Myrmeleotettix maculatus, M. antennatus | 236 | Merged in a single MOTU | 5/11 [1,8,9,10,11] |
In the maximum likelihood tree, while barcodes are correctly assigned in the absence of BIN sharing, higher taxonomic groups are often not retrieved as monophyletic. In Ensifera, the different delimitation methods show an overall agreement on the number of putative species clusters, with the only exception of PTP, showing a higher number of delimited species. The hypothetical species range from 118 to 251. For further details see Table
Delimitation pattern of the ensiferan taxa. This only includes cases for which less than seven methods out of eleven reflect the traditional taxonomy. Unidentified MOTUs are included too. Unidentified records clustering within species-level identified MOTUs are not included. The first column indicates the taxa name, while the second one represents the relative number of specimens represented in the current database. The third column describes the pattern of consensus delimitation observed, while the fourth one delineates the name and number of methods not supporting the OTUs proposed in the previous field. In the fourth column, in square brackets, the method names are coded as follows: 1 = BIN; 2 = ABGD_JC_INIT; 3 = ABGD_JC_REC; 4 = ABGD_K2_INIT; 5 = ABGD_K2_REC; 6 = ASAP_JC; 7 = ASAP_K2; 8 = GMYC_STRICT_YULE; 9 = GMYC_REL_YULE; 10 = GMYC_STRICT_COAL; 11 = PTP. — (*): Taxa for which the delimitation pattern is not clear, due to methods splitting and merging the same clusters simultaneously. — (**): See supplementary for discussion on this species’ delimitation pattern.
Taxa | Number of specimens | Consenus delimitation | Methods not supporting the consensus delimitation |
---|---|---|---|
Gryllidae | |||
Eugryllodes escalerae | 7 | Split into two MOTUs | 4/11 [7,8,10,11] |
Eugryllodes pipiens pipiens | 7 | Split into two MOTUs | — |
Gryllomorpha longicauda | 12 | Split into three MOTUs | 7/11 [2,3,6,8,9,10,11] * |
Modicogryllus frontalis | 2 | Merged with M. truncatus** | — |
Modicogryllus sp. 1 (MEDOR957-23, MEDOR958-23) | 2 | Independent unidentified MOTU | 5//11 [2,4,8,9,11] |
Modicogryllus sp. 2 | 7 | Independent unidentified MOTU | 5//11 [2,4,8,9,11] |
Modicogryllus truncatus | 1 | Merged with M. frontalis** | — |
Ovaliptila sp. 1 (MEDOR1122-23, MEDOR169-23) | 2 | Independent unidentified MOTU | — |
Ovaliptila sp. 2 (MEDOR1121-23, MEDOR080-23) | 2 | Independent unidentified MOTU | — |
Petaloptila aliena | 4 | Merged with P. galaica | — |
Petaloptila galaica | 13 | Merged with P. aliena | — |
Petaloptila sp. 1 (MEDOR429-23) | 1 | Independent unidentified MOTU | 4/11 [1,2,4,6] |
Petaloptila sp. 2 | 12 | Independent unidentified MOTU | — |
Oecanthidae | |||
Oecanthus pellucens | 15 | Split into two MOTUs | 8/11 [2,3,4,5,6,8,10,11] * |
Mogoplistidae | |||
Paramogoplistes dentatus | 3 | Split into two MOTUs | 2/11 [6,9] |
Gryllotalpidae | |||
Gryllotalpa vineae | 3 | Split into two MOTUs | 2/11 [6,9] |
Trigonidiidae | |||
Nemobius sylvestris | 16 | Split into three MOTUs | 6/11 [1,2,3,6,9,11] * |
Pteronemobius sp. | 2 | Independent unidentified MOTU | — |
Rhaphidophoridae | |||
Dolichopoda (MEDOR276-23) | 1 | Independent unidentified MOTU | — |
Tettigoniidae – Meconematinae | |||
Cyrtaspis scutata | 6 | Split into two MOTUs | — |
Meconema meridionale | 6 | Split into two MOTUs | — |
Tettigoniidae – Phaneropterinae | |||
Barbitistes serricauda | 7 | Split into two MOTUs | — |
Isophya brevicauda | 4 | Merged in a single MOTU | 6/11 [2,3,4,5,6,9] * |
Isophya kraussii | 8 | Merged in a single MOTU | 8/11 [2,3,4,5,6,8,9,10] * |
Isophya modestior | 6 | Split into three MOTUs | 3/11 [6,9,11] |
Leptophyes albovittata | 6 | Split into three MOTUs | 5/11 [1,6,7,8,10] |
Odontura sp. 1 (MEDOR1009-23, MEDOR1010-23, MEDOR054-23) | 3 | Independent unidentified MOTU | — |
Odontura sp. 2 (MEDOR1016-23) | 1 | Independent unidentified MOTU | 5/11 [2,3,4,5,6] |
Odontura sp. 3 (MEDOR1017-23, MEDOR1020-23, MEDOR1021-23) | 3 | Independent unidentified MOTU | 5/11 [2,3,4,5,6] |
Odontura sp. 4 (MEDOR1018-23, MEDOR1019-23, MEDOR1022-23, MEDOR531-23) | 4 | Independent unidentified MOTU | 6/11 [2,3,4,5,6,11] |
Phaneroptera nana | 22 | Split into two MOTUs | — |
Phaneroptera sparsa | 8 | Split into two MOTUs | — |
Poecilimon (MEDOR1253-23) | 1 | Independent unidentified MOTU | — |
Poecilimon cretensis | 3 | Split into two MOTUs | — |
Poecilimon fussii | 7 | Split into two MOTUs | 3/11 [1,7,10] |
Poecilimon schmidtii | 4 | Split into two MOTUs | 1/10 [10] |
Tettigoniidae – Conocephalinae | |||
Conocephalus sp. (MEDOR778-23, MEDOR779-23) | 2 | Independent unidentified MOTU | — |
Tettigoniidae – Bradyporinae | |||
Ephippiger diurnus | 4 | Split into two MOTUs | 2/11 [6,9] |
Ephippiger sp. (MEDOR820-23, MEDOR821-23) | 2 | Independent unidentified MOTU | — |
Ephippigerida diluta | 5 | Split into two MOTUs | 1/11 [9] |
Ephippigerini sp. (MEDOR223-23, MEDOR441-23) | 2 | Independent unidentified MOTU | 1/11 [9] |
Lluciapomaresius anapaulae | 1 | Merged with L. asturiensis | 1/11 [6] |
Lluciapomaresius asturiensis | 8 | Split into three MOTUs, one of which includes one specimen of L. anapaulae | 5/11 [1,6,8,10,11] |
Lluciapomaresius sp. 1 (MEDOR233-23) | 1 | Independent unidentified MOTU | 5/11 [2,3,6,9,11] |
Lluciapomaresius stalii | 15 | Merged in a single MOTU | 5/11 [1,7,8,9,10] |
Neocallicrania lusitanica | 3 | Split into two MOTUs, one of which includes one specimen of N. miegii | 3/11 [7,8,10] |
Neocallicrania miegii | 9 | Split into two MOTUs, with one single specimen merged with N. lusitanica | 3/11 [7,8,10] |
Neocallicrania selligera (except N. s. selligera and N. s. meridionalis) | 6 | Split into three MOTUs | 5/11 [6,7,8,10,11] |
Parasteropleurus martorellii | 9 | Split into two MOTUs | 1/11 [10] |
Platystolus martinezii | 9 | Split into three MOTUs | 5/11 [2,3,4,5,6] |
Pycnogaster cucullatus | 2 | Split into two MOTUs | — |
Steropleurus brunnerii | 4 | Split into two MOTUs | — |
Steropleurus flavovittatus | 6 | Split into two MOTUs | 2/11 [6,11] |
Steropleurus sp. (MEDOR046-23) | 1 | Independent unidentified MOTU | — |
Tettigoniidae – Tettigoniinae | |||
Antaxius difformis | 6 | Split into two MOTUs | — |
Antaxius kraussii | 9 | Split into three MOTUs | 7/11 [4,5,6,8,9,10,11] * |
Bicolorana bicolor | 7 | Split into two MOTUs | 3/11 [6,8,10] |
Decticus verrucivorus | 4 | Split into two MOTUs | 4/11 [2,4,6,9] |
Eupholidoptera schmidti | 10 | Merged in a single MOTU | 5/11 [1,3,5,8,10] |
Eupholidoptera smyrnensis | 8 | Split into two MOTUs | 2/11 [6,9] |
Eupholidoptera sp. 1 (MEDOR128-23, MEDOR129-23) | 2 | Independent unidentified MOTU | 5/11 [2,3,4,6,9] |
Eupholidoptera sp. 2 (MEDOR127-23, MEDOR282-23) | 2 | Independent unidentified MOTU | 5/11 [2,3,4,6,9] |
Incertana decorata | 2 | Merged with I. drepanensis | — |
Incertana drepanensis | 1 | Merged with I. decorata | — |
Montana sp. (MEDOR537-23, MEDOR059-23, MEDOR058-23) | 3 | Independent unidentified MOTU | — |
Parnassiana sp. (MEDOR292-23) | 1 | Independent unidentified MOTU | — |
Pholidoptera fallax | 3 | Split into two MOTUs | — |
Platycleis affinis | 19 | Split into three MOTUs, one of which includes one specimen of P. falx | 2/11 [8,10] |
Platycleis albopunctata | 22 | Split into two MOTUs, one of which includes specimens of P. sabulosa, while the second one is merged with P. concii, P. grisea, P. intermedia | 3/11 [8,9,10] |
Platycleis concii | 4 | Merged with P. albopunctata, P. grisea, P. intermedia | 3/11 [8,9,10] |
Platycleis falx | 1 | Merged with P. affinis | — |
Platycleis grisea | 2 | Merged with P. albopunctata, P. concii, P. intermedia | 3/11 [8,9,10] |
Platycleis intermedia | 6 | Merged with P. albopunctata, P. grisea, P. concii | 3/11 [8,9,10] |
Platycleis sabulosa | 33 | Split into two MOTUs, one of which includes specimens of P. albopunctata | 3/11 [8,9,10] |
Pterolepis lusitanica | 2 | Merged with P. spoliata | — |
Pterolepis spoliata | 12 | Split into three MOTUs, one of which includes two specimens of P. lusitanica | 1/11 [11] |
Rhacocleis annulata | 6 | Split into three MOTUs | 1/11 [9] |
Rhacocleis sp. 1 (MEDOR070-23) | 1 | Independent unidentified MOTU | — |
Rhacocleis sp. 2 (MEDOR1298-23) | 1 | Independent unidentified MOTU | 4/11 [2,3,6,9] |
Rhacocleis sp. 3 (MEDOR1297-23, MEDOR069-23) | 2 | Independent unidentified MOTU | 4/11 [2,3,6,9] |
Roeseliana oporina | 2 | Merged with R. r. roeselii | 1/11 [10] |
Roeseliana roeselii roeselii | 8 | Merged with R. oporina | 1/11 [10] |
Sporadiana sporadarum | 5 | Split into two MOTUs | 3/11 [6,9,11] |
Tessellana lagrecai | 3 | Merged with T. tessellata | 8/11 [1,2,3,6,8,9,10,11] * |
Tessellana sp. (MEDOR1449-23) | 1 | Independent unidentified MOTU | 4/11 [2,3,6,11] |
Tessellana tessellata (except T. t. tessellata) | 15 | Merged with T. lagrecai | 8/11 [1,2,3,6,8,9,10,11] * |
Tessellana tessellata tessellata | 1 | Independent MOTU | 4/11 [2,3,6,11] |
Thyreonotus bidens | 13 | Split into four MOTUs | 7/11 [2,3,4,5,6,7,11] * |
Thyreonotus corsicus | 4 | Split into two MOTUs | 1/11 [9] |
The percentages of DNA barcoding effectiveness in species identification are somewhat lower than those reported by
As expected from a barcode tree (
The detailed discussion of the delimited putative species clusters is organized according to traditional taxonomic families and can be found in File S7 [part 1] for caeliferans and File S7 [part 2] for ensiferans.
The employment of multiple species delimitation methods, each with distinct parameter combinations, on a comprehensive dataset has facilitated the identification of various patterns of diversity, some of which are clearly delineated, while others remain unresolved. In total, 12 cases for caeliferans and 42 cases for ensiferans were identified as potentially harboring more than one cryptic lineage. Additionally, there were 10 cases for caeliferans and 26 for ensiferans of unidentified independent MOTUs. The detected instances of barcode sharing (12 cases for caeliferans and nine cases for ensiferans) can either be attributed to intraspecific variability, suggesting a need for synonymization among these taxa, or may highlight the ineffectiveness of using a single genetic marker for accurate species delimitation, like the case of Chorthippus spp.
Consistent with prior studies, delimitation patterns within the Gomphocerinae (
Among well-represented taxa of Caelifera, instances of potential cryptic diversity are notably prevalent within the Pamphagidae and Tetrigidae families, as well as the Calliptaminae and Melanoplinae subfamilies of Acrididae. More distinct and frequent occurrences of potential cryptic diversity are observed within Ensifera, especially among Gryllidae, and also Tettigoniinae, Bradyporinae, and Phaneropterinae (Tettigoniidae). Ensiferans encompass a greater number of taxonomically recognized species (185 vs. 164) and the highest number of putative species clusters identified by each delimitation method, despite a smaller initial specimen count (1128 vs. 1478).
Among all the delimitation methods compared, ABGD, particularly when using the combination of Kimura two-parameters and the initial partition, produced results most consistent with traditional taxonomy. On the other hand, the PTP method was the least consistent, oversplitting the lineages into a large number of clusters.
Geographical regions partially reflect genetic diversification. Various potential cryptic taxa have indeed been identified in isolated localities, such as islands and glacial refugia. Key examples are represented by Anterastes supersp. serbicus Brunner von Wattenwyl, 1882 in Anatolia and the Balkans (
Widespread European genera, like Calliptamus Serville, 1831, a member of the subfamily Gomphocerinae, feature morphologically similar species but display distinct delimitation patterns, including probable cryptic taxa (e.g., within C. siciliae Ramme, 1927 and C. barbarus (Costa, 1836)). However, these patterns within Calliptamus diverge from those observed across other genera in the Acrididae family, despite their similarly broad distribution across Europe.
Another factor impacting geographical diversification patterns is anthropogenic transport and the introduction of non-endemic species. A well-documented case involves Rhacocleis annulata Fieber, 1853, native to Sicily but subsequently transported across large areas of continental Europe through the ornamental plant trade (
The BIN discordance analysis and observed delimitation patterns have revealed several cases of incongruency with currently accepted taxonomy. In order to exclude the misidentification of specimens and the use of obsolete taxonomic units as reasons for these incongruencies as far as possible, MEDOR was initially based exclusively on specimens collected in the field over the last decade (between 2013 and 2022) which were identified by experts directly after collection. The identification was verified after analysing the ML trees using the most recently published dichotomous keys. As taxonomic revisions of orthopterans are continuously being published (e.g.,
The quality control protocol, involving sequence alignment assessment, phylogenetic reconstruction, chromatogram analysis, and sequence identification using BLASTn, has excluded most suspect records potentially representing numts to the greatest extent possible. Additionally, the improvement in PCR primers has contributed to reducing numts. Furthermore,
The influence of Wolbachia bacteria was also examined by analysing each sequence with BLASTn. However, merely detecting the presence of this endosymbiont does not necessarily indicate its impact on the host genome or the occurrence of horizontal gene transfer (HGT) (
Signatures of potential hybridization and introgression have been confirmed in several taxa within this dataset, with notable cases reported among both caeliferans and ensiferans. Hybrids among Chorthippus biguttulus (Linnaeus, 1758) and related species have been generated under laboratory conditions, but hybrid males were found to be behaviorally sterile due to their intermediate courtship songs rejected by all females (
Incomplete lineage sorting (ILS), the sharing of ancestral haplotypes among related species, is identified as a primary cause of barcoding failure in numerous groups. Rapid and recent speciation, particularly within the Gomphocerinae subfamily of Acrididae, which diverged approximately in the last 6.44 million years (
The current delimitation analyses underscore the limitations of barcoding in distinguishing extremely recently diverged species, while also highlighting interesting patterns in less recently diverged lineages that may conceal cryptic taxa, due to nearly identical morphological and bioacoustic characteristics among different species.
A prime example is provided by the insular Greek populations of Eupholidoptera Maran, 1953 and Poecilimon Fischer, 1853, particularly species such as E. smyrnensis and P. cretensis Werner, 1903. Despite being classified as a single taxon, populations from different islands or even different regions within the same island (e.g., Crete) exhibit clear genetic divergence. This divergence is influenced by the mutation rate of the marker in question (COI), which dictates the expected number of mutations diagnostic of a species. The mutation rate determines the pace at which neutral divergence accumulates, potentially enlarging the barcoding gap in rapidly evolving species (
As previously noted, several evolutionary processes, predominantly hybridization and incomplete lineage sorting (ILS), significantly influence the performance of species delimitation based on barcode sequences. Utilizing methods based on diverse theoretical frameworks helps mitigate biases associated with the limitations of each algorithm.
Taxonomically problematic groups, including Platycleidini, Gomphocerinae, and Stenobothrinae, could be more accurately delimited and phylogenetically resolved through the adoption of more rigorous protocols.
High-throughput genomic techniques promise to effectively identify divergent lineages among both widespread and endemic species.
Future investigations should leverage high-throughput genetic methodologies not only to address existing taxonomic uncertainties but also to explore the biogeographical characteristics of endemic orthopteran species, which may demonstrate greater (rather than less) variation and divergence compared to their more ubiquitous counterparts.
Our barcoding data, thanks to rigid quality protocols and a representative set of assessed species, will be useful for a wide range of applications in taxonomy, conservation management and ecology. The well-known advantages of DNA barcoding will allow a reliable identification of many Mediterranean orthopterans, even just basing on biological fragments and soil eDNA. This becomes crucial for the detection of the presence of endangered species, both globally and locally, in a certain area. Barcodes of IUCN critically endangered species like Isophya harzi Kis, 1960, Zubovskya banatica Kis, 1965 and Chorthippus acroleucus (Müller, 1924) were here published for the first time on BOLD systems (
To minimize misidentification and taxonomic confusion, species exhibiting BIN sharing should not be fully considered in species delimitation conclusions, as resolving such cases requires more robust investigations using multi-locus or genomic approaches. However, these species delimitation results provide key insights, including evidence of potential cryptic undescribed taxa, instances of synonymy requiring taxonomic revision, and geographically structured divergent lineages. This comprehensive preliminary assessment of orthopteran biodiversity lays the groundwork for future in-depth taxonomic studies, population genomic analyses, and expanded sampling programs, ultimately advancing the understanding of the evolutionary and biogeographic dynamics of orthopterans in the Mediterranean basin.
This research was partly subsidized by Erasmus+ Internship Program, Stichting Fonds Pontium for systematic zoology and Synthesys+ Call 4 NL-TAF: “Barcoding of Mediterranean grasshoppers” and by personal funding to LSD from Heinrich-Böll-Stiftung.
The authors report there are no competing interests to declare.
We are deeply thankful to Prof. Dr. Martin Husemann, Rob Felix and Prof. Dr. Bruno Massa for their help with specimen identification and for providing key publications. Our gratitude is further extended to Mr. Matteo Calafato, Dr. Giobbe Forni, and Dr. Jorge Luis Ramirez Malaver for their invaluable support with the application of SPdel v.2.0 (
This project was funded by Synthesys + Call 4 (TAF-NL), DGFO research fund for young scientists and by a doctoral stipend from Heinrich-Böll-Stiftung to LSD.
Files S1–S8
Data type: .zip
Explanation notes: File S1. notes on the protocols involved in PCR at LIB (Protocol a), PCR at Naturalis (Protocol b) and DNA sequencing (Protocol c). — File S2. Maximum Likelihood trees inferred for caeliferans (Graph a) and ensiferans (Graph b). — File S3. Graphic outputs from SPdel (