Research Article |
|
Corresponding author: Sumeyra Nur Sanal Demirci ( sumeyrasanal@gmail.com ) Academic editor: Arianna Thomas-Cabianca
© 2026 Sumeyra Nur Sanal Demirci, Volkan Kılıç, Serap Mutun, A. Yavuz Kılıç.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Sanal Demirci SN, Kılıç V, Mutun S, Kılıç AY (2026) Insights into the molecular diversity and phylogeographic structure of Tabanus bifarius Loew, 1858 (Diptera: Tabanidae). Arthropod Systematics & Phylogeny 84: 31-46. https://doi.org/10.3897/asp.84.e162008
|
Tabanus bifarius Loew, 1858 is a horsefly species distributed throughout the Mediterranean region and in the areas of central and southeastern Europe and partly in Asia. In this study, we conducted the first comprehensive phylogeographic and population genetic analysis of T. bifarius collected from 13 localities across Türkiye and Iran, through utilizing mitochondrial (COI) and nuclear DNA (ITS1–5.8S–ITS2) markers. A total of 81 haplotypes and 59 nuclear alleles were identified among 187 sequenced individuals. Both mitochondrial and nuclear DNA patterns and diversity observed in T. bifarius are indicative of complex historical and demographic processes. Among all the populations studied, Eskişehir and Hatay exhibited the highest genetic diversity, which may be due to the region’s topography and transitional zones. Phylogenetic analyses suggested that T. bifarius split from the outgroup species approximately 5.53 million years ago (MYA) during the Late Miocene–Early Pliocene, likely driven by tectonic and climatic events. Subsequent diversification events that occurred during times of climatic and environmental fluctuations in the Late Pliocene and Early Pleistocene also seemed to have significantly affected the species and gave rise to the formation of some important genetic lineages. Our analyses results indicate that T. bifarius exhibits a structured genetic landscape shaped by multiple refugial routes, geographic barriers, and Quaternary climatic oscillations. Further, our findings suggest that the species likely entered Anatolia through three distinct routes: (1) from the Levant region into southern Anatolia via Hatay; (2) from the Caucasus into northeastern Anatolia through Ardahan; and (3) from the Iranian Plateau into eastern Anatolia via Van.
Horsefly, Population genetics, Phylogeography, COI, ITS
Horseflies (Diptera: Tabanidae) represent an ancient lineage, with fossil evidence tracing back to the Cretaceous period (
The genus Tabanus Linnaeus, 1758 in particular, includes species of considerable economic and medical importance, yet phylogenetic and population genetic investigations remain limited. Among these, Tabanus bifarius Loew, 1858 is one of the lesser-studied species, with no comprehensive phylogenetic analyses reported from Türkiye or neighboring Mediterranean countries. This paucity of genetic data constrains our understanding of the species’ evolutionary history and biogeographic patterns. T. bifarius, is a horsefly species commonly found in open habitats and forested areas ranging from sea level to elevations of 1,000 meters.
Its distribution spans the Mediterranean region (Albania, Croatia, France, Greece, Italy, Morocco, Spain, Syria, Tunisia, Jordan, Algeria, Montenegro, Slovenia), southeastern (Bosnia and Herzegovina, Bulgaria, North Macedonia, Romania, Serbia, Kosovo) and central Europe (Austria, Czech Republic, Germany, Hungary, Slovakia) as well as the territory of Russia, Ukraine, Armenia, Georgia, Azerbaijan, Iran, Afghanistan, Kazakhstan (Abbasian, 1964; Leclercq, 1966, 1967; Chvála et al., 1972; Chvála et al., 1988; Krčmar and Durbešić, 1999; Krčmar et al., 2002, 2004; Krčmar, 2011; Mazraeh et al., 2018;
Türkiye, as a native habitat for many species and a refuge for organisms affected by past geological and climatic shifts, represents a uniquely biodiverse region globally. In addition to serving as a center of origin and dispersal for numerous taxa, Türkiye holds a pivotal position within the Palearctic realm, functioning as a transitional zone for Eremial, Boreal, and Central European species. Notably, it is the only country encompassing three globally recognized biodiversity hotspots: the Caucasus, the Irano-Anatolian, and the Mediterranean regions (Demirsoy, 2002). Research over recent decades has demonstrated that Türkiye served as a refugium for many species during glacial periods and played a critical role in the post-glacial recolonization of Europe (Çıplak, 2008). Furthermore, it has been identified as a center of genetic diversity and the origin of many European taxa (Bilgin, 2011; Çıplak, 2008; Hewitt, 2000; Rokas et al., 2003). Although the phylogenetic history of numerous Anatolian taxa has been investigated, comprehensive studies are still needed to fully understand the scope and complexity of Türkiye’s genetic diversity, which continues to represent a key for a great deal of European biodiversity.
Previous phylogeographic research on T. bromius Linnaeus, 1758 in Türkiye provided baseline insights into the population structure and gene flow of horseflies, offering a reference point for further studies on the genus (Sanal-Demirci et al., 2021). Building on this foundation, the present study represents the second molecular investigation of a Tabanus species from Türkiye, focusing on T. bifarius. Specifically, we aimed to: (i) identify intraspecific genetic variation across the sampled range; (ii) elucidate the phylogeographic structure of the species; and (iii) assess the potential impacts of historical climate change on current patterns of genetic diversity within T. bifarius populations. We anticipate that the findings will enhance understanding of T. bifarius’ ecological and evolutionary dynamics and provide insights for management and control of horseflies as potential disease vectors.
Females of T. bifarius were collected from 12 locations across Türkiye, covering most of the species’ known distribution range. Fresh specimens were captured using Malaise traps, and additional samples were obtained from the Zoology Museum of Eskişehir Technical University’s Faculty of Science, Department of Biology (ESTU-ZM). Specimens from Iran and Croatia were also included in the study (Fig.
The specimens used in this study were identified by Prof. Dr. A. Yavuz Kılıç based on morphological keys (Chvála et al., 1972; Kılıç, 1999, 2006). Females are medium-sized horseflies, with body lengths ranging from 12 to 15 mm. Greyish to olive-grey in color, with a relatively narrow frons and well-separated calli; the lower callus is brownish. Eyes haired and with three bands. Antennae are reddish-brown with a dark apex. The thorax exhibits dark brown coloration with distinct yellowish stripes, while the abdomen shows alternating dark and light bands. Notable variability is observed in antennal coloration and abdominal patterns.
In total, genomic DNA was successfully extracted from 187 individuals, including 146 fresh specimens and 41 museum samples, representing 13 populations (182 from Türkiye and 5 from Iran), using a modified version of the extraction protocol by
Populations, abbreviations, coordinates, and sample sizes of T. bifarius. Haplotype and nucleotide diversity values are given for each population: COI data are shown in the upper row and ITS1–ITS2 data in the lower row.
| Populations | Abb. | Coordinate | Sample Size/ NHAP/ ALLELES | Haplotype/Nucleotide Diversity | ||
| COI | ITS1-ITS2 | COI | ITS1-ITS2 | |||
| Antalya | ANT | 36°53.81’N; 31°42.80’E | 13/9 | 13/7 | 0.9103 +/– 0.0683 0.7308 +/– 0.1332 | 0.0214 +/– 0.0115 0.0058 +/– 0.0032 |
| Ardahan | ARD | 41°06.52'N; 42°42.13'E | 11/2 | 9/4 | 0.1818 +/– 0.1436 0.5833 +/– 0.1833 | 0.0044 +/– 0.0028 0.0135 +/– 0.0075 |
| Çanakkale | CAN | 40°09.12'N; 26°24.33'E | 20/7 | 20/4 | 0.6895 +/– 0.1053 0.5368 +/– 0.1042 | 0.0014 +/– 0.0011 0.0005 +/– 0.0004 |
| Denizli | DNZ | 37°47.00'N; 29°05.78'E | 3/3 | 3/3 | 1.0000 +/– 0.2722 1.0000 +/– 0.2722 | 0.0210 +/– 0.0163 0.0016 +/– 0.0015 |
| Edirne | EDR | 41°40.62'N; 26°33.33'E | 20/9 | 10/4 | 0.8842 +/– 0.0494 0.7333 +/– 0.1199 | 0.0025 +/– 0.0017 0.0008 +/– 0.0006 |
| Erzincan | ERZ | 39°44.78'N; 39°29.48'E | 14/3 | 10/1 | 0.6923 +/– 0.0652 0.0000 +/– 0.0000 | 0.0175 +/– 0.0095 0.0000 +/– 0.0000 |
| Eskişehir | ESK | 39°46.00'N; 30°31.60'E | 19/15 | 19/9 | 0.9708 +/– 0.0273 0.7310 +/– 0.1090 | 0.0160 +/– 0.0085 0.0043 +/– 0.0024 |
| Hatay | HTY | 36°25.49'N; 36°10.27'E | 8/7 | 8/7 | 0.9643 +/– 0.0772 0.9643 +/– 0.0772 | 0.0160 +/– 0.0093 0.0152 +/– 0.0086 |
| Kayseri | KAY | 38°44.09'N; 35°28.08'E | 20/8 | 20/12 | 0.7789 +/– 0.0829 0.9000 +/– 0.0532 | 0.0128 +/– 0.0069 0.0144 +/– 0.0074 |
| Muğla | MUG | 37°13.00'N; 28°22.00'E | 22/12 | 12/2 | 0.8658 +/– 0.0652 0.1667 +/– 0.1343 | 0.0175 +/– 0.0092 0.0001 +/– 0.0002 |
| Sinop | SNP | 42°01.60'N; 35°09.07'E | 20/9 | 20/11 | 0.6526 +/– 0.1225 0.8842 +/– 0.0535 | 0.0137 +/– 0.0074 0.0218 +/– 0.0111 |
| Van | VAN | 38°29.99'N; 43°22.69'E | 12/4 | 7/5 | 0.7879 +/– 0.0701 0.8571 +/– 0.1371 | 0.0040 +/– 0.0025 0.0019 +/– 0.0013 |
| Iran | IRN | 38°51.00'N; 44°37.00'E | 5/4 | 5/4 | 0.9000 +/– 0.1610 0.9000 +/– 0.1610 | 0.0040 +/– 0.0029 0.0044 +/– 0.0030 |
A 650-base pair (bp) fragment of the mitochondrial cytochrome c oxidase subunit I (COI) gene was amplified using the LCO1490–HCO2198 primer pair (
The number of polymorphic sites (S), haplotype (h) and nucleotide (π) diversity (
Both data sets were analyzed separately to reveal intraspecific phylogenetic relationships among Tabanus bifarius populations. Tabanus maculicornis Zetterstedt, 1842 (GenBank: MZ563331.1), Tabanus shannonellus Kröber, 1936 (GenBank: MZ563357.1) for the COI haplotypes, T. bromius-1 (GenBank: MK936073) and T. bromius-2 (GenBank: MK936162) for the ITS alleles were used as outgroups in all phylogenetic analyses. Maximum parsimony (MP), maximum likelihood (ML) and Bayesian analyses were used to deduce phylogenetic relationships among haplotypes/alleles using PAUP*4.0b10 (
A total of 81 unique COI haplotypes were identified from 187 T. bifarius individuals sampled from Türkiye and Iran. Seventy-seven haplotypes were detected across all Turkish populations, while the five Iranian specimens yielded four distinct haplotypes (Table SS1). Among the 81 haplotypes, 87 polymorphic sites were identified, 68 of which were parsimony informative. Nucleotide composition was biased toward adenine and thymine, with frequencies of 37.8% thymine (T), 30.0% adenine (A), 16.9% guanine (G), and 15.1% cytosine (C), consistent with typical mitochondrial DNA. No nonsense mutations or internal stop codons were observed in the translated amino acid sequences, supporting the mitochondrial origin of the data. Translation of the COI sequences into 216 amino acids revealed polymorphism at only seven positions. Pairwise nucleotide differences among haplotypes ranged from 1 bp (0.1%) to 37 bp (6.0%). Overall haplotype diversity was high (h = 0.9608 ± 0.0080), and nucleotide diversity was moderate (π = 0.026804 ± 0.013240). Of the 81 haplotypes, six were shared across multiple populations, 22 were private (restricted to a single population), and 53 were singletons (found in only one individual). Notably, all Iranian haplotypes were shared exclusively with the Van population in eastern Türkiye, suggesting a possible historical or contemporary connection between these regions.
The amplified ITS1–5.8S rRNA–ITS2 region from the nuclear genome of T. bifarius totaled 1,339 base pairs (bp). The 5.8S region (123 bp) was invariant across all samples and was therefore excluded from subsequent analyses. Only the ITS1 and ITS2 regions were concatenated and used for allele identification. The resulting combined ITS1–ITS2 sequence was 1,216 bp in length and showed no length variation among the 156 successfully amplified samples. These sequences were collapsed into 59 unique alleles. Among the 1,216 bp sequences, 130 sites were polymorphic, with 127 of these being parsimony-informative. The overall nucleotide composition was 42.14% adenine (A), 35.07% thymine (T), 13.95% guanine (G), and 8.83% cytosine (C). Pairwise differences between alleles ranged from 1 bp to 106 bp. Allele diversity was high (Ad = 0.8883 ± 0.0210), and nucleotide diversity was π = 0.023355 ± 0.011370. Most alleles were unique: 45 were singletons, 6 were private, and only 8 alleles were shared among multiple populations. In the Iranian population, two singleton alleles were identified, along with one private allele and one shared allele (Table SS2).
Neutrality tests were performed to assess deviations from neutrality and potential changes in population size. Fu’s FS value for all COI haplotype analyses was significantly negative (FS = −23.4979, P ≤ 0.05), suggesting population expansion. In contrast, Tajima’s D, however, was not significant (Tajima’s D = −0.5032, P ≥ 0.05). Both Harpending’s raggedness index Hri = 0.0036 and SSD = 0.0059 were low, further supporting a recent demographic expansion. When analyzed individually, several Turkish populations such as Çanakkale and Edirne showed signals of past demographic expansion, whereas the remaining populations did not yield statistically significant results. For both the Ardahan and Çanakkale populations, Tajima’s D value was significantly negative (Tajima’s D = −2.0832, P ≤ 0.05; Tajima’s D = −1.6916, P ≤ 0.05, respectively), suggesting deviations from neutrality, possibly due to population expansion (Table
Fu’s FS, Tajima’s D, Hri and SSD values calculated for each population. A. COI gene. B. ITS1–ITS2 region (*p≤ 0.05).
| A | ||||
| COI | ||||
| Population | Fu’s Fs | Tajima’s D | Hri | SSD |
| Antalya | 1.0239 | 0.6075 | 0.1668 | 0.0948* |
| Ardahan | 5.5287 | –2.0835* | 0.7355 | 0.0276 |
| Çanakkale | –3.7110* | –1.6916* | 0.1230 | 0.0112* |
| Denizli | 1.4687 | 0.0000 | 0.4444 | 0.2491 |
| Edirne | –4.0990* | –1.1700 | 0.0964 | 0.0128 |
| Erzincan | 11.4371 | 2.7513 | 0.2127 | 0.1153* |
| Eskişehir | –3.0910 | –0.5323 | 0.0553 | 0.0237 |
| Hatay | –0.3717 | 1.5241 | 0.0344 | 0.0410 |
| Kayseri | 2.6647 | –1.1741 | 0.0965 | 0.0409 |
| Muğla | 0.8329 | 1.4942 | 0.0217 | 0.0166 |
| Sinop | 1.9586 | –0.8186 | 0.0900 | 0.0175 |
| Van | 1.5936 | 1.1886 | 0.1037 | 0.0246 |
| Iran | –0.5674 | –0.6682 | 0.0700 | 0.0148 |
| B | ||||
| ITS1-ITS2 | ||||
| Population | Fu’s Fs | Tajima’s D | Hri | SSD |
| Antalya | 1.2312 | 1.9605 | 0.0836* | 0.0283* |
| Ardahan | 6.9650 | –1.5900* | 0.2878 | 0.0642 |
| Çanakkale | –1.0060 | –0.6427 | 0.1257* | 0.0097* |
| Denizli | –0.6931 | 0.0000 | 0.0000 | 0.0000 |
| Edirne | –0.8763 | 1.3372 | 0.1511* | 0.0147* |
| Erzincan | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Eskişehir | –0.0303 | 1.2186 | 0.0727* | 0.0217* |
| Hatay | 0.6548 | 1.4273 | 0.0573 | 0.0339 |
| Kayseri | 1.8026 | –0.4034 | 0.0173 | 0.0120 |
| Muğla | –0.4756 | –1.1405 | 0.4722 | 0.0003 |
| Sinop | 4.8014 | –0.2735 | 0.0354 | 0.0214 |
| Van | –1.1767 | –0.1411 | 0.0204 | 0.0041 |
| Iran | 0.6122 | 0.8945 | 0.2300 | 0.0846 |
When analyzed individually, the Çanakkale and Edirne populations exhibited unimodal mismatch distribution curves, indicative of recent population expansion. On the contrary, the Ardahan, Denizli, and Erzincan populations displayed bimodal curves, while the remaining populations showed multimodal patterns. These latter distributions suggest demographic stability or population decline (Fig. S1). In the ITS1–ITS2 region data for all the populations, neither Fu’s FS nor Tajima’s D value was a significant value (FS = −1.5243, P ≥ 0.05; Tajima’s D = −0.7292, P ≥ 0.05) while Hri (0.0121) and SSD (0.0152) values calculated for all the alleles were low. When each population was analyzed separately, Fu’s FS values were non-significant for all populations. Only the Ardahan population generated a significant negative value in Tajima’s D (−1.5900, P ≤ 0.05). The remaining populations produced no significant values (Table
The pairwise FST values showed significant genetic differentiation for the COI data set (Table SS3). The highest FST value was observed between Çanakkale and Ardahan (FST = 0.9484, P ≤ 0.05), followed by Iran (FST = 0.9434, P ≤ 0.05) and Van (FST = 0.9304, P ≤ 0.05), respectively. The next highest statistically significant FST values involving another population were observed between Edirne and Ardahan (FST = 0.9361, P ≤ 0.05), Iran (FST = 0.9167, P ≤ 0.05), and Van (FST = 0.9101, P ≤ 0.05). The lowest FST value involving the Iranian population was with the Van population (FST = 0.1318, P ≥ 0.05), even though it was statistically insignificant. Furthermore, for the Iranian population, the highest level of genetic differentiation was observed with the Çanakkale population. Pairwise FST analysis using the ITS1–ITS2 dataset supported the presence of considerable genetic differentiation among populations (Table S4). The highest FST values were observed in comparisons between Denizli and Erzincan (FST = 0.965, P ≤ 0.05), followed by Denizli and Muğla (FST = 0.958, P ≤ 0.05), and Denizli and Çanakkale (FST = 0.929, P ≤ 0.05). In contrast, the lowest levels of genetic differentiation were found between Antalya and Iran (FST = 0.0146, P ≥ 0.05) and Erzincan and Muğla (FST = 0.0161, P ≥ 0.05), all of which were statistically insignificant. For the Iranian population, the highest differentiation was detected between Iran and Ardahan (FST = 0.808, P ≤ 0.05), whereas the lowest value was found between Iran and Antalya (FST = 0.0146, P ≥ 0.05).
An analysis of the hierarchical distribution of genetic diversity based on the COI dataset revealed that the highest level of genetic variation occurred within populations (30.61%, P ≤ 0.01), followed by variation among populations (39.72%, P ≤ 0.01) and among groups (29.68%, P ≤ 0.01), when all populations were divided into two groups (Group 1: IRN, VAN; Group 2: the other Turkish populations. The genetic differentiation between these groups was high (FST = 0.69, P ≤ 0.001; Table
AMOVA analysis of T. bifarius. A Analysis of the COI data; grouping comprises two groups: Group 1: IRN, VAN; Group 2: ANT, ARD, CAN, DNZ, EDR, ERZ, ESK, HTY, KAY, MUG, SNP. B Analysis of the ITS1–ITS2 data; grouping comprises three groups: Group 1: ANT, DNZ, MUG; Group 2: ARD, CAN, EDR, ERZ, ESK, HTY, KAY, SNP; Group 3: IRN, VAN.
| Source variation | d.f. | Sum of squares | Variance components | Percentage of variation |
| A | ||||
| Among groups | 1 | 166.191 | 3.59828 | 29.68* |
| Among pops. | 11 | 808.429 | 4.81597 | 39.72* |
| Within pops. | 174 | 645.706 | 3.71096 | 30.61* |
| Total | 186 | 1.620.326 | 12.1252 | |
| B | ||||
| Among groups | 9 | 1.498.111 | 9.90869 | 64.49* |
| Among pops. | 3 | 31.013 | 0.75752 | 4.93* |
| Within pops. | 143 | 671.870 | 4.69839 | 30.58* |
| Total | 155 | 2.200.994 | 15.3646 |
On the contrary, analysis based on the ITS1–ITS2 dataset showed that 30.5% of the genetic variation was within populations, 4.93% among populations, and 64.49% among groups, when all populations were divided into three groups (Group 1: ANT, DNZ, MUG; Group 2: ARD, CAN, EDR, ERZ, ESK, HTY, KAY, SNP; Group 3: IRN, VAN). The genetic differentiation between these groups was also high (FST = 0.69, P ≤ 0.01) (Table
The ML, MP, and BEAST tree topologies were largely congruent, differing only in bootstrap or posterior probability values at certain branches. Consequently, the BEAST tree for the COI dataset and the ML tree for the ITS dataset are presented here. Molecular analyses based on the COI gene suggest that T. bifarius diverged from the outgroup taxa approximately 5.52 million years ago (MYA) (Fig.
Subsequently, T. bifarius split into two distinct haplogroups, Clades A and B, around 3.63 MYA. In Clade A, a basal haplotype from the Eskişehir population (H45), located in central Anatolia, diverged from its sister group approximately 1.98 MYA, forming a monophyletic group with the remaining haplotypes. This sister group later split into two subclades, both primarily composed of haplotypes from northern Anatolia. Clade B, the second major lineage, is divided into two subclades: B1 and B2. Their divergence is estimated to have occurred around 2.5 MYA. Subclade B1 is characterized by a basal haplotype from the Kayseri population, which diverged approximately 1.25 MYA. The remaining haplotypes in this subclade are mainly from the Hatay and Antalya populations. Subclade B2 shows a more structured organization and is dominated by haplotypes from western and central Anatolia. It is further divided into two geographically distinct subclades: B2-A and B2-B, which split around 1.84 MYA. B2-A includes basal haplotypes (H47, H48, and H49) from the Hatay population, which diverged approximately 1.33 MYA to form a minor distinct clade. The remaining haplotypes within B2-A, originating from Erzincan, Van, and Iran, formed a separate lineage around 1.09 MYA. Despite this diversity, B2-A remains primarily composed of haplotypes from western and central Türkiye, with diversification dating to around 0.7 MYA. B2-B features a basal haplotype (H40) from the Eskişehir population, which diverged approximately 0.92 MYA. This subclade subsequently split around 0.54 MYA into two clades, each consisting of smaller groups of haplotypes predominantly from western Anatolia.
The general pattern observed in the COI gene phylogenetic trees is further supported by the results of the minimum spanning network analysis (Fig.
The first group, labeled as A in the network, corresponds to Clade A in the phylogenetic trees and includes haplotypes from central, northern, and northeastern Anatolia. The remaining haplotypes, forming subclades B1 and B2 (including B2-A and B2-B) in the BEAST tree, are also clearly represented in the network as distinct clusters labeled B1, B2-A, and B2-B. The B1 haplogroup is predominantly composed of haplotypes from central and southern Anatolia. A small star-like phylogeny is observed within this group, with haplotype H62, originating from Kayseri from the central Anatolia. The B2-A haplogroup primarily contains haplotypes from western and central Türkiye, along with another cluster that includes both Iranian and eastern Anatolian (Van, Erzincan) haplotypes. In the B2-B subgroup, haplotypes H25 and H4, shared and among the most frequent, serve as central nodes connected to other haplotypes mostly found in western Anatolia. Several small star-like phylogenies are also evident within this group, which aligns closely with Clade B2-B in the phylogenetic tree.
Both phylogenetic analyses (ML and MP) of the concatenated ITS1–ITS2 region of T. bifarius produced similar topologies, differing only in bootstrap and posterior probability values. Therefore, only the ML tree is presented here, with BI posterior probabilities on the corresponding branches. All branches in the tree are strongly supported (Fig.
Analysis of the ITS1–ITS2 region revealed a predominantly polytomous tree structure, indicating relatively low phylogenetic resolution. The A41 allele from the Kayseri population occupies the most basal position in the tree. The topology is divided into two sister clades. The first clade includes a small group formed by the A16, A15, and A14 alleles from the Denizli population, as well as a larger subclade comprising the A26, A28, A27, A32, A31, and A29 alleles, all sampled from Hatay. The second is characterized by a basal allele from the Eskişehir population and includes additional alleles from Eskişehir, Antalya, and Sinop, suggesting a distinct lineage. Within the broader polytomous structure, the A1 allele (from Antalya) also occupies a basal position. The remainder of the tree is composed of numerous unresolved branches, consisting of alleles or allele groups from populations in eastern, central, northern, and western Türkiye, as well as from Iran.
The minimum spanning network based on the ITS region is presented in Fig.
The most frequent and widely distributed allele, A2 (found in 48 individuals), occupies a central position in a star-like configuration, suggesting it may represent an ancestral or founding allele. A2 is shared among six geographically diverse populations: Antalya, Çanakkale, Edirne, Erzincan, Eskişehir, and Muğla. Another widely shared allele, A4, is present in individuals from the Antalya, Eskişehir, and Sinop populations. Notably, a distinct connection pattern is evident between the Iranian population and the Van population in eastern Türkiye. This pattern may indicate historical gene flow or shared ancestry between these geographically separated populations, supporting the hypothesis of east–west dispersal or connectivity across the region.
This study provides the first comprehensive analysis of mitochondrial and nuclear genetic variation in T. bifarius populations from Türkiye and Iran. The high haplotype diversity (h = 0.9608) and nucleotide diversity (π = 0.0268) observed in the COI gene suggest a long evolutionary history, potentially involving multiple glacial refugia and complex post-glacial recolonization pathways in the region. The A+T content in the COI gene T. bifarius was found to be 67.72%, which falls within the typical range reported for mitochondrial coding genes in many insect groups. Additionally, amino acid polymorphism in the COI gene was relatively low, with only 7 out of 216 residues exhibiting variation. This pattern is commonly observed in mitochondrial genes of insects and other metazoans, largely due to the essential role of mitochondrial-encoded proteins in oxidative phosphorylation. (
In the present study, the highest haplotype diversity was observed in the Eskişehir population, whereas the greatest nuclear allelic diversity was detected in Hatay. In contrast, our previous research on T. bromius revealed that both mitochondrial and nuclear diversity peaked in the Sinop population, located along the northern Black Sea coast. These distinct genetic diversity patterns in T. bifarius and T. bromius provide valuable insights into their respective adaptations to the historical and ecological dynamics of Anatolia. Eskişehir, in central-western Türkiye, may act as a long-term refugial area or secondary contact zone for T. bifarius, reinforcing remarkable haplotype diversity. Nuclear allele diversity was highest in Hatay, a biogeographically transitional region connecting Anatolia to the Levant, potentially acting as a corridor for gene flow and historical dispersal (
The impact of historical factors can be critically assessed through a range of demographic analyses of population data (Avise, 2004). For T. bifarius, Fu’s FS value (−23.4979) is significantly negative, indicating an excess of rare haplotypes, which suggests deviation from neutrality and is consistent with population expansion. This inference is further supported by the mismatch distribution analysis of the COI data, where all populations exhibited a multimodal profile (Hri = 0.0036; SSD = 0.0059). Although these values were not statistically significant, the overall pattern aligns with a demographic history of expansion (
When the results obtained for T. bifarius in the current study are compared with findings from our previous work, T. bromius displays partially similar yet more structured demographic patterns. This increased structuring may reflect a more complex and regionally heterogeneous demographic history, potentially linked to T. bromius’s broader distribution across the Palearctic region and greater sensitivity to climatic fluctuations during the Pleistocene (Hewitt, 2000; Schmitt, 2007). In contrast, T. bifarius exhibits more recent or incomplete signals of demographic change, likely shaped by its Mediterranean distribution and associated ecological constraints.
Our previous phylogenetic analyses indicated that T. bromius diverged from its outgroup taxa approximately 8.83 MYA, during the Late Miocene (Sanal-Demirci et al. 2021). This period was characterized by dramatic climatic shifts and the widespread expansion of pasture-dominated habitats, highlighting the evolutionary significance of T. bromius in adapting to dynamic environmental conditions. The Miocene epoch witnessed substantial climatic and ecological fluctuations, including the proliferation of open habitats such as grasslands. These environmental changes likely created new ecological opportunities for Tabanid flies, including increased host availability and more favorable breeding conditions. Our current research estimates that T. bifarius diverged from outgroup taxa approximately 5.52 million years ago, during the Messinian period of the Late Miocene. This finding aligns closely with our previous results and strongly supports the hypothesis that the significant diversification within the Tabanus lineage was predominantly driven by transformative environmental events occurring during the Miocene. The consistent results observed for both T. bromius and T. bifarius underscore the pivotal role of the Miocene in the early diversification of the Tabanus genus. Additionally,
When molecular data from both markers are considered together, they provide mutually supportive evidence regarding the origin and distribution of the species in Türkiye. Phylogenetic tree analyses indicate that one of the species’ entry routes into Türkiye is from Transcaucasia in the northeast, specifically through the Ardahan area. It is hypothesized that the Ardahan population remained isolated for an extended period due to glacial conditions, subsequently expanding from the northern side of the North Anatolian Mountains towards Sinop. Another probable entry route is from Hatay in the south, with this population dispersing between the Southern Taurus Mountains and the Central Anatolian Diagonal in the direction of Kayseri. The population that reached Central Anatolia likely merged with the Sinop population, facilitating further expansion into the southern and western regions. The detection of the A2 allele (N = 48) in the western region of the Anatolian Diagonal, a prominent mountain range dividing Anatolia into eastern and western halves, suggests that T. bifarius has been significantly influenced by this geographic barrier (
The research was supported by Anadolu University Scientific Research Project Commission of Türkiye (Project No: 1410F406).
The Croatian and Iranian samples were kindly provided by Dr. Stjepan Krčmar and Dr. Samad Khaghaninia, respectively, and the authors are very grateful to them.
Figures S1, S2
Data type: .pdf
Explanation notes: Figure S1. Mismatch distribution profiles of each population based on COI haplotypes. The observed distribution is shown by the red line, and the expected distribution under the expansion model is shown by the green line. — Figure S2. Mismatch distribution profiles of each population based on ITS1–ITS2 alleles. The observed distribution is represented by the red line, and the expected distribution under the expansion model is represented by the green line.
Tables S1–S4
Data type: .pdf
Explanation notes: Table SS1. COI haplotypes and their frequencies in populations. — Table SS2. ITS1–ITS2 alleles and their frequencies in populations. — Table SS3. COI gene FST analysis results showing pairwise genetic variations between populations. — Table S4. ITS region FST analysis results showing pairwise genetic variations between populations.