Research Article |
Corresponding author: Sze-Looi Song ( szelooi@um.edu.my ) Academic editor: André Nel
© 2024 Hoi-Sen Yong, Sze-Looi Song, Kah-Ooi Chua, Yvonne Jing Mei Liew, Kok-Gan Chan, Phaik-Eem Lim, Praphathip Eamsobhana.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
|
Bactrocera (Bulladacus) cinnabaria and B. (Bactrocera) propinqua are tephritid fruit flies of the subfamily Dacinae, tribe Dacini. The whole mitogenomes of these two species (first report for the subgenus Bulladacus) possess 37 genes (13 protein-coding genes – PCGs, 2 rRNA and 22 tRNA genes). The mitogenome of B. cinnabaria (15,225 bp) is shorter than that of B. propinqua (15,927 bp), mainly due to the smaller size of the control region and intergenic spacers in B. cinnabaria. Molecular phylogeny based on mitochondrial genes (mt-genes) reveals two clades of the genus Bactrocera: one comprising the subgenus Bactrocera and the other comprising the subgenera Bulladacus, Daculus, Tetradacus and unassigned Bactrocera sp. ‘yunnanensis’. The subgenera represented by two or more taxa are monophyletic. B. (Bulladacus) cinnabaria forms a sister group with the subgenus Tetradacus (B. minax and B. tsuneonis) and B. sp. ‘yunnanensis’, in a clade containing also the basal sister lineage of the subgenus Daculus (B. oleae and B. biguttula). B. propinqua forms a sister group with B. ritsemai and B. limbifera in a subclade containing also B. umbrosa, B. curvifera and B. moluccensis of the monophyletic subgenus Bactrocera. The present study supports the synonymy of B. ruiliensis with B. thailandica. It also shows a high genetic similarity between (a) B. melastomatos and B. rubigina, (b) B. papayae and B. philippinensis, (c) B. dorsalis and B. invadens, (d) B. tryoni and B. neohumeralis, and (e) B. cheni and B. tuberculata; and B. cheni is distinct from and not a synonym of B. tsuneonis or B. lombokensis.
Bactrocera subgenera, Dacinae, Fruit flies, Mitogenomes, Molecular phylogeny
True fruit flies of the genus Bactrocera Macquart are members of the family Tephritidae, subfamily Dacinae, tribe Dacini. Based on the classification with Zeugodacus Hendel as a genus, the genus Bactrocera consists of 461 species worldwide, with 451 species in the Asia-Pacific and 13 species in Africa (
Fifteen subgenera are recognized under the genus Bactrocera: (1) Bactrocera group of subgenera – Apodacus Perkins, Bactrocera Macquart, Bulladacus Drew & Hancock, Calodacus Hancock, Queenslandacus Drew, Semicallantra Drew, and Trypetidacus Drew; (2) Melanodacus group of subgenera – Daculus Speiser, Gymnodacus Munro, Hemizeugodacus Hardy, Neozeugodacus May, Notodacus Perkins, Paratridacus Shiraki, and Parazeugodacus Shiraki; and (3) Tetradacus Miyake (
Several aspects of the Bactrocera fruit flies have been widely studied, such as taxonomy and systematics (
Most of the studies on the molecular phylogeny of the genus Bactrocera (and other tephritid fruit flies) are based on partial sequences of single or multiple mitochondrial and nuclear genes (
In view of the potential application of mitochondrial genomes (mitogenomes) in studies regarding phylogeny and evolution (
Bactrocera cinnabaria is found in Andaman and Nicobar Islands (
Bactrocera propinqua has been documented in Bangladesh, China, Thailand, Cambodia, Laos, Vietnam, Malaysia, Singapore, and Indonesia (
The larvae of B. cinnabaria and B. propinqua feed on a variety of fruits, including both cultivated and wild species. They are economically important pests, posing significant challenges to agriculture and fruit production.
Both B. cinnabaria and B. propinqua were collected by H-S Yong from the garden of the Institute of Biological Sciences, Universiti Malaya, Malaysia (3°07ʹ9.00ʺN, 101°39ʹ13.79ʺE). Fruit flies of B. cinnabaria hatched from the infested fruits of Gnetum gnemon. Male fruit flies of B. propinqua were collected by application of cue-lure on the surface of a green leaf. Cue-lure is a synthetic pheromone that attracts male fruit flies, and when applied to the leaf, it effectively lures them in for collection. The specimens were preserved in absolute ethanol and stored in a −20°C deep freezer until use for DNA extraction. They were identified according to existing literature (
The complete mitogenomes of the genus Bactrocera (n = 32 species) available from the GenBank (Table S1) were used for phylogenetic comparison. Three other tephritid mitogenomes (Ceratitis capitata NC_000857, Ceratitis fasciventris NC_035497, and Ceratitis rosa NC_053847) available from the GenBank were used as the outgroup taxa.
Sample and library preparation (using Nextera DNA Sample Preparation Kit) and genome sequencing using the Illumina MiSeq Desktop Sequencer (150 bp paired-end reads) (Illumina, USA) were as described in
A contig identified and established as mitogenome was annotated with MITOS (
MEGA X was used to determine the nucleotide composition, amino acid frequency and relative synonymous codon usage (RSCU) (
Alignment of nucleotide sequences and reconstruction of phylograms based on 13 concatenated PCGs and 15 mt-genes (13 PCGs and 2 rRNA genes) followed that described in
The mitogenomes of B. cinnabaria and B. propinqua had similar gene order and contained 37 genes (13 protein-coding genes – PCGs, 2 rRNA genes, and 22 tRNA genes) and a non-coding region (A + T-rich control region) (Table
Complete mitogenomes of Bactrocera cinnabaria and B. propinqua with BRIG visualization showing the protein coding genes, rRNAs, tRNAs and non-coding region. GC skew is shown on the outer surface of the ring whereas GC content is shown on the inner surface. The anticodon of each tRNA gene is shown in parentheses.
Gene order and organization of the mitochondrial genomes of Bactrocera cinnabaria (Bc) and B. propinqua (Bp). *Minus sign indicates overlap. J (+) or N (–) indicates gene directions.
Gene | Strand | Size (bp) | Intergenic sequence* | Start codon | Stop codon |
Bc/Bp | Bc/Bp | Bc/Bp | Bc/Bp | ||
trnI (gat) | J | 66/66 | 1/–3 | ||
trnQ(ttg) | N | 69/69 | 11/69 | ||
trnM(cat) | J | 69/69 | |||
nad2 | J | 1023/1023 | 8/10 | ATT/ATT | TAG/TAA |
trnW (tca) | J | 68/69 | –8/–8 | ||
trnC (gca) | N | 62/63 | 1/31 | ||
trnY (gta) | N | 67/67 | –2/–2 | ||
cox1 | J | 1539/1539 | –5/–5 | TCG/TCG | TAA/TAA |
trnL2 (taa) | J | 66/66 | 4/4 | ||
cox2 | J | 690/687 | 4/7 | ATG/ATG | TAA/TAA |
trnK (ctt) | J | 70/71 | 0/3 | ||
trnD (gtc) | J | 67/67 | |||
atp8 | J | 162/162 | –7/–7 | ATC/GTG | TAA/TAA |
atp6 | J | 678/678 | –1/–1 | ATG/ATG | TAA/TAA |
cox3 | J | 789/789 | 9/9 | ATG/ATG | TAA/TAA |
trnG (tcc) | J | 65/65 | |||
nad3 | J | 354/354 | –2/–2 | ATT/ATT | TAG/TAG |
trnA (tgc) | J | 64/65 | 5/14 | ||
trnR (tcg) | J | 63/64 | 24/26 | ||
trnN (gtt) | J | 65/65 | |||
trnS1 (gct) | J | 68/68 | |||
trnE (ttc) | J | 66/67 | 18/19 | ||
trnF (gaa) | N | 65/65 | 1/0 | ||
nad5 | N | 1720/1720 | 15/15 | ATT/ATT | T/T |
trnH (gtg) | N | 65/69 | |||
nad4 | N | 1341/1341 | –7/–7 | ATG/ATG | TAA/TAG |
nad4L | N | 291/291 | 8/8 | ATG/ATG | TAA/TAA |
trnT (tgt) | J | 65/65 | |||
trnP (tgg) | N | 66/66 | 2/2 | ||
nad6 | J | 525/525 | –1/–1 | ATT/ATT | TAA/TAA |
cob | J | 1137/1137 | –2/–2 | ATG/ATG | TAG/TAG |
trnS2 (tga) | J | 67/67 | –65/–65 | ||
nad1 | N | 1020/1020 | 10/10 | ATG/ATG | TAA/TAA |
trnL1 (tag) | N | 65/65 | –23/10 | ||
rrnL | N | 1322/1291 | 29/30 | ||
trnV (tac) | N | 72/72 | –1/–1 | ||
rrnS | N | 790/790 | |||
Control region | J | 358/947 |
The A + T content for the whole mitogenome was 71.2% for B. cinnabaria and 74.1% for B. propinqua, with positive AT and negative GC skewness values (Table
A + T content (%), AT and GC skewness of Bactrocera cinnabaria (Bc) and B. propinqua (Bp) mitogenomes.
A+T% | AT skew | GC skew | ||||
Region | Bc | Bp | Bc | Bp | Bc | Bp |
Whole mitogenome | 71.2 | 74.1 | 0.084 | 0.069 | –0.250 | –0.228 |
Protein coding genes | 69.1 | 71.6 | –0.152 | –0.153 | –0.049 | –0.018 |
1st codon position | 63.8 | 65.1 | –0.069 | –0.076 | 0.166 | 0.198 |
2nd codon position | 65.1 | 65.5 | –0.382 | –0.380 | –0.167 | –0.165 |
3rd codon position | 78.2 | 84.1 | –0.028 | –0.039 | –0.211 | –0.182 |
tRNA genes | 73.9 | 75.3 | –0.009 | –0.012 | 0.088 | 0.109 |
rRNA genes | 77.4 | 77.7 | –0.083 | –0.094 | 0.304 | 0.309 |
Control region | 84.9 | 89.5 | 0.178 | 0.068 | –0.373 | –0.135 |
J strand | 67.4 | 69.8 | –0.045 | –0.060 | –0.206 | –0.166 |
N strand | 74.3 | 76.2 | 0.147 | 0.134 | –0.094 | –0.084 |
For the individual PCGs, the A + T content ranged from 62.9% for cox1 to 76.3% for nad4L in B. cinnabaria, and 65.1% for cox3 to 80.1% for nad4L in B. propinqua (Table S2). Most of the PCGs had negative AT and GC skewness values (Table S2); cox2 and nad6 had positive AT skewness values in B. cinnabaria, nad4L had positive GC skewness value in B. cinnabaria, and nad1, nad4 and nad5 had positive GC skewness values in both B. cinnabaria and B. propinqua.
B. cinnabaria and B. propinqua shared an identical start codon for their respective PCGs, except atp8 with ATC for B. cinnabaria and GTG for B. propinqua (Table
The frequency of individual amino acid was quite similar between B. cinnabaria and B. propinqua (Fig.
Analysis of the relative synonymous codon usage (RSCU) revealed that there was no biased usage of A/T than G/C at the third codon position (Table S4; Fig.
The Ka/Ks ratio (an indicator of selective pressure on a PCG) was less than 1 for all the 13 PCGs in the two Bactrocera mitogenomes, indicating purifying selection (Table S5; Fig.
The cloverleaf structure for some tRNAs was dissimilar in B. cinnabaria and B. propinqua (Fig.
The control region was flanked by rrnS and trnI genes respectively, with 358 bp in B. cinnabaria and 947 bp in B. propinqua. In the control region of B. propinqua, a long poly-A stretch of 12 bp was present in the anterior region, and a 24 bp poly-A stretch was present in the posterior region; a long poly-T stretch of 24 bp was present in the middle region. Long poly-A and poly-T stretches were not present in B. cinnabaria control region.
Simple tandem repeats and palindromes were present in the control region of B. cinnabaria and B. propinqua (Table
Number of different repetitive sequences in the control regions of Bactrocera cinnabaria and Bactrocera propinqua mitogenomes.
No. of repeat | |||
Type of repeat | Repetitive sequence | B. cinnabaria | B. propinqua |
Simple sequence repeat | (A)12 | 0 | 1 |
(A)24 | 0 | 1 | |
(T)24 | 0 | 1 | |
(TA)3 | 0 | 4 | |
(TA)6 | 0 | 1 | |
(ATA)2 | 0 | 1 | |
(AAT)2 | 2 | 3 | |
(TAA)2 | 5 | 4 | |
(TAG)2 | 1 | 1 | |
(TTA)2 | 2 | 3 | |
(TAAA)2 | 2 | 1 | |
(AAAT)2 | 1 | 1 | |
(ATAA)2 | 0 | 1 | |
(TTAA)2 | 0 | 2 | |
(TTTA)2 | 0 | 2 | |
(AAATA)2 | 0 | 1 | |
(AATTT)2 | 0 | 1 | |
(TTTAA)2 | 0 | 1 | |
(TTTAAA)2 | 1 | 0 | |
Palindromes | AATTAA | 2 | 4 |
ATAATA | 0 | 1 | |
ATTTTA | 0 | 1 | |
TAAAAT | 3 | 2 | |
TATTAT | 0 | 3 | |
TTAATT | 1 | 5 | |
AATTTTAA | 0 | 1 | |
ATTAATTA | 0 | 1 | |
TTAAAATT | 0 | 1 | |
AAATTTTAAA | 0 | 1 | |
TAAAATTAAAAT | 0 | 1 |
Phylogenetic analysis, based on 15 mt-genes (13 PCGs and 2 rRNA genes) and 13 PCGs of available complete mitogenomes, revealed two major clades of the Bactrocera taxa: (A) subgenus Bactrocera, and (B) subgenera Bulladacus, Daculus, Tetradacus and unassigned Bactrocera sp. ‘yunnanensis’ (Fig.
Phylogenetic trees (ML/BI) of (a) 15 mt-genes (13 PCGs + 2 rRNA genes), and (b) 13 PCGs of the whole mitogenomes of Bactrocera fruit flies with Ceratitis capitata, C. fasciventris, and C. rosa as the outgroup taxa. Numeric values at the nodes are ML bootstrap and Bayesian posterior probabilities. Support values labelled with a “*” have 100% bootstrap support or 1.0 posterior probability.
B. cinnabaria was basal to the sister lineage of B. minax and B. tsuneonis, indicating that the subgenus Bulladacus was closer related to subgenus Tetradacus than to subgenus Daculus. B. propinqua was basal to the sister lineage of B. ritsemai and B. limbifera, forming a subclade with B. umbrosa, B. curvifera and B. moluccensis (Fig.
In the present study, the mitogenomes of B. cinnabaria and B. propinqua have three main clusters of characteristic tRNAs (Fig.
The 358-bp control region of B. cinnabaria mitogenome is exceptionally short for tephritid fruit flies. It aligns with the anterior portion of the long control region of other Bactrocera species. It is, however, not the shortest control region for Bactrocera species. Bactrocera rubigina (NC_046521) has a 235-bp control region (
In the present study, the subgenera of genus Bactrocera, particularly the subgenus Bactrocera represented by a large number of taxa, are monophyletic. Apart from the subgenus Bactrocera, a broader taxon sampling is needed to confirm the monophyletic status of the subgenera Bulladacus, Daculus and Tetradacus. A recent study, however, indicates that the subgenus Bactrocera based on current taxonomic classification is not monophyletic (
An earlier study based on partial COXI and 16S sequences shows that the subgenus Tetradacus is a sister group to the subgenus Paratridacus of the Melanodacus group (
The sister lineage of B. oleae and B. biguttula (Fig.
The very small genetic difference (lack of genetic differentiation, ‘p’ = 0.03% based on 15 mt-genes) between B. ruiliensis and B. thailandica supports the synonymy of B. ruiliensis with B. thailandica (
In the present study, a small genetic difference is observed between B. melastomatos and B. rubigina with ‘p’ = 0.08% and 0.40%, and intra B. rubigina genetic distance of 0.37% (Table S6; Fig.
Likewise, B. tryoni and B. neohumeralis (each with one specimen) are genetically very similar with ‘p’ = 0.69% (Table S6; Fig.
The present phylogenetic analysis based on 15 mt-genes concurs with the finding of
Likewise, the present finding of ‘p’ = 1.70-1.74% (Table S6) based on 15 mt-genes does not support the synonymy of B. albistrigata with B. frauenfeldi (
A recent mitogenomic study on specimens of the B. dorsalis complex from various geographic regions indicates that they do not group together, and is therefore paraphyletic (
The present phylogenetic analysis supports the high genetic similarity of B. papayae and B. philippinensis which form a sister lineage with ‘p’ = 0.85%; B. papayae has a genetic distance of 1.00% with B. dorsalis and 1.18% with B. invadens (Table S6; Fig.
In summary, we have successfully sequenced the whole mitochondrial genomes of B. (Bulladacus) cinnabaria (the first report for the subgenus Bulladacus) and B. (Bactrocera) propinqua from Peninsular Malaysia by next generation sequencing. The genome features are similar to other Bactrocera fruit flies, excepting the short control region (358 bp) in B. cinnabaria. Phylogenetic analysis based on the mt-genes reveals two major clades of the Bactrocera taxa: (A) subgenus Bactrocera, and (B) subgenera Bulladacus, Daculus, Tetradacus and unassigned Bactrocera sp. ‘yunnanensis’. The subgenera represented by two or more species are monophyletic. A broad taxon sampling, including taxa of all the subgenera, will help to clarify their phylogeny. The present study supports the synonymy of B. ruiliensis with B. thailandica. It also shows a high genetic similarity between (a) B. melastomatos and B. rubigina, (b) B. papayae and B. philippinensis, (c) B. dorsalis and B. invadens, (d) B. tryoni and B. neohumeralis, and (e) B. cheni and B. tuberculata; and B. cheni is distinct from and not a synonym of B. tsuneonis or B. lombokensis. The phylogenomics will serve as a useful dataset for studying the genetics, systematics (including species differentiation) and phylogenetic relationships of the many species/species complexes and subgenera of the genus Bactrocera in particular, and tephritid fruit flies in general.
We thank our respective institutions for their support of our research on tephritid fruit flies. H-S Yong is supported by MoHE-HIR Grant (H-50001-00-A000025) and Universiti Malaya (H-5620009).
Figure
Data type: .pdf
Explanation notes: Stem-loop structures of intergenic sequences in the mitogenomes of Bactrocera cinnabaria and B. propinqua.
Tables S1–S6
Data type: .pdf
Explanation notes: Table S1. List of Bactrocera mitogenomes from GenBank. — Table S2. Base composition, A + T content (%), AT and GC skewness of the 13 protein coding genes in Bactrocera cinnabaria (Bc) and B. propinqua (Bp) mitogenomes. — Table S3. Amino acid frequency for the protein-coding genes of Bactrocera cinnabaria (Bc) and B. propinqua (Bp) mitogenomes. — Table S4. Relative synonymous codon usage for the 13 protein coding genes of Bactrocera cinnabaria (Bc) and B. propinqua (Bp) mitogenomes. — Table S5. Ka, Ks, Ka/Ks values for the 13 protein coding genes of Bactrocera cinnabaria and B. propinqua mitogenomes. — Table S6. Pair-wise genetic distance (%) of Bactrocera taxa based on 15 mt-genes (13 protein-coding genes and 2 rRNA genes).