TY - JOUR
T1 - Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution
AU - Fort, Antoine
AU - McHale, Marcus
AU - Cascella, Kevin
AU - Potin, Philippe
AU - Perrineau, Marie‐Mathilde
AU - Kerrison, Philip D.
AU - Costa, Elisabete
AU - Calado, Ricardo
AU - Domingues, Maria do Rosário
AU - Costa Azevedo, Isabel
AU - Sousa‐Pinto, Isabel
AU - Gachon, Claire
AU - Werf, Adrie
AU - Visser, Willem
AU - Beniers, Johanna E.
AU - Jansen, Henrice
AU - Guiry, Michael D.
AU - Sulpice, Ronan
N1 - © 2021 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd.
This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Open access funding provided by IReL.
PY - 2021
Y1 - 2021/7/5
AB - Accurate species identification often relies on public repositories to compare the barcode sequences of the investigated individual(s) with taxonomically assigned sequences. However, the accuracy of identifications in public repositories is often questionable, and the names originally given are rarely updated. For instance, species of the Sea Lettuce (Ulva spp.; Ulvophyceae, Ulvales, Ulvaceae) are frequently misidentified in public repositories, including herbaria and gene banks, making species identification based on traditional barcoding unreliable. We DNA barcoded 295 individual distromatic foliose strains of Ulva from the North-East Atlantic for three loci (rbcL, tufA, ITS1). Seven distinct species were found, and we compared our results with all worldwide Ulva spp. sequences present in the NCBI database for the three barcodes rbcL, tufA and the ITS1. Our results demonstrate a large degree of species misidentification, where we estimate that 24%–32% of the entries pertaining to foliose species are misannotated and provide an exhaustive list of NCBI sequences reannotations. An analysis of the global distribution of registered samples from foliose species also indicates possible geographical isolation for some species, and the absence of U. lactuca from Northern Europe. We extended our analytical framework to three other genera, Fucus, Porphyra and Pyropia and also identified erroneously labelled accessions and possibly new synonymies, albeit less than for Ulva spp. Altogether, exhaustive taxonomic clarification by aggregation of a library of barcode sequences highlights misannotations and delivers an improved representation of species diversity and distribution.
KW - aquaculture
KW - DNA barcoding
KW - phylogeny
KW - sea lettuce
KW - Ulva
U2 - 10.1111/1755-0998.13453
DO - 10.1111/1755-0998.13453
M3 - Article
SN - 1755-098X
JO - Molecular Ecology Resources
JF - Molecular Ecology Resources
ER -