Title: Biodiversity of Philippine marine fishes: A DNA barcode reference library based on voucher specimens

Accurate identification of fishes is essential for understanding their biology and for ensuring food safety for consumers. DNA barcoding is an important tool because it can verify identifications of both whole fishes and processed fishes that have had key morphological characters removed (e.g., filets, fish meal); however, DNA reference libraries are incomplete, and public repositories for sequence data contain incorrectly identified sequences. During a nine-year sampling program in the Philippines, a global biodiversity hotspot for marine fishes, we developed a verified reference library of cytochrome c oxidase subunit I (COI) sequences for 2,525 specimens representing 984 species. Specimens were primarily purchased from markets, with additional diversity collected using rotenone or fishing gear. Species identifications were verified based on taxonomic, phenotypic, and genotypic data, and sequences are associated with voucher specimens, live-color photographs, and genetic samples catalogued at the Smithsonian Institution, National Museum of Natural History. The Biodiversity of Philippine Marine Fishes dataset is released herein to increase knowledge of species diversity and distributions and to facilitate accurate identification of market fishes.

Publisher / Repository:
Nature Publishing Group
Journal Name:
Scientific Data
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Marine zooplankton are key players in pelagic food webs, central links in ecosystem function, useful indicators of water masses, and rapid responders to environmental variation and climate change. Characterization of biodiversity of the marine zooplankton assemblage is complicated by many factors, including systematic complexity of the assemblage, with numerous rare and cryptic species, and high local-to-global ratios of species diversity. The papers in this themed article set document important advances in molecular protocols and procedures, integration with morphological taxonomic identifications, and quantitative analyses (abundance and biomass). The studies highlight several overarching conclusions and recommendations. A primary issue is the continuing need for morphological taxonomic experts, who can identify species and provide voucher specimens for reference sequence databases, which are essential for biodiversity analyses based on molecular approaches. The power of metabarcoding using multi-gene markers, including both DNA (Deoxyribonucleic Acid) and RNA (Ribonucleic Acid) templates, is demonstrated. An essential goal is the accurate identification of species across all taxonomic groups of marine zooplankton, with particular concern for detection of rare, cryptic, and invasive species. Applications of molecular approaches include analysis of trophic relationships by metabarcoding of gut contents, as well as investigation of the underlying ecological and evolutionary forces driving zooplankton diversity and structure.

  2. Abstract

    True fungi (Fungi) and fungus-like organisms (e.g., Mycetozoa, Oomycota) constitute the second largest group of organisms based on global richness estimates, with around 3 million predicted species. Compared to plants and animals, fungi have simple body plans with often morphologically and ecologically obscure structures. This poses challenges for accurate and precise identifications. Here we provide a conceptual framework for the identification of fungi, encouraging the approach of integrative (polyphasic) taxonomy for species delimitation, i.e. the combination of genealogy (phylogeny), phenotype (including autecology), and reproductive biology (when feasible). This allows objective evaluation of diagnostic characters, either phenotypic or molecular or both. Verification of identifications is crucial but often neglected. Because of clade-specific evolutionary histories, there is currently no single tool for the identification of fungi, although DNA barcoding using the internal transcribed spacer (ITS) remains a first diagnosis, particularly in metabarcoding studies. Secondary DNA barcodes are increasingly implemented for groups where ITS does not provide sufficient precision. Issues of pairwise sequence similarity-based identifications and OTU clustering are discussed, and multiple sequence alignment-based phylogenetic approaches with subsequent verification are recommended as more accurate alternatives. In metabarcoding approaches, the trade-off between speed and accuracy and precision of molecular identifications must be carefully considered. Intragenomic variation of the ITS and other barcoding markers should be properly documented, as phylotype diversity is not necessarily a proxy of species richness.
    Important strategies to improve molecular identification of fungi are: (1) broadly document intraspecific and intragenomic variation of barcoding markers; (2) substantially expand sequence repositories, focusing on undersampled clades and missing taxa; (3) improve curation of sequence labels in primary repositories and substantially increase the number of sequences based on verified material; (4) link sequence data to digital information of voucher specimens including imagery. In parallel, technological improvements to genome sequencing offer promising alternatives to DNA barcoding in the future. Despite the prevalence of DNA-based fungal taxonomy, phenotype-based approaches remain an important strategy to catalog the global diversity of fungi and establish initial species hypotheses.

  3. Abstract

    We are far from knowing all species living on the planet. Understanding biodiversity is demanding and requires time and expertise. Most groups are understudied given problems of identifying and delimiting species. DNA barcoding emerged to overcome some of the difficulties in identifying species. Its limitations derive from incomplete taxonomic knowledge and the lack of comprehensive DNA barcode libraries for many taxonomic groups. Here, we evaluate how useful barcoding is for identifying arthropods from highly diverse leaf litter communities in the southern Appalachian Mountains (USA). We used 3 reference databases and several automated classification methods on a data set including several arthropod groups. Acari, Araneae, Collembola, Coleoptera, Diptera, and Hymenoptera were well represented, showing different performances across methods and databases. Spiders performed the best, with correct identification rates to species and genus levels of ~50% across databases. Springtails performed poorly: no barcodes were identified to species or genus. Other groups showed poor to mediocre performance, from around 3% (mites) to 20% (beetles) of barcodes correctly identified to species, but also with some false identifications. In general, BOLD-based identification offered the best results but, in all cases except spiders, performance was poor, with less than a fifth of specimens correctly identified to genus or species. Our results indicate that the soil arthropod fauna is still insufficiently documented, with many species unrepresented in DNA barcode libraries. More effort toward integrative taxonomic characterization is needed to complete our reference libraries before we can rely on DNA barcoding as a universally applicable identification method.

  4. Abstract

    Characterization of species diversity of zooplankton is key to understanding, assessing, and predicting the function and future of pelagic ecosystems throughout the global ocean. The marine zooplankton assemblage, including only metazoans, is highly diverse and taxonomically complex, with an estimated ~28,000 species of 41 major taxonomic groups. This review provides a comprehensive summary of DNA sequences for the barcode region of mitochondrial cytochrome oxidase I (COI) for identified specimens. The foundation of this summary is the MetaZooGene Barcode Atlas and Database (MZGdb), a new open-access data and metadata portal that is linked to NCBI GenBank and BOLD data repositories. The MZGdb provides enhanced quality control and tools for assembling COI reference sequence databases that are specific to selected taxonomic groups and/or ocean regions, with associated metadata (e.g., collection georeferencing, verification of species identification, molecular protocols), and tools for statistical analysis, mapping, and visualization. To date, over 150,000 COI sequences for ~5,600 described species of marine metazoan plankton (including holo- and meroplankton) are available via the MZGdb portal. This review uses the MZGdb as a resource for summaries of COI barcode data and metadata for important taxonomic groups of marine zooplankton and selected regions, including the North Atlantic, Arctic, North Pacific, and Southern Oceans. The MZGdb is designed to provide a foundation for analysis of species diversity of marine zooplankton based on DNA barcoding and metabarcoding for assessment of marine ecosystems and rapid detection of the impacts of climate change.

  5. Abstract

    Accurate identification of species and communities is a prerequisite for analysing and recording biodiversity and community shifts. In the context of marine biodiversity conservation and management, this review outlines past, present and forward-looking perspectives on identifying and recording planktonic diversity by illustrating the transition from traditional species identification based on morphological diagnostic characters to full molecular genetic identification of marine assemblages. The article presents methodological advancements by discussing progress and critical aspects of the crossover from traditional to novel and future molecular genetic identifications, and it outlines the advantages of integrative approaches that use the strengths of both morphological and molecular techniques to identify species and assemblages. We demonstrate this process of identifying and recording marine biodiversity using pelagic copepods as a model taxon. Copepods are known for their high taxonomic and ecological diversity and comprise a huge variety of behaviours, forms and life histories, making them a highly interesting and well-studied group in terms of biodiversity and ecosystem functioning. Furthermore, their short life cycles and rapid responses to changing environments make them good indicators and core research components for ecosystem health and status in the light of environmental change. This article is part of the theme issue ‘Integrative research perspectives on marine conservation’.