skip to main content


Search for: All records

Creators/Authors contains: "Cao, Yanghui"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Large systematic revisionary projects incorporating data for hundreds or thousands of taxa require an integrative approach, with a strong biodiversity-informatics core for efficient data management to facilitate research on the group. Our original biodiversity informatics platform, 3i (Internet-accessible Interactive Identification) combined a customized MS Access database backend with ASP-based web interfaces to support revisionary syntheses of several large genera of leafhopers (Hemiptera: Auchenorrhyncha: Cicadellidae). More recently, for our National Science Foundation sponsored project, “GoLife: Collaborative Research: Integrative genealogy, ecology and phenomics of deltocephaline leafhoppers (Hemiptera: Cicadellidae), and their microbial associates”, we selected the new open-source platform TaxonWorks as the cyberinfrastructure. In the scope of the project, the original “3i World Auchenorrhyncha Database” was imported into TaxonWorks. At the present time, TaxonWorks has many tools to automatically import nomenclature, citations, and specimen based collection data. At the time of the initial migration of the 3i database, many of those tools were still under development, and complexity of the data in the database required a custom migration script, which is still probably the most efficient solution for importing datasets with long development history. At the moment, the World Auchenorrhyncha Database comprehensively covers nomenclature of the group and includes data on 70 valid families, 6,816 valid genera, 47,064 valid species as well as synonymy and subsequent combinations (Fig. 1). In addition, many taxon records include the original citation, bibliography, type information, etymology, etc. The bibliography of the group includes 37,579 sources, about 1/3 of which are associated with PDF files. Species have distribution records, either derived from individual specimens or as country and state level asserted distribution, as well as biological associations indicating host plants, predators, and parasitoids. Observation matrices in TaxonWorks are designed to handle morphological data associated with taxa or specimens. The matrices may be used to automatically generate interactive identification keys and taxon descriptions. They can also be downloaded to be imported, for example, into Lucid builder, or to perform phylogenetic analysis using an external application. At the moment there are 36 matrices associated with the project. The observation matrix from GoLife project covers 798 taxa by 210 descriptors (most of which are qualitative multi-state morphological descriptors) (Fig. 2). Illustrations are provided for 9,886 taxa and organized in the specialized image matrix and could be used as a pictorial key for determination of species and taxa of a higher rank. For the phylogenetic analysis, a dataset was constructed for 730 terminal taxa and >160,000 nucleotide positions obtained using anchored hybrid enrichment of genomic DNA for a sample of leafhoppers from the subfamily Deltocephalinae and outgroups. The probe kit targets leafhopper genes, as well as some bacterial genes (endosymbionts and plant pathogens transmitted by leafhoppers). The maximum likelihood analyses of concatenated nucleotide and amino acid sequences as well as coalescent gene tree analysis yielded well-resolved phylogenetic trees (Cao et al. 2022). Raw sequence data have been uploaded to the Sequence Read Archive on GenBank. Occurrence and morphological data, as well as diagnostic images, for voucher specimens have been incorporated into TaxonWorks. Data in TaxonWorks could be exported in raw format, get accessed via Application Programming Interface (API), or be shared with external data aggregators like Catalogue of Life, GBIF, iDigBio. 
    more » « less
  2. Despite several decades’ effort to detect and identify phytoplasmas (Mollicutes) using PCR and Sanger sequencing focusing on diseased plants, knowledge of phytoplasma biodiversity and vector associations remains highly incomplete. To improve protocols for documenting phytoplasma diversity and ecology, we used DNA extracted from phloem-feeding insects and compared traditional Sanger sequencing with a next-generation sequencing method, Anchored Hybrid Enrichment (AHE) for detecting and characterizing phytoplasmas. Among 22 of 180 leafhopper samples that initially tested positive for phytoplasmas using qPCR, AHE yielded phytoplasma 16Sr sequences for 20 (19 complete and 1 partial sequence) while Sanger sequencing yielded sequences for 16 (11 complete and 5 partial). AHE yielded phytoplasma sequences for an additional 7 samples (3 complete and 4 partial) that did not meet the qPCR threshold for phytoplasma positivity or yielded non-phytoplasma sequences using Sanger sequencing. This suggests that AHE is more efficient for obtaining phytoplasma sequences. Twenty-three samples with sufficient data were classified into eight 16Sr subgroups (16SrI-B, I-F, I-AO, III-U, V-C, IX-J, XI-C, XXXVII-A), three new subgroups (designated as 16SrVI-L, XV-D, XI-G) and three possible new groups. Our results suggest that screening phloem-feeding insects using qPCR and AHE sequencing may be the most efficient method for discovering new phytoplasmas. 
    more » « less
  3. Reliable host plant records are available for only a small fraction of herbivorous insect species, despite their potential agricultural importance. Most available data on insect–plant associations have been obtained through field observations of occurrences of insects on particular plants. Molecular methods have more recently been used to identify potential host plants using DNA extracted from insects, but most prior studies using these methods have focused on chewing insects that ingest tissues expected to contain large quantities of plant DNA. Screening of Illumina data obtained from sap feeders of the hemipteran family Cicadellidae (leafhoppers) using anchored hybrid enrichment indicates that, despite feeding on plant fluids, these insects often contain detectable quantities of plant DNA. Although inclusion of probes for bacterial 16S in the original anchored hybrid probe kit yielded relatively high detection rates for chloroplast 16S, the Illumina short reads also, in some cases, included DNA for various plant barcode genes as “by-catch”. Detection rates were generally only slightly higher for Typhlocybinae, which feed preferentially on parenchyma cell contents, compared to other groups of leafhoppers that feed preferentially on phloem or xylem. These results indicate that next-generation sequencing provides a powerful tool to investigate the specific association between individual insect and plant species. 
    more » « less
  4. null (Ed.)
    The leafhopper genus Thaia Ghauri, 1962 and two related genera, Parathaia Kuoh, 1982 and Pseudothaia Kuoh, 1982, are revised. Nlunga Dworakowska, 1974, previously treated as a subgenus of Thaia, is elevated to the genus level, and Parathaia Kuoh, 1982, previously treated as a synonym of Thaia, is reinstated to valid status. The Oriental species lacking foveae on the pronotum are excluded from Nlunga and transferred to a new genus, Etmaria gen. n. Two species, Pseudothaia caudata Song & Li, 2013 and Thaia (Nlunga) leishanensis (Song & Li, 2007), are treated as Incertae sedis within Erythroneurini. Moreover, nine additional new species are described and illustrated: Nlunga parareeneni, Etmaria brevis, E. chaiyaphumica, E. dentata, E. indonesica, E. magna, E. triquetra, E. ulterior and Pseudothaia forcipis spp. nov. 
    more » « less
  5. null (Ed.)
    Six Afrotropical genera of the leafhopper tribe Erythroneurini are redescribed and two new genera, Laminaris gen. nov. and Projecta gen. nov. are established. Sixteen new species are described and illustrated: Laminaris angusta sp. nov., Laminaris cuspidatima sp. nov., Laminaris serrata sp. nov., Laminaris tenuis sp. nov., Projecta auriculata sp. nov., Projecta brevis sp. nov., Projecta depressa sp. nov., Projecta draciformis sp. nov., Accacidia obunca sp. nov., Imbecilla bifurca sp. nov., Imbecilla spinalis sp. nov., Lublinia anchoroides sp. nov., Molopopterus hastata sp. nov., Molopopterus ugandica sp. nov., Lamtoana exigua sp. nov. and Nsesa cameroonica sp. nov. Nsesa Dworakowska, 1974 is newly recorded from Cameroon; Nsimbala Dworakowska, 1974 and Szymczakowskia Dworakowska, 1974 from the Central African Republic; Lublinia Dworakowska, 1970, Lamtoana Dworakowska, 1972 and Ivorycoasta Dworakowska, 1972 from the Republic of Congo. Keys to species of the genera Accacidia Dworakowska, Imbecilla Dworakowska, Lublinia Dworakowska, Molopopterus Jacobi, Lamtoana Dworakowska and Nsesa Dworakowska are also given. 
    more » « less
  6. null (Ed.)
  7. null (Ed.)