skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: The Herbarium 2021 Half-Earth Challenge Dataset
Herbarium sheets present a unique view of the world's botanical history, evolution, and diversity. This makes them an all-important data source for botanical research. With the increased digitisation of herbaria worldwide and the advances in the fine-grained classification domain that can facilitate automatic identification of herbarium specimens, there are a lot of opportunities for supporting research in this field. However, existing datasets are either too small, or not diverse enough, in terms of represented taxa, geographic distribution or host institutions. Furthermore, aggregating multiple datasets is difficult as taxa exist under a multitude of different names and the taxonomy requires alignment to a common reference. We present the Herbarium Half-Earth dataset, the largest and most diverse dataset of herbarium specimens to date for automatic taxon recognition. https://doi.org/10.48550/arXiv.2105.13808  more » « less
Award ID(s):
2054684
PAR ID:
10445634
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
FGVC8 Workshop at CVPR 2021
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Herbarium sheets present a unique view of the world's botanical history, evolution, and biodiversity. This makes them an all–important data source for botanical research. With the increased digitization of herbaria worldwide and advances in the domain of fine–grained visual classification which can facilitate automatic identification of herbarium specimen images, there are many opportunities for supporting and expanding research in this field. However, existing datasets are either too small, or not diverse enough, in terms of represented taxa, geographic distribution, and imaging protocols. Furthermore, aggregating datasets is difficult as taxa are recognized under a multitude of names and must be aligned to a common reference. We introduce the Herbarium 2021 Half–Earth dataset: the largest and most diverse dataset of herbarium specimen images, to date, for automatic taxon recognition. We also present the results of the Herbarium 2021 Half–Earth challenge, a competition that was part of the Eighth Workshop on Fine-Grained Visual Categorization (FGVC8) and hosted by Kaggle to encourage the development of models to automatically identify taxa from herbarium sheet images. 
    more » « less
  2. Summary Botanical gardens make unique contributions to climate change research, conservation, and public engagement. They host unique resources, including diverse collections of plant species growing in natural conditions, historical records, and expert staff, and attract large numbers of visitors and volunteers. Networks of botanical gardens spanning biomes and continents can expand the value of these resources. Over the past decade, research at botanical gardens has advanced our understanding of climate change impacts on plant phenology, physiology, anatomy, and conservation. For example, researchers have utilized botanical garden networks to assess anatomical and functional traits associated with phenological responses to climate change. New methods have enhanced the pace and impact of this research, including phylogenetic and comparative methods, and online databases of herbarium specimens and photographs that allow studies to expand geographically, temporally, and taxonomically in scope. Botanical gardens have grown their community and citizen science programs, informing the public about climate change and monitoring plants more intensively than is possible with garden staff alone. Despite these advances, botanical gardens are still underutilized in climate change research. To address this, we review recent progress and describe promising future directions for research and public engagement at botanical gardens. 
    more » « less
  3. Elmer Ottis Wooton (1865–1945) was one of the most important early botanists to work in the Southwestern United States, contributing a great deal of natural history knowledge and botanical research on the flora of New Mexico that shaped many naturalists and scientists for generations. The extensive Wooton legacy includes herbarium collections that he and his famous student Paul Carpenter Standley (1884–1963), prolific botanist and explorer, used for the first Flora of New Mexi co by Wooten and Standley 1915 , along with resources covering botany and range management strategies for the northern Chihuahuan Desert, and an extensive, yet to be digitized, historical archive of correspondence, field notes, vegetation sketches, photographs, and lantern slides, all from his travels and field work in the region. Starting in 1890, the most complete set of Wooton’s herbarium collections were deposited in the NMC herbarium at New Mexico State University (NMSU), and his archives, now stored in a Campus library, have together been underutilized, offline resources. The goals of this ongoing project are to secure, preserve, and promote Wooton’s important historical resources, by fleshing out the botanical history of the region, raising appreciation of herbarium collections within the community, and emphasizing their unique role in facilitating contemporary research aimed at addressing pressing scientific questions such as vegetation responses to global climate change. Students and the general public involved in this project are engaged through hands-on activities including cataloging, databasing and digitization of nearly 10,000 herbarium specimens and Wooton’s archives. These outputs, combined with contemporary data collection and computational biology techniques from an ecological perspective, are being used to document vegetation changes in iconic, climate-sensitive, high-elevation mountainous ecosystems present in southwestern New Mexico. In a later phase of the project, a variety of public audiences will participate through interactive online story maps and citizen science programs such as iNaturalist , Notes from Nature , and BioBlitz . Images of herbarium specimens will be shared via an online database and other relevant biodiversity portals ( Symbiota , iDigBio , JStor ) Community members reached through this project will be better-informed citizens, who may go on to become new stewards of natural history collections, with the potential to influence policies safeguarding the future of our planet’s biodiversity. More locally, the project will support the management of Organ Mountains Desert Peaks National Monument, which was established in 2014 to protect the area's human and environmental resources, and for which knowledge and data are currently limited. 
    more » « less
  4. Freshwater red algae have been collected on the African continent since the early 1800s. However, the collections have been sparse and geographically restricted. The present study sought to bring together information from the literature, herbarium specimens, and newly collected specimens to provide an updated assessment of the freshwater red algal diversity of the African continent with a focus on the species-rich Batrachospermales. DNA sequence data and morphological observations were conducted for recently collected specimens. From these analyses, four new taxa are proposed: Kumanoa bouwmanii, Sheathia murpheyi, Sirodotia kennedyi, and the form taxon ‘Chantransia azurea’. DNA sequence data had been previously published for Kumanoa iriomotensis, Sirodotia aff. huillensis, and S. suecica. With this study, we have added sequence data for Torularia atra as well as a second location for S. suecica. In total, there are eight taxa with sequence data, of which five appear to be endemic. From our assessment of literature reports and herbarium specimens, we conclude that Kumanoa, Sirodotia, and Torularia have often been collected and are relatively geographically widespread with two or more species present. In addition, Montagnia, Nothocladus, Paralemanea, Sheathia, and Visia as well as Batrachospermum section Gonimopropagulum are represented in the flora. We estimate that 14 to 19 batrachospermalean taxa can be recognised for the African flora, and that with more study, that number could easily double or triple based on the diversity known from other well-studied continents. 
    more » « less
  5. Abstract The widespread digitization of natural history collections, combined with novel tools and approaches is revolutionizing biodiversity science. The ‘extended specimen’ concept advocates a more holistic approach in which a specimen is framed as a diverse stream of interconnected data. Herbarium specimens that by their very nature capture multispecies relationships, such as certain parasites, fungi and lichens, hold great potential to provide a broader and more integrative view of the ecology and evolution of symbiotic interactions. This particularly applies to parasite–host associations, which owing to their interconnectedness are especially vulnerable to global environmental change.Here, we present an overview of how parasitic flowering plants is represented in herbarium collections. We then discuss the variety of data that can be gathered from parasitic plant specimens, and how they can be used to understand global change impacts at multiple scales. Finally, we review best practices for sampling parasitic plants in the field, and subsequently preparing and digitizing these specimens.Plant parasitism has evolved 12 times within angiosperms, and similar to other plant taxa, herbarium collections represent the foundation for analysing key aspects of their ecology and evolution. Yet these collections hold far greater potential. Data and metadata obtained from parasitic plant specimens can inform analyses of co‐distribution patterns, changes in eco‐physiology and species plasticity spanning temporal and spatial scales, chemical ecology of tripartite interactions (e.g. host–parasite–herbivore), and molecular data critical for species conservation. Moreover, owing to the historic nature and sheer size of global herbarium collections, these data provide the spatiotemporal breadth essential for investigating organismal response to global change.Parasitic plant specimens are primed to serve as ideal examples of extended specimen concept and help motivate the next generation of creative and impactful collection‐based science. Continued digitization efforts and improved curatorial practices will contribute to opening these specimens to a broader audience, allowing integrative research spanning multiple domains and offering novel opportunities for education. 
    more » « less