Public genomic datasets like the 1000 Genomes project (1KGP), Human Genome Diversity Project (HGDP), and the Adolescent Brain Cognitive Development (ABCD) study are valuable public resources that facilitate scientific advancements in biology and enhance the scientific and economic impact of federally funded research projects. Regrettably, these datasets have often been developed and studied in ways that propagate outdated racialized and typological thinking, leading to fallacious reasoning among some readers that social and health disparities among the so-called races are due in part to innate biological differences between them. We highlight how this framing has set the stage for the racist exploitation of these datasets in two ways: First, we discuss the use of public biomedical datasets in studies that claim support for innate genetic differences in intelligence and other social outcomes between the groups identified as races. We further highlight recent instances of this which involve unauthorized access, use, and dissemination of public datasets. Second, we discuss thememification,use of simple figures meant for quick dissemination among lay audiences, of population genetic data to argue for a biological basis for purported human racial groups. We close with recommendations for scientists, to preempt the exploitation and misuse of their data, and for funding agencies, to better enforce violations of data use agreements.
more »
« less
Visualizing Life in the Deep: A Creative Pipeline for Data-Driven Animations to Facilitate Marine Mammal Research, Outreach, and Conservation
In this paper, we introduce a creative pipeline to incorporate physiological and behavioral data from contemporary marine mammal research into data-driven animations, leveraging functionality from industry tools and custom scripts to promote scientific insights, public awareness, and conservation outcomes. Our framework can flexibly transform data describing animals’ orientation, position, heart rate, and swimming stroke rate to control the position, rotation, and behavior of 3D models, to render animations, and to drive data sonification. Additionally, we explore the challenges of unifying disparate datasets gathered by an interdisciplinary team of researchers, and outline our design process for creating meaningful data visualization tools and animations. As part of our pipeline, we clean and process raw acceleration and electrophysiological signals to expedite complex multi-stream data analysis and the identification of critical foraging and escape behaviors. We provide details about four animation projects illustrating marine mammal datasets. These animations, commissioned by scientists to achieve outreach and conservation outcomes, have successfully increased the reach and engagement of the scientific projects they describe. These impactful visualizations help scientists identify behavioral responses to disturbance, increase public awareness of human-caused disturbance, and help build momentum for targeted conservation efforts backed by scientific evidence.
more »
« less
- PAR ID:
- 10342952
- Date Published:
- Journal Name:
- 2021 IEEE VIS Arts Program (VISAP)
- Page Range / eLocation ID:
- 1 to 10
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Citizen science yields increased scientific capacity in exchange for science literacy and promises of a more responsive science to society’s needs. Yet, citizen science projects are criticized for producing few scientific outputs and having exploitative relationships with the citizens who participate. In the eagerness to capture new data, scientists can fail to see the value of citizen scientists’ expertise beyond data generation and can forget to close the loop with outputs that benefit the public interest. Citizen scientists are experts in their local environments who, when asked, can improve scientific processes and products. To the degree that citizen scientists are relegated to data collection, we shortchange opportunities to advance science. Rather than merely critique, we present an evidence-based engagement approach for listening to citizen scientist participants and incorporating their input into science processes and products that can be retrofitted onto existing citizen science projects or integrated from a project’s inception. We offer this adaptable blueprint in four steps and illustrate this approach via a crowdsourced hydrology project on the Boyne River, USA. We show how engaging voices of citizen scientists at key points in the project improves both the products of science (a real-time ecohydrological model) and the process of conducting the science (adaptations to help improve data collection). Distinct from outreach or education, considering citizen scientists as an equally interesting site of inquiry can improve the practice and outputs of science.more » « less
-
Abstract Purpose Social media users share their ideas, thoughts, and emotions with other users. However, it is not clear how online users would respond to new research outcomes. This study aims to predict the nature of the emotions expressed by Twitter users toward scientific publications. Additionally, we investigate what features of the research articles help in such prediction. Identifying the sentiments of research articles on social media will help scientists gauge a new societal impact of their research articles. Design/methodology/approach Several tools are used for sentiment analysis, so we applied five sentiment analysis tools to check which are suitable for capturing a tweet's sentiment value and decided to use NLTK VADER and TextBlob. We segregated the sentiment value into negative, positive, and neutral. We measure the mean and median of tweets’ sentiment value for research articles with more than one tweet. We next built machine learning models to predict the sentiments of tweets related to scientific publications and investigated the essential features that controlled the prediction models. Findings We found that the most important feature in all the models was the sentiment of the research article title followed by the author count. We observed that the tree-based models performed better than other classification models, with Random Forest achieving 89% accuracy for binary classification and 73% accuracy for three-label classification. Research limitations In this research, we used state-of-the-art sentiment analysis libraries. However, these libraries might vary at times in their sentiment prediction behavior. Tweet sentiment may be influenced by a multitude of circumstances and is not always immediately tied to the paper's details. In the future, we intend to broaden the scope of our research by employing word2vec models. Practical implications Many studies have focused on understanding the impact of science on scientists or how science communicators can improve their outcomes. Research in this area has relied on fewer and more limited measures, such as citations and user studies with small datasets. There is currently a critical need to find novel methods to quantify and evaluate the broader impact of research. This study will help scientists better comprehend the emotional impact of their work. Additionally, the value of understanding the public's interest and reactions helps science communicators identify effective ways to engage with the public and build positive connections between scientific communities and the public. Originality/value This study will extend work on public engagement with science, sociology of science, and computational social science. It will enable researchers to identify areas in which there is a gap between public and expert understanding and provide strategies by which this gap can be bridged.more » « less
-
Cooke, Steven (Ed.)Abstract Bioenergetic approaches are increasingly used to understand how marine mammal populations could be affected by a changing and disturbed aquatic environment. There remain considerable gaps in our knowledge of marine mammal bioenergetics, which hinder the application of bioenergetic studies to inform policy decisions. We conducted a priority-setting exercise to identify high-priority unanswered questions in marine mammal bioenergetics, with an emphasis on questions relevant to conservation and management. Electronic communication and a virtual workshop were used to solicit and collate potential research questions from the marine mammal bioenergetic community. From a final list of 39 questions, 11 were identified as ‘key’ questions because they received votes from at least 50% of survey participants. Key questions included those related to energy intake (prey landscapes, exposure to human activities) and expenditure (field metabolic rate, exposure to human activities, lactation, time-activity budgets), energy allocation priorities, metrics of body condition and relationships with survival and reproductive success and extrapolation of data from one species to another. Existing tools to address key questions include labelled water, animal-borne sensors, mark-resight data from long-term research programs, environmental DNA and unmanned vehicles. Further validation of existing approaches and development of new methodologies are needed to comprehensively address some key questions, particularly for cetaceans. The identification of these key questions can provide a guiding framework to set research priorities, which ultimately may yield more accurate information to inform policies and better conserve marine mammal populations.more » « less
-
Modern science depends on computers, but not all scientists have access to the scale of computation they need. A digital divide separates scientists who accelerate their science using large cyberinfrastructure from those who do not, or who do not have access to the compute resources or learning opportunities to develop the skills needed. The exclusionary nature of the digital divide threatens equity and the future of innovation by leaving people out of the scientific process while over-amplifying the voices of a small group who have resources. However, there are potential solutions: recent advancements in public research cyberinfrastructure and resources developed during the open science revolution are providing tools that can help bridge this divide. These tools can enable access to fast and powerful computation with modest internet connections and personal computers. Here we contribute another resource for narrowing the digital divide: scalable virtual machines running on public cloud infrastructure. We describe the tools, infrastructure, and methods that enabled successful deployment of a reproducible and scalable cyberinfrastructure architecture for a collaborative data synthesis working group in February 2023. This platform enabled 45 scientists with varying data and compute skills to leverage 40,000 hours of compute time over a 4-day workshop. Our approach provides an open framework that can be replicated for educational and collaborative data synthesis experiences in any data- and compute-intensive discipline.more » « less
An official website of the United States government

