Title: Practitioners Teaching Data Science in Industry and Academia: Expectations, Workflows, and Challenges
Data science has been growing in prominence across both academia and industry, but there is still little formal consensus about how to teach it. Many people who currently teach data science are practitioners such as computational researchers in academia or data scientists in industry. To understand how these practitioner-instructors pass their knowledge onto novices and how that contrasts with teaching more traditional forms of programming, we interviewed 20 data scientists who teach in settings ranging from small-group workshops to large online courses. We found that: 1) they must empathize with a diverse array of student backgrounds and expectations, 2) they teach technical workflows that integrate authentic practices surrounding code, data, and communication, 3) they face challenges involving authenticity versus abstraction in software setup, finding and curating pedagogically-relevant datasets, and acclimating students to live with uncertainty in data analysis. These findings can point the way toward better tools for data science education and help bring data literacy to more people around the world.
Award ID(s):
1735234
PAR ID:
10104556
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems
Issue:
2019
Page Range / eLocation ID:
1 to 14
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Seagroves, Scott; Barnes, Austin; Metevier, Anne; Porter, Jason; Hunter, Lisa (Ed.)
    Ostensibly, the main goal of the ISEE Professional Development Program (PDP) is to teach scientists and engineers how to be intentional, inclusive educators by experiencing and designing inquiry-based learning activities. However, the PDP has many indirect, positive effects on its participants as well, including building community and a sense of STEM identity, fluency to understand and discuss diversity, equity, and inclusion topics, and recognizing the importance of psychological safety in learning, academia, and industry. We present four narratives from past participants with underestimated minority identities, who discuss how the PDP had a positive impact on their growth as scientists and engineers. In each case, the PDP provided critical tools, knowledge or support that enabled their success as graduate students and into their respective career and life journeys.
  2. High Performance Computing (HPC) and, in general, Parallel and Distributed Computing (PDC) has become pervasive, from supercomputers and server farms containing multicore CPUs and GPUs, to individual PCs, laptops, and mobile devices. Therefore, it is important for every computing professional (and especially every programmer) to understand how parallelism and distributed computing affect problem solving. It is essential for educators to impart a range of PDC and HPC knowledge and skills at multiple levels within the educational fabric woven by Computer Science (CS), Computer Engineering (CE), and related computational curricula including data science. Companies and laboratories need people with these skills, and, as a result, they are finding that they must now engage in extensive on-the-job training. All the while, rapid changes in hardware platforms, languages, and programming environments increasingly challenge educators to decide what to teach and how to teach it in order to prepare students for careers that are increasingly involving PDC and HPC. EduHiPC aims to provide a forum that brings together academia, industry, government, and non-profit organizations – especially from India, its vicinity, and Asia – for exploring and exchanging experiences and ideas about the inclusion of high-performance, parallel, and distributed computing into undergraduate and graduate curriculum of Computer Science, Computer Engineering, Computational Science, Computational Engineering, and computational courses for STEM and business and other non-STEM disciplines. 
  3. As data science is an evolving field, existing definitions reflect this uncertainty with overloaded terms and inconsistency. As a result of the field’s fluidity, there is often a mismatch between what data-related programs teach, what employers expect, and the actual tasks data scientists are performing. In addition, the tools available to data scientists are not necessarily the tools being taught; textbooks do not seem to meet curricular needs; and empirical evidence does not seem to support existing program design. Currently, the field appears to be bifurcating into data science (DS) and data engineering (DE), with specific but overlapping roles in the combined data science and engineering (DSE) lifecycle. However, curriculum design has not yet caught up to this evolution. This working group report shows an empirical and data-driven view of the data-related education landscape, and includes several recommendations for both academia and industry that are based on this analysis. 
  4. Science communication (scicomm) shapes our world by helping people use science to make societal and personal decisions. Supporting and doing ethical scicomm requires valuing diverse perspectives and the people who do scicomm. Unfortunately, institutional hurdles ingrained in academia impede and undermine ethical scicomm. The injustices impeding scicomm stem from the prestige paradigm of academia (articulated in the present article), which reinforces hierarchical relationships in an exclusionary and exploitative system. To move academia forward, we name and review these injustices through the lens of five realms of scicomm (scientific communication, teaching scicomm, academics engaging in scicomm, scicomm research, and scicomm careers beyond academia). We then provide a novel framework, helping readers identify axes of influence and how they can leverage their intersectional, academic capital to take concrete action to remove the hurdles impeding ethical scicomm in academia.
  5. Bulmer, Michael; Finch, Sue (Ed.)
    By learning collaboration skills, statisticians and data scientists can more effectively collaborate with others who possess the skills they may lack. We describe the ASCCR framework, which was developed in the United States for teaching and learning collaboration skills. We hypothesize which aspects of this framework are universal and which are culturally dependent. We provide an example of our initial attempt to translate the Structure component of ASCCR into the Indonesian language and culture. The new acronym of SABAR can help Indonesian students learn, remember, and use specific skills for structuring, organizing, and conducting collaboration meetings. How to effectively teach collaboration skills to Indonesian students in statistics and data science will require more work and is the subject of future research. 