skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: MIDST: an enhanced development environment that improves the maintainability of a data science analysis
>With the increasing ability to generate actionable insight from data, the field of data science has seen significant growth. As more teams develop data science solutions, the analytical code they develop will need to be enhanced in the future, by an existing or a new team member. Thus, the importance of being able to easily maintain and enhance the code required for an analysis will increase. However, to date, there has been minimal research on the maintainability of an analysis done by a data science team. To help address this gap, data science maintainability was explored by (1) creating a data science maintainability model, (2) creating a new tool, called MIDST (Modular Interactive Data Science Tool), that aims to improve data science maintainability, and then (3) conducting a mixed method experiment to evaluate MIDST. The new tool aims to improve the ability of a team member to update and rerun an existing data science analysis by providing a visual data flow view of the analysis within an integrated code and computational environment. Via an analysis of the quantitative and qualitative survey results, the experiment found that MIDST does help improve the maintainability of an analysis. Thus, this research demonstrates the importance of enhanced tools to help improve the maintainability of data science projects.  more » « less
Award ID(s):
1618444
PAR ID:
10301603
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
International journal of information systems and project management
Volume:
8
Issue:
3
ISSN:
2182-7788
Page Range / eLocation ID:
5-24
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Hockett, Kevin Loren (Ed.)
    ABSTRACT Synthetic microbial communities (SynComs) are a valuable tool to study community assembly patterns, host–microbe interactions, and microbe–microbe interactions in a fully controllable setting. Constructing the SynCom inocula for plant–microbe experiments can be time-consuming and difficult because a large number of isolates with different medium requirements and growth rates are grown in parallel and mixed to appropriate titers. A potential workaround to assembling fresh SynCom inocula for every experiment could be to prepare and freeze SynComs on a large scale, creating ready-to-use inocula. The objective of this study was to compare the reproducibility, stability, and colonization ability of freshly prepared versus frozen SynCom inocula. We used a community of seven species known to colonize maize roots. The results from inoculation with the frozen SynCom were as consistent as those of standardizedde novoconstruction of fresh SynCom. Our results indicate that creating frozen SynCom inocula for repeated use in experiments not only saves time but could also improve cross-experiment reproducibility. Although this approach was only validated with one SynCom, it demonstrates a principle that can be tested for improving approaches in constructing other SynComs. IMPORTANCESynthetic communities (SynComs) are an invaluable tool to characterize and model plant–microbe interactions. Multimember SynComs approximate intricate real-world interactions between plants and their microbiome, but the complexity and time required for their construction increase enormously for each additional member added to the SynCom. Therefore, researchers who study a diversity of microbiomes using SynComs are looking for ways to simplify the use of SynComs. In this manuscript, we evaluate the feasibility of creating ready-to-use freezer stocks of a well-studied seven-member SynCom for maize roots. The frozen ready-to-use SynCom stocks work according to the principle of “just add buffer and apply to sterilized seeds or seedlings” and thus can save time applied in multiple days of laborious growing and combining of multiple microorganisms. We show that ready-to-use SynCom stocks provide comparable results to those of freshly constructed SynComs and thus allow for significant time savings when working with SynComs. 
    more » « less
  2. With Kotlin becoming a viable language replacement for Java, there is a need for translators and data flow analysis libraries to create maintainable and readable source code. Instagram, Uber, and Gradle are only a few of the large corporations that have either switched from Java to Kotlin completely or started to use it in internal tools in order to reduce code base size. Developers have claimed that Kotlin is fun to use in comparison to Java and much of the boilerplate code is reduced. With Java being the main language for the open source organization, PhenoApps, there is a need to support both Java and Kotlin to increase the maintainability of the code. Fortunately, JetBrains has an open-source IDE plugin for translating Java to Kotlin; however, the translation has some fundamental issues which shall be discussed further in this paper. Introducing, j2k, a CLI translation tool which includes various anti-pattern detection for syntactical formatting, performance, and other Android requirements. The new tool introduced within this paper, j2kCLI allows users to directly translate strings of Java code to Kotlin, or entire directories. This facilitates the maintainability of a large open source code base. 
    more » « less
  3. Commitment is a multi-dimensional construct that has been extensively researched in the context of organizations. Organizational and professional commitment have been positively associated with technical performance, client service, attention to detail, and degree of involvement with one’s job. However, there is a relative dearth of research in terms of team commitment, especially in educational settings. Teamwork is considered a 21stcentury skill and higher education institutions are focusing on helping students to develop teamwork skills by applied projects in the coursework. But studies have demonstrated that creating a team is not enough to help students build teamwork skills. Literature supports the use of team contracts to bolster commitment, among team members. However, the relationship between team contracts and team commitment has not been formally operationalized.This research category study presents a mixed-methods approach towards characterizing and operationalizing team commitment exhibited by students enrolled in a sophomore-level systems analysis and design course by analyzing team contracts and team retrospective reflections. The course covers concepts pertaining to information systems development and includes a semester-long team project where the students work together in four or five member teams to develop the project deliverables. The students have prior software development experiences through an introductory systems development course as well as multiple programming courses. The data for this study was collected through the team contracts signed by students belonging to one of the 23 teams of this course. The study aims to answer the following research question: How can team commitment be characterized in a sophomore-level system analysis and design course among the student teams?A rubric was developed to quantify the team commitment levels of students based on their responses on the team contracts. Students were classified as high or low commitment based on the rubric scores. The emergent themes of high and low commitment teams were also presented. The results indicated that the high commitment teams were focused on setting goals, effective communication, and having mechanisms in place for timely feedback and improvement. On the other hand, low commitment teams did not articulate the goals of the project, they demonstrated a lack of dedication for attending team meetings regularly, working as a team, and had a lack of proper coordination while working together. 
    more » « less
  4. Water is a critical natural resource that has significant impacts on human living and society. Growing population and energy consumption exacerbate the scarcity of water and our ability to manage this resource. This demonstration paper presents WaterScope, a smart and connected platform for water resource management, which integrates multiple data sources such as water level data, social media data, and water-related articles. Furthermore, the tool enables forecasting underground water levels, identifying water concerns, sharing knowledge and expertise among stakeholders, and thus bringing new insights to our understanding and insights of the water supplies and resource management. The prototype engages water stakeholders who face problems of similar nature but deal with the problem in an ad-hoc and isolated manner. The interactive WaterScope platform targets creating an interconnected virtual community that aims to improve water supply resilience. 
    more » « less
  5. Understanding how to build relationships between universities, organizations, and community colleges could encourage more inter-organizational work through the formation of intentional and strong positive relationships. In this work in progress paper, we discuss how we fostered a research collaboration between faculty, administrators, and researchers at two community colleges, two universities, and several professional welding organizations. The intent of the overarching research project is to study and improve the educational experiences, outcomes, and career pathways of welding technology (WT) students. During the facilitation of this project, the team has successfully cultivated and leveraged relationships and partnerships to help inform the study. As a result, the Project Team recognizes the importance of capturing how we develop and leverage these relationships to address project needs and produce deliverables. The formation of the relationships between researchers, practitioners, employers and professional organizations is rarely examined and documented in technological education. Thus, this work allows us to capture and share the theoretical and practical knowledge about how we have developed, maintained, and leveraged these partnerships with engaged leadership from our community college principal investigators. In this work, we present: (1) a brief review of literature about team science and (2) best practices related to our process of cultivating and leveraging relationships between the Project team members, faculty and industry employers. This work provides theoretical and practical knowledge about partnership development in Advanced Technological Education (ATE) projects that can provide critical insights about creating and leveraging partnerships between researchers, faculty, and practitioners. 
    more » « less