PurposeThis study examined differences related to gender and racial/ethnic identity among academic researchers participating in the National Science Foundation’s “Innovation-Corps” (NSF I-Corps) entrepreneurship training program. Drawing from prior research in the fields of technology entrepreneurship and science, technology, engineering and mathematics (STEM) education, this study addresses the goal of broadening participation in academic entrepreneurship. Design/methodology/approachUsing ANOVA and MANOVA analyses, we tested for differences by gender and minoritized racial/ethnic identity for four variables considered pertinent to successful program outcomes: (1) prior entrepreneurial experience, (2) perceptions of instructional climate, (3) quality of project team interactions and (4) future entrepreneurial intention. The sample includes faculty (n = 434) and graduate students (n = 406) who completed pre- and post-course surveys related to a seven-week nationwide training program. FindingsThe findings show that group differences based on minoritized racial/ethnic identity compared with majority group identity were largely not evident. Previous research findings were replicated for only one variable, indicating that women report lower amounts of total prior entrepreneurial experience than men, but no gender differences were found for other study variables. Originality/valueOur analyses respond to repeated calls for research in the fields of entrepreneurship and STEM education to simultaneously examine intersecting minoritized and/or under-represented social identities to inform recruitment and retention efforts. The unique and large I-Corps national dataset offered the statistical power to quantitatively test for differences between identity groups. We discuss the implications of the inconsistencies in our analyses with prior findings, such as the need to consider selection bias.
more »
« less
The LOOP Estimator: Adjusting for Covariates in Randomized Experiments
Background:When conducting a randomized controlled trial, it is common to specify in advance the statistical analyses that will be used to analyze the data. Typically, these analyses will involve adjusting for small imbalances in baseline covariates. However, this poses a dilemma, as adjusting for too many covariates can hurt precision more than it helps, and it is often unclear which covariates are predictive of outcome prior to conducting the experiment. Objectives:This article aims to produce a covariate adjustment method that allows for automatic variable selection, so that practitioners need not commit to any specific set of covariates prior to seeing the data. Results:In this article, we propose the “leave-one-out potential outcomes” estimator. We leave out each observation and then impute that observation’s treatment and control potential outcomes using a prediction algorithm such as a random forest. In addition to allowing for automatic variable selection, this estimator is unbiased under the Neyman–Rubin model, generally performs at least as well as the unadjusted estimator, and the experimental randomization largely justifies the statistical assumptions made.
more »
« less
- Award ID(s):
- 1646108
- PAR ID:
- 10547162
- Publisher / Repository:
- SAGE Publications
- Date Published:
- Journal Name:
- Evaluation Review
- Volume:
- 42
- Issue:
- 4
- ISSN:
- 0193-841X
- Format(s):
- Medium: X Size: p. 458-488
- Size(s):
- p. 458-488
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract BackgroundEstimating and accounting for hidden variables is widely practiced as an important step in molecular quantitative trait locus (molecular QTL, henceforth “QTL”) analysis for improving the power of QTL identification. However, few benchmark studies have been performed to evaluate the efficacy of the various methods developed for this purpose. ResultsHere we benchmark popular hidden variable inference methods including surrogate variable analysis (SVA), probabilistic estimation of expression residuals (PEER), and hidden covariates with prior (HCP) against principal component analysis (PCA)—a well-established dimension reduction and factor discovery method—via 362 synthetic and 110 real data sets. We show that PCA not only underlies the statistical methodology behind the popular methods but is also orders of magnitude faster, better-performing, and much easier to interpret and use. ConclusionsTo help researchers use PCA in their QTL analysis, we provide an R package along with a detailed guide, both of which are freely available athttps://github.com/heatherjzhou/PCAForQTL. We believe that using PCA rather than SVA, PEER, or HCP will substantially improve and simplify hidden variable inference in QTL mapping as well as increase the transparency and reproducibility of QTL research.more » « less
-
Abstract Calibration weighting has been widely used to correct selection biases in nonprobability sampling, missing data and causal inference. The main idea is to calibrate the biased sample to the benchmark by adjusting the subject weights. However, hard calibration can produce enormous weights when an exact calibration is enforced on a large set of extraneous covariates. This article proposes a soft calibration scheme, where the outcome and the selection indicator follow mixed-effect models. The scheme imposes an exact calibration on the fixed effects and an approximate calibration on the random effects. On the one hand, our soft calibration has an intrinsic connection with best linear unbiased prediction, which results in a more efficient estimation compared to hard calibration. On the other hand, soft calibration weighting estimation can be envisioned as penalized propensity score weight estimation, with the penalty term motivated by the mixed-effect structure. The asymptotic distribution and a valid variance estimator are derived for soft calibration. We demonstrate the superiority of the proposed estimator over other competitors in simulation studies and using a real-world data application on the effect of BMI screening on childhood obesity.more » « less
-
BackgroundHandpumps are used by millions of people as their main source of water. Although handpumps represent only a basic form of water provision, there have been continuous efforts to improve the performance of these systems as they are likely to remain in use for many years to come. The introduction of a professional maintenance service in southern Kenya has shown an order of magnitude improvement in operational performance over community-based management, with 90% of handpump faults repaired within 3 days of being reported. One driver behind these efforts is the assumption that a more reliable water supply will lead to a reduction in water-related disease. However, it is not clear if operational improvements lead to health gains. Despite limited empirical evidence, some modeling studies suggest that even short periods of drinking contaminated water can lead to disproportionate negative health impacts. ObjectiveThe aim of this study was to assess whether the improvements in operational performance from the rapid professional maintenance of rural handpumps lead to improved household health outcomes. MethodsFrom a sample of households using handpumps as their primary water source in Kwale County, Kenya, we measured the 2-week prevalence of World Health Organization–defined diarrhea in children, reported by the adult respondent for each household. We compared the rates before and after a period during which the households’ handpumps were being professionally maintained. We then conducted a cross-sectional analysis, fitting logistic regression models with reported diarrhea as the dependent variable and speed of repair as the independent exposure of interest, adjusting for household socioeconomic characteristics; dwelling construction; and Water, Sanitation, and Hygiene (WASH)-related factors. We fitted an additional model to examine select interactions between covariates. ResultsReported diarrhea in children was lower in households whose pumps had been repaired within 24 hours (adjusted odds ratio 0.35, 95% CI 0.24-0.51). This effect was robust to the inclusion of multiple categories of covariates. No reduction was seen in households whose pump repairs took more than 24 hours. Analysis of interaction terms showed that certain interventions associated with improved WASH outcomes were only associated with reductions in diarrhea in conjunction with socioeconomic improvements. ConclusionsOnly pump repairs consistently made within 24 hours of failure led to a reduction in diarrhea in the children of families using handpumps. While the efficacy of reduction in diarrhea is substantial, the operational challenges of guaranteeing same-day repairs limits the effectiveness of even best-in-class pump maintenance. Maintenance regimes that cannot bring handpump downtimes close to zero will struggle to generate health benefits. Other factors that reduce diarrhea prevalence have limited effect in isolation, suggesting that WASH interventions will be more effective when undertaken as part of more holistic poverty-reduction efforts.more » « less
-
The National Alzheimer's Coordinating Center Uniform Data Set includes test results from a battery of cognitive exams. Motivated by the need to model the cognitive ability of low‐performing patients we create a composite score from ten tests and propose to model this score using a partially linear quantile regression model for longitudinal studies with non‐ignorable dropouts. Quantile regression allows for modeling non‐central tendencies. The partially linear model accommodates nonlinear relationships between some of the covariates and cognitive ability. The data set includes patients that leave the study prior to the conclusion. Ignoring such dropouts will result in biased estimates if the probability of dropout depends on the response. To handle this challenge, we propose a weighted quantile regression estimator where the weights are inversely proportional to the estimated probability a subject remains in the study. We prove that this weighted estimator is a consistent and efficient estimator of both linear and nonlinear effects.more » « less
An official website of the United States government
