Urban residential building stock synthetic datasets for building energy performance analysis
- Award ID(s):
- 2217410
- PAR ID:
- 10518345
- Publisher / Repository:
- Elsevier
- Date Published:
- Journal Name:
- Data in Brief
- Volume:
- 53
- Issue:
- C
- ISSN:
- 2352-3409
- Page Range / eLocation ID:
- 110241
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract Pangenome graphs can represent all variation between multiple genomes, but existing methods for constructing them are biased due to reference-guided approaches. In response, we have developed PanGenome Graph Builder (PGGB), a reference-free pipeline for constructing unbi-ased pangenome graphs. PGGB uses all-to-all whole-genome alignments and learned graph embeddings to build and iteratively refine a model in which we can identify variation, measure conservation, detect recombination events, and infer phylogenetic relationships.more » « less
-
Context.— Machine learning (ML) allows for the analysis of massive quantities of high-dimensional clinical laboratory data, thereby revealing complex patterns and trends. Thus, ML can potentially improve the efficiency of clinical data interpretation and the practice of laboratory medicine. However, the risks of generating biased or unrepresentative models, which can lead to misleading clinical conclusions or overestimation of the model performance, should be recognized. Objectives.— To discuss the major components for creating ML models, including data collection, data preprocessing, model development, and model evaluation. We also highlight many of the challenges and pitfalls in developing ML models, which could result in misleading clinical impressions or inaccurate model performance, and provide suggestions and guidance on how to circumvent these challenges. Data Sources.— The references for this review were identified through searches of the PubMed database, US Food and Drug Administration white papers and guidelines, conference abstracts, and online preprints. Conclusions.— With the growing interest in developing and implementing ML models in clinical practice, laboratorians and clinicians need to be educated in order to collect sufficiently large and high-quality data, properly report the data set characteristics, and combine data from multiple institutions with proper normalization. They will also need to assess the reasons for missing values, determine the inclusion or exclusion of outliers, and evaluate the completeness of a data set. In addition, they require the necessary knowledge to select a suitable ML model for a specific clinical question and accurately evaluate the performance of the ML model, based on objective criteria. Domain-specific knowledge is critical in the entire workflow of developing ML models.more » « less
-
Pangenome graphs can represent all variation between multiple reference genomes, but current approaches to build them exclude complex sequences or are based upon a single reference. In response, we developed the PanGenome Graph Builder, a pipeline for constructing pangenome graphs without bias or exclusion. The PanGenome Graph Builder uses all-to-all alignments to build a variation graph in which we can identify variation, measure conservation, detect recombination events and infer phylogenetic relationships.more » « less
-
In West Virginia, the unemployment rate is 3.6% (2024) and the workforce participation rate is 55.1% (2024) suggesting 44.9% of work-eligible people are not participating in the workforce while another 3.6% are looking for work. We have identified attracting and retaining job-ready employees, inadequate communication and collaboration, and misperceptions among students and job seekers concerning the diverse career opportunities available in WV as challenges within the region. The ARC POWER grant project, “Building Connections to Grow Capacity: Breaking Down Regional Barriers in the STEM Workforce Pipeline” addresses these challenges and have proposed many opportunities to overcome these barriers making a connection between industry, academics and educators, students and other jobseekers, and workforce entities to develop the STEM workforce in the service region of Fairmont State University (FSU) and Glenville State University (GSU) through a Regional Career Services and Workforce Development Collaborative. Several areas of the grant are likely of interest to educators in academia, including an emphasis on forming industry advisory boards to help build stronger relationships between academia and industry and also aid in refining curriculum and a focus on both student and industry engagement and instructor and industry engagement to create opportunities for job shadowing, internships and other opportunities between these groups. This grant provides resources for fostering collaboration among people who might not normally interact, and this collaboration can produce an enormous impact on the STEM workforce in our region.more » « less
An official website of the United States government

