Tools for Integrating Data by Complex, Dynamic Categories

Hruschka, Daniel; Cheng, Yi‐Yun; Hsiao, I‐Han; Bischoff, Robert; Peeples, Matthew; Kasi, Harsha; Huang, Cindy

doi:10.1002/pra2.1145

Citation Details


ABSTRACT A key challenge in conducting comparative analyses across social units, such as religions, ethnicities, or cultures, is that data on these units is often encoded in distinct and incompatible formats across diverse datasets. This can involve simple differences in the variables and values used to encode these units (e.g., Roman Catholic is V130 = 1 vs. Q98A = 2 in two different datasets) or differences in the resolutions at which units are encoded (Maya vs. Kaqchikel Maya). These disparate encodings can create substantial challenges for the efficiency and transparency of data syntheses across diverse datasets. We introduce a user‐friendly set of tools to help users translate four kinds of categories (religion, ethnicity, language, and subdistrict) across multiple external datasets. We outline the platform's key functions and current progress, as well as long‐range goals for the platform.
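The two kinds of mismatch named in the abstract can be illustrated with a minimal Python sketch. It is not the platform's implementation: the dataset names `survey_a` and `survey_b`, the dictionaries `CROSSWALK` and `PARENT`, and both function names are hypothetical, chosen only for illustration. The variable codes and category labels are the ones given in the abstract.

```python
from typing import Optional

# Hypothetical crosswalk: (dataset, variable, value) -> shared category label.
# "survey_a" and "survey_b" are placeholder dataset names; the codes
# V130 = 1 and Q98A = 2 are the abstract's own example.
CROSSWALK = {
    ("survey_a", "V130", 1): "Roman Catholic",
    ("survey_b", "Q98A", 2): "Roman Catholic",
}

# Hypothetical parent links between categories encoded at different resolutions.
PARENT = {"Kaqchikel Maya": "Maya"}


def to_shared_category(dataset: str, variable: str, value: int) -> Optional[str]:
    """Translate a dataset-specific code into the shared label, or None if unmapped."""
    return CROSSWALK.get((dataset, variable, value))


def rolls_up_to(category: str, target: str) -> bool:
    """Return True if `category` equals `target` or has it as an ancestor."""
    current: Optional[str] = category
    while current is not None:
        if current == target:
            return True
        current = PARENT.get(current)  # None once the top of the chain is reached
    return False
```

Under these assumptions, `to_shared_category("survey_a", "V130", 1)` and `to_shared_category("survey_b", "Q98A", 2)` resolve to the same label, and `rolls_up_to("Kaqchikel Maya", "Maya")` holds while the reverse does not, so finer-grained records can be aggregated to the coarser unit but not the other way around.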

Award ID(s): 2318505

PAR ID: 10590892

Author(s) / Creator(s): Hruschka, Daniel; Cheng, Yi‐Yun; Hsiao, I‐Han; Bischoff, Robert; Peeples, Matthew; Kasi, Harsha; Huang, Cindy

Publisher / Repository: Wiley

Date Published: 2024-10-01

Journal Name: Proceedings of the Association for Information Science and Technology

Volume: 61

Issue: 1

ISSN: 2373-9231

Page Range / eLocation ID: 934 to 936

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.1002/pra2.1145
