skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Award ID contains: 1738502

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Public opinion surveys constitute a widespread, powerful tool to study peoples’ attitudes and behaviors from comparative perspectives. However, even global surveys can have limited geographic and temporal coverage, which can hinder the production of comprehensive knowledge. To expand the scope of comparison, social scientists turn to ex-post harmonization of variables from datasets that cover similar topics but in different populations and/or at different times. These harmonized datasets can be analyzed as a single source and accessed through various data portals. However, the Survey Data Recycling (SDR) research project has identified three challenges faced by social scientists when using data portals: the lack of capability to explore data in-depth or query data based on customized needs, the difficulty in efficiently identifying related data for studies, and the incapability to evaluate theoretical models using sliced data. To address these issues, the SDR research project has developed the SDR Querier, which is applied to the harmonized SDR database. The SDR Querier includes a BERT-based model that allows for customized data queries through research questions or keywords (Query-by-Question), a visual design that helps users determine the availability of harmonized data for a given research question (Query-by-Condition), and the ability to reveal the underlying relational patterns among substantive and methodological variables in the database (Query-by-Relation), aiding in the rigorous evaluation or improvement of regression models. Case studies with multiple social scientists have demonstrated the usefulness and effectiveness of the SDR Querier in addressing daily challenges. 
    more » « less
  2. The SDR Database v.2.0 (SDR2) is a multi-country, multi-year database for research on political participation, social capital, and well-being. It comprises harmonized information from 23 international survey projects, covering over 4.4 million respondents from 156 countries in the period 1966 – 2017. SDR2 provides both target variables and methodological indicators that store source survey and ex-post harmonization metadata. SDR2 consists of three datasets. The MASTER file, which stores harmonized information for a total of 4,402,489 respondents. The auxiliary PLUG-SURVEY file containing controls for source data quality and a set of technical variables needed for merging this file with the MASTER file. And the PLUG-COUNTRY file, which is a dictionary of countries and territories used in the MASTER file. An overall description of the SDR2 Database, and detailed information about its datasets are available in the SDR2 documentation. SDR2 is a product of the project Survey Data Recycling: New Analytic Framework, Integrated Database, and Tools for Cross-national Social, Behavioral and Economic Research, financed by the US National Science Foundation (PTE Federal award 1738502). We thank the Ohio State University and the Institute of Philosophy and Sociology, Polish Academy of Sciences, for organizational support. 
    more » « less
  3. This article proposes a new approach to analyze protest participation measured in surveys of uneven quality. Because single international survey projects cover only a fraction of the world’s nations in specific periods, researchers increasingly turn to ex-post harmonization of different survey data sets not a priori designed as comparable. However, very few scholars systematically examine the impact of the survey data quality on substantive results. We argue that the variation in source data, especially deviations from standards of survey documentation, data processing, and computer files—proposed by methodologists of Total Survey Error, Survey Quality Monitoring, and Fitness for Intended Use—is important for analyzing protest behavior. In particular, we apply the Survey Data Recycling framework to investigate the extent to which indicators of attending demonstrations and signing petitions in 1,184 national survey projects are associated with measures of data quality, controlling for variability in the questionnaire items. We demonstrate that the null hypothesis of no impact of measures of survey quality on indicators of protest participation must be rejected. Measures of survey documentation, data processing, and computer records, taken together, explain over 5% of the intersurvey variance in the proportions of the populations attending demonstrations or signing petitions. 
    more » « less
  4. As active involvement in protest has been legitimized as an acceptable form of political activity, citizens’ protest potential has become an important measure to understand contemporary democratic politics. However, the arbitrary use of a forced-choice question, which prevents those who have previously participated in protests from expressing willingness to engage in future protest, and the limited coverage of international surveys across countries and years have impeded comparative research on protest potential. This research develops a new systematic weighting method for the measurement of protest potential for comparative research. Using the 1996 International Social Survey Program survey, which asks two separate questions about “have done” and “would do” demonstrations, I create a weighting scale for the forced-choice question by estimating the predicted probabilities of protest potential for those who have already participated in demonstrations. Capitalizing on the survey data recycling framework, this study also controls for harmonization procedures and the quality of surveys, thereby expanding the cross-national and temporal coverage beyond the affluent Western democracies. The results show that this weighting scale provides a valid measure of protest potential, and the survey data recycling framework improves comparability between surveys. 
    more » « less
  5. Researchers have long theorized that characteristics of education systems impact both perceived and experienced corruption in public schools. However, due to insufficient cross-national survey data with measures on corruption in education and unassembled yet publicly available institutional data, there are few empirical tests of this theory. This article provides the rare direct test of the relationship between corruption in European public schools and three education system factors: government expenditure on education, education staff compensation, and teacher workload (pupil–teacher ratio). With a newly constructed harmonized data set for European countries, and controlling for national economic factors and individual characteristics, results of multilevel analyses suggest partial support for the theory that specific institutional characteristics of education systems impact public school corruption. The theorized institutional factors have different effects that depend on whether we examine bribe-giving experience or corruption perception. Results show that bribe-giving experience in public schools of Europe is weakly yet significantly related to education staff compensation. For corruption perception, low levels of government expenditure on education and a lopsided pupil–teacher ratio (too few teachers per student) increase the probability that people view corruption as prevalent. 
    more » « less
  6. A common claim about the affluent democracies is that protest is trending, becoming more legitimate and widely used by all political contenders. In the new democracies, protest is seen as having contributed to democratization, but growing apathy has led to protest decline while in authoritarian regimes protest may be spurring more democratization. Assessing these ideas requires comparative trend data covering 15 or more years but constructing such data confronts problems. The major problem is that the most available survey item asks “have you ever joined (lawful) demonstrations,” making it difficult to time when this protest behavior occurred. We advance a novel method for timing these “ever” responses by focusing on young adults (aged 18-23 years), who are likely reporting on participation within the past 5 years. Drawing on the Survey Data Recycling harmonized data set, we use a multilevel model including harmonization and survey quality controls to create predicted probabilities for young adult participation (576 surveys, 119 countries, 1966-2010). Aggregating these to create country-year rate estimates, these compare favorably with overlapping estimates from surveys asking about “the past 5 years or so” and event data from the PolDem project. Harmonization and survey quality controls improve these predicted values. These data provide 15+ years trend estimates for 60 countries, which we use to illustrate the possibilities of estimating comparative protest trends. 
    more » « less
  7. This special issue examines innovative methods in three areas of protest studies: (1) survey methodology and scaling, (2) the development and assessment of political event data, and (3) methods of contextual analysis. The articles provide new techniques and general methodologies for improving the quality of data that we have about protest, especially in the comparative study of protest. 
    more » « less