<?xml version="1.0" encoding="UTF-8"?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcq="http://purl.org/dc/terms/"><records count="1" morepages="false" start="1" end="1"><record rownumber="1"><dc:product_type>Journal Article</dc:product_type><dc:title>HINENI: Human Identity across the Nations of the Earth Ngram Investigator</dc:title><dc:creator>Handzlik, Dakota; Jones, Jason Jeffrey; Skiena, Steven S</dc:creator><dc:corporate_author/><dc:editor/><dc:description>Self-reported biographical strings on social media profiles provide a powerful tool to study self-identity. We present HINENI, a dataset of 420 million Twitter user profiles collected over a 12 year period, partitioned into 32 distinct national cohorts, which we believe is the largest publicly available data resource for identity research. We report on the major design decisions underlying HINENI, including a new notion of sampling (k-persistence) which spans the divide between traditional cross-sectional and longitudinal approaches. We demonstrate the power of HINENI to study the relative survival rate (half-life) of different tokens, and the use of emoji analysis across national cohorts to study the effects of gender, national, and sports identities.</dc:description><dc:publisher>AAAI</dc:publisher><dc:date>2024-05-31</dc:date><dc:nsf_par_id>10548355</dc:nsf_par_id><dc:journal_name>Proceedings of the International AAAI Conference on Web and Social Media</dc:journal_name><dc:journal_volume>18</dc:journal_volume><dc:journal_issue/><dc:page_range_or_elocation>515 to 527</dc:page_range_or_elocation><dc:issn>2162-3449</dc:issn><dc:isbn/><dc:doi>https://doi.org/10.1609/icwsm.v18i1.31331</dc:doi><dcq:identifierAwardId>2208664</dcq:identifierAwardId><dc:subject/><dc:version_number/><dc:location/><dc:rights/><dc:institution/><dc:sponsoring_org>National Science Foundation</dc:sponsoring_org></record></records></rdf:RDF>