skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Emoji use in tweets: Relationships with personality traits and word usage
Emojis are digital pictographs used to express ideas and emotions. There are thousands of emojis, some depicting faces with expressions and others depicting objects, animals, and activities. We examined how emoji use on Twitter was related to users’ language use and Big Five personality traits. In the present research, we examined emoji use in tweets for seventy-six twitter accounts belonging to undergraduate students (34 women, 22 men) who also completed an online survey in which they reported personal characteristics including personality traits. With their consent, we retrieved tweets from accounts using the Twitter API and python script. We also analyzed word usage in tweets using the Linguistic Inquiry Word Count (LIWC). The results showed that frequent use of emojis was related to lower levels of openness, but not other Big Five traits. More frequent use of emojis was related to more frequent use of word related to: tone, positive emotion, sad, affect, feel, you pronouns, family, and the body. More frequent use of emojis was related to less frequent use of articles and words related to insight, money, risk, anger, sexual, ingest, informal, and swear words.  more » « less
Award ID(s):
1918591
PAR ID:
10528544
Author(s) / Creator(s):
; ; ;
Corporate Creator(s):
Editor(s):
na
Date Published:
Subject(s) / Keyword(s):
Emojis Personality Gender Differences
Format(s):
Medium: X Size: n/a Other: n/a
Size(s):
n/a
Location:
San Diego, CA
Sponsoring Org:
National Science Foundation
More Like this
  1. Al-Nofaie, H (Ed.)
    Prior research has demonstrated relationships between personality traits of social media users and the language used in their posts. Few studies have examined whether there are relationships between personality traits of users and how they use emojis in their social media posts. Emojis are digital pictographs used to express ideas and emotions. There are thousands of emojis, which depict faces with expressions, objects, animals, and activities. We conducted a study with two samples (n = 76 andn = 245) in which we examined how emoji use on X (formerly Twitter) related to users’ personality traits and language use in posts. Personality traits were assessed from participants in an online survey. With participants’ consent, we analyzed word usage in posts. Word frequencies were calculated using the Linguistic Inquiry Word Count (LIWC). In both samples, the results showed that those who used the most emojis had the lowest levels of openness to experience. Emoji use was unrelated to the other personality traits. In sample 1, emoji use was also related to use of words related to family, positive emotion, and sadness and less frequent use of articles and words related to insight. In sample 2, more frequent use of emojis in posts was related to more frequent use ofyoupronouns,Ipronouns, and more frequent use of negative function words and words related to time. The results support the view that social media users’ characteristics may be gleaned from the content of their social media posts. 
    more » « less
  2. Sentiment analysis on large-scale social media data is important to bridge the gaps between social media contents and real world activities including political election prediction, individual and public emotional status monitoring and analysis, and so on. Although textual sentiment analysis has been well studied based on platforms such as Twitter and Instagram, analysis of the role of extensive emoji uses in sentiment analysis remains light. In this paper, we propose a novel scheme for Twitter sentiment analysis with extra attention on emojis.We first learn bi-sense emoji embeddings under positive and negative sentimental tweets individually, and then train a sentiment classifier by attending on these bi-sense emoji embeddings with an attention-based long short-term memory network (LSTM). Our experiments show that the bi-sense embedding is effective for extracting sentiment-aware embeddings of emojis and outperforms the state-of-the-art models. We also visualize the attentions to show that the bi-sense emoji embedding provides better guidance on the attention mechanism to obtain a more robust understanding of the semantics and sentiments. 
    more » « less
  3. Danforth, Christopher M. (Ed.)
    Emotions at work have long been identified as critical signals of work motivations, status, and attitudes, and as predictors of various work-related outcomes. When more and more employees work remotely, these emotional signals of workers become harder to observe through daily, face-to-face communications. The use of online platforms to communicate and collaborate at work provides an alternative channel to monitor the emotions of workers. This paper studies how emojis, as non-verbal cues in online communications, can be used for such purposes and how the emotional signals in emoji usage can be used to predict future behavior of workers. In particular, we present how the developers on GitHub use emojis in their work-related activities. We show that developers have diverse patterns of emoji usage, which can be related to their working status including activity levels, types of work, types of communications, time management, and other behavioral patterns. Developers who use emojis in their posts are significantly less likely to dropout from the online work platform. Surprisingly, solely using emoji usage as features, standard machine learning models can predict future dropouts of developers at a satisfactory accuracy. Features related to the general use and the emotions of emojis appear to be important factors, while they do not rule out paths through other purposes of emoji use. 
    more » « less
  4. Abstract BackgroundWildfire smoke contributes substantially to the global disease burden and is a major cause of air pollution in the US states of Oregon and Washington. Climate change is expected to bring more wildfires to this region. Social media is a popular platform for health promotion and a need exists for effective communication about smoke risks and mitigation measures to educate citizens and safeguard public health. MethodsUsing a sample of 1,287 Tweets from 2022, we aimed to analyze temporal Tweeting patterns in relation to potential smoke exposure and evaluate and compare institutions’ use of social media communication best practices which include (i) encouraging adoption of smoke-protective actions; (ii) leveraging numeric, verbal, and Air Quality Index risk information; and (iii) promoting community-building. Tweets were characterized using keyword searches and the Linguistic Inquiry and Word Count (LIWC) software. Descriptive and inferential statistics were carried out. Results44% of Tweets in our sample were authored between January-August 2022, prior to peak wildfire smoke levels, whereas 54% of Tweets were authored during the two-month peak in smoke (September-October). Institutional accounts used Twitter (or X) to encourage the adoption of smoke-related protective actions (82% of Tweets), more than they used it to disseminate wildfire smoke risk information (25%) or promote community-building (47%). Only 10% of Tweets discussed populations vulnerable to wildfire smoke health effects, and 14% mentioned smoke mitigation measures. Tweets from Washington-based accounts used significantly more verbal and numeric risk information to discuss wildfire smoke than Oregon-based accounts (p = 0.042 andp = 0.003, respectively); however, Tweets from Oregon-based accounts on average contained a higher percentage of words associated with community-building language (p < 0.001). ConclusionsThis research provides practical recommendations for public health practitioners and researchers communicating wildfire smoke risks on social media. As exposures to wildfire smoke rise due to climate change, reducing the environmental disease burden requires health officials to leverage popular communication platforms, distribute necessary health-related messaging rapidly, and get the message right. Timely, evidence-based, and theory-driven messaging is critical for educating and empowering individuals to make informed decisions about protecting themselves from harmful exposures. Thus, proactive and sustained communications about wildfire smoke should be prioritized even during wildfire “off-seasons.” 
    more » « less
  5. Abstract There is a growing interest in using social media content for Natural Language Processing applications. However, it is not easy to computationally identify the most relevant set of tweets related to any specific event. Challenging semantics coupled with different ways for using natural language in social media make it difficult for retrieving the most relevant set of data from any social media outlet. This paper seeks to demonstrate a way to present the changing semantics of Twitter within the context of a crisis event, specifically tweets during Hurricane Irma. These methods can be used to identify the most relevant corpus of text for analysis in relevance to a specific incident such as a hurricane. Using an implementation of the Word2Vec method of Neural Network training mechanisms to create Word Embeddings, this paper will: discuss how the relative meaning of words changes as events unfold; present a mechanism for scoring tweets based upon dynamic, relative context relatedness; and show that similarity between words is not necessarily static. We present different methods for training the vector model in Word2Vec for identification of the most relevant tweets for any search query. The impact of tuning parameters such as Word Window Size, Minimum Word Frequency, Hidden Layer Dimensionality, and Negative Sampling on model performance was explored. The window containing the local maximum for AU_ROC for each parameter serves as a guide for other studies using the methods presented here for social media data analysis. 
    more » « less