Segmentation of Tweets with URLs and its Applications to Sentiment Analysis

Abdullah Aljebreen, Weiyi Meng

Citation Details

An important means for disseminating information in social media platforms is by including URLs that point to external sources in user posts. In Twitter, we estimate that about 21% of the daily stream of English-language tweets contain URLs. We notice that NLP tools make little attempt at understanding the relationship between the content of the URL and the text surrounding it in a tweet. In this work, we study the structure of tweets with URLs relative to the content of the Web documents pointed to by the URLs. We identify several segments classes that may appear in a tweet with URLs, such as the title of a Web page and the user's original content. Our goals in this paper are: introduce, define, and analyze the segmentation problem of tweets with URLs, develop an effective algorithm to solve it, and show that our solution can benefit sentiment analysis on Twitter. We also show that the problem is an instance of the block edit distance problem, and thus an NP-hard problem. more »

Award ID(s):: 1838145

PAR ID:: 10292515

Author(s) / Creator(s):: Abdullah Aljebreen, Weiyi Meng

Date Published:: 2021-01-01

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 35

Issue:: 14

ISSN:: 2159-5399

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this