Domain Adaptation for Crisis Data Using Correlation Alignment and Self-Training

Li, Hongmin; Sopova, Oleksandra; Caragea, Doina; Caragea, Cornelia

doi:10.4018/IJISCRAM.2018100101

Citation Details

Domain adaptation methods have been introduced for automatically filtering disaster tweets, to address the lack of labeled data for an emerging disaster. In this article, the authors present and compare two simple yet effective approaches for classifying disaster-related tweets. The first approach leverages the unlabeled target disaster data to align the source disaster distribution with the target distribution and then learns a supervised classifier from the modified source data. The second approach uses self-training to iteratively label the available unlabeled target data and then builds a classifier as a weighted combination of source-specific and target-specific classifiers. Experimental results using Naïve Bayes as the base classifier show that both approaches generally improve performance compared to the baseline. Overall, the self-training approach gives better results than the alignment-based approach. Furthermore, combining correlation alignment with self-training leads to better results, but the results of self-training are still better.
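The two approaches summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract gives no formulas, so the alignment step below assumes the commonly used whitening and re-coloring form of correlation alignment, and the base classifier is assumed to be a small Gaussian Naïve Bayes written in NumPy. All names and parameters (coral_align, TinyGaussianNB, self_train, eps, gamma, k, n_iter) and the "top-k most confident per class" selection rule are hypothetical choices made for illustration.

```python
import numpy as np

def _sym_power(C, p):
    # Matrix power of a symmetric positive-definite matrix via eigendecomposition.
    w, V = np.linalg.eigh(C)
    return (V * w**p) @ V.T

def coral_align(Xs, Xt, eps=1.0):
    # Assumed form of correlation alignment: whiten the source features with
    # the source covariance, then re-color them with the target covariance.
    d = Xs.shape[1]
    Cs = np.cov(Xs, rowvar=False) + eps * np.eye(d)
    Ct = np.cov(Xt, rowvar=False) + eps * np.eye(d)
    return Xs @ _sym_power(Cs, -0.5) @ _sym_power(Ct, 0.5)

class TinyGaussianNB:
    # Illustrative binary Gaussian Naive Bayes (an assumption; the abstract
    # only says "Naive Bayes"). Both classes must be present in y.
    def fit(self, X, y):
        self.mu = np.array([X[y == c].mean(axis=0) for c in (0, 1)])
        self.var = np.array([X[y == c].var(axis=0) for c in (0, 1)]) + 1e-3
        self.log_prior = np.log(np.array([(y == c).mean() for c in (0, 1)]))
        return self

    def predict_proba(self, X):
        diff = X[:, None, :] - self.mu  # shape (n_samples, 2, n_features)
        ll = -0.5 * (diff**2 / self.var + np.log(2 * np.pi * self.var)).sum(axis=2)
        ll = ll + self.log_prior
        ll -= ll.max(axis=1, keepdims=True)  # stabilize before exponentiating
        p = np.exp(ll)
        return p / p.sum(axis=1, keepdims=True)

def self_train(Xs, ys, Xt, gamma=0.5, k=10, n_iter=5):
    # Iteratively pseudo-label target data, then combine the source classifier
    # and a target classifier with weight gamma (hypothetical parameter).
    p_src = TinyGaussianNB().fit(Xs, ys).predict_proba(Xt)
    proba = p_src
    pseudo = np.full(len(Xt), -1)  # -1 marks "still unlabeled"
    for _ in range(n_iter):
        pred, conf = proba.argmax(axis=1), proba.max(axis=1)
        for c in (0, 1):
            cand = np.where((pseudo == -1) & (pred == c))[0]
            top = cand[np.argsort(-conf[cand])[:k]]  # k most confident for class c
            pseudo[top] = c
        done = pseudo != -1
        if len(np.unique(pseudo[done])) < 2:
            continue  # need both classes to fit a target classifier
        tgt = TinyGaussianNB().fit(Xt[done], pseudo[done])
        proba = (1 - gamma) * p_src + gamma * tgt.predict_proba(Xt)
    return proba.argmax(axis=1)
```

Under these assumptions, the alignment-based approach would train a classifier on coral_align(Xs, Xt) with the source labels, while the self-training approach would call self_train(Xs, ys, Xt) directly. A combined variant could pass the aligned source features to self_train.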

Award ID(s): 1741345

PAR ID: 10204866

Author(s) / Creator(s): Li, Hongmin; Sopova, Oleksandra; Caragea, Doina; Caragea, Cornelia

Date Published: 2018-10-01

Journal Name: International Journal of Information Systems for Crisis Response and Management

Volume: 10

Issue: 4

ISSN: 1937-9390

Page Range / eLocation ID: 1 to 20

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.4018/IJISCRAM.2018100101
