Fake News Detection Enhancement with Data Imputation

Kotteti, Chandra Mouli; Dong, XIshuang; Li, Na; Qian, Lijun

doi:10.1109/DASC/PiCom/DataCom/CyberSciTec.2018.00042

Citation Details

Fake News Detection Enhancement with Data Imputation

Raw datasets collected for fake news detection usually contain some noise such as missing values. In order to improve the performance of machine learning based fake news detection, a novel data preprocessing method is proposed in this paper to process the missing values. Specifically, we have successfully handled the missing values problem by using data imputation for both categorical and numerical features. For categorical features, we imputed missing values with the most frequent value in the columns. For numerical features, the mean value of the column is used to impute numerical missing values. In addition, TF-IDF vectorization is applied in feature extraction to filter out irrelevant features. Experimental results show that Multi-Layer Perceptron (MLP) classifier with the proposed data preprocessing method outperforms baselines and improves the prediction accuracy by more than 15%. more »

Award ID(s):: 1712496

PAR ID:: 10438715

Author(s) / Creator(s):: Kotteti, Chandra Mouli; Dong, XIshuang; Li, Na; Qian, Lijun

Date Published:: 2018-08-01

Journal Name:: 2018 IEEE 16th Intl Conf on Dependable, Autonomic and Secure Computing, 16th Intl Conf on Pervasive Intelligence and Computing, 4th Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech)

Page Range / eLocation ID:: 187 to 192

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

More Like this