Cuckoo Feature Hashing: Dynamic Weight Sharing for Sparse Analytics

Gao, Jinyang; Ooi, Beng Chin; Shen, Yanyan; Lee, Wang-Chien

doi:10.24963/ijcai.2018/295

Feature hashing is widely used to process large-scale sparse features when learning predictive models. Collisions are inherent in the hashing process and hurt model performance. In this paper, we develop a feature hashing scheme called Cuckoo Feature Hashing (CCFH), based on the principle behind Cuckoo hashing, a hashing scheme designed to resolve collisions. By providing multiple possible hash locations for each feature, CCFH prevents collisions between predictive features by dynamically hashing them into alternative locations during model training. Experimental results on prediction tasks with hundreds of millions of features demonstrate that CCFH can achieve the same level of performance using only 15%-25% of the parameters required by conventional feature hashing.
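The idea of giving each feature more than one possible hash location can be illustrated with a minimal Python sketch. This is not the training procedure from the paper: the greedy "first free candidate" rule below is a simplified stand-in for CCFH's dynamic assignment, and the names candidate_slots, single_hash_assign, greedy_multi_choice_assign, and count_collisions, the MD5-based hashing, and the bucket count are all assumptions made for illustration.

```python
import hashlib


def candidate_slots(feature, num_buckets, num_choices=2):
    """Return num_choices candidate bucket indices for a feature name.

    Each candidate comes from hashing the feature with a different seed prefix.
    """
    slots = []
    for seed in range(num_choices):
        digest = hashlib.md5(f"{seed}:{feature}".encode("utf-8")).digest()
        slots.append(int.from_bytes(digest[:8], "big") % num_buckets)
    return slots


def single_hash_assign(features, num_buckets):
    """Conventional feature hashing: every feature uses its one hash location."""
    return {f: candidate_slots(f, num_buckets, 1)[0] for f in features}


def greedy_multi_choice_assign(features, num_buckets, num_choices=2):
    """Hypothetical stand-in for dynamic assignment (not the paper's method).

    Each feature takes its first candidate bucket that is still free and
    falls back to its first candidate when all of them are occupied.
    """
    taken = set()
    assignment = {}
    for f in features:
        slots = candidate_slots(f, num_buckets, num_choices)
        chosen = next((s for s in slots if s not in taken), slots[0])
        taken.add(chosen)
        assignment[f] = chosen
    return assignment


def count_collisions(assignment):
    """Count features that share a bucket with a previously assigned feature."""
    return len(assignment) - len(set(assignment.values()))


if __name__ == "__main__":
    # Illustrative feature names and table size (assumed, not from the paper).
    features = [f"user_id={i}" for i in range(50)]
    num_buckets = 64
    single = single_hash_assign(features, num_buckets)
    multi = greedy_multi_choice_assign(features, num_buckets)
    print("collisions, one location per feature: ", count_collisions(single))
    print("collisions, two locations per feature:", count_collisions(multi))
```

With this greedy rule, the second candidate location can only reduce the number of shared buckets or leave it unchanged, which is the intuition behind offering alternative locations. In CCFH the choice among locations is made during model training rather than by a fixed first-come rule.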

Award ID(s): 1717084

PAR ID: 10065400

Author(s) / Creator(s): Gao, Jinyang; Ooi, Beng Chin; Shen, Yanyan; Lee, Wang-Chien

Date Published: 2018-07-13

Journal Name: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence

Page Range / eLocation ID: 2135 to 2141

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.24963/ijcai.2018/295
