Abstract Pre-training is a powerful paradigm in machine learning for passing information across models. For example, suppose one has a modest-sized dataset of images of cats and dogs and plans to fit a deep neural network to classify them. With pre-training, we start with a neural network trained on a large corpus of images of not just cats and dogs but hundreds of classes. We fix all network weights except the top layer(s) and fine-tune on our dataset. This often results in dramatically better performance than training solely on our dataset. Here, we ask: ‘Can pre-training help the lasso?’. We propose a framework where the lasso is fit on a large dataset and then fine-tuned on a smaller dataset. The latter can be a subset of the original, or have a different but related outcome. This framework has a wide variety of applications, including stratified and multi-response models. In the stratified model setting, lasso pre-training first estimates coefficients common to all groups, then estimates group-specific coefficients during fine-tuning. Under appropriate assumptions, support recovery of the common coefficients is superior to the usual lasso trained on individual groups. This separate identification of common and individual coefficients also aids scientific understanding.
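The stratified-model idea above can be caricatured as a two-stage fit: a lasso on the pooled data estimates the common coefficients, then a per-group lasso on the residuals picks up group-specific deviations. A minimal sketch on synthetic data, assuming scikit-learn's `Lasso`; the variable names, penalty levels, and residual-refit procedure are illustrative, not the paper's exact algorithm:

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n_per_group, p = 200, 20

# Common coefficients shared by both groups, plus a group-specific offset.
beta_common = np.zeros(p)
beta_common[:3] = [2.0, -1.5, 1.0]
beta_group = {0: np.zeros(p), 1: np.zeros(p)}
beta_group[1][5] = 1.5  # feature 5 matters only in group 1

X, y, g = [], [], []
for grp in (0, 1):
    Xg = rng.normal(size=(n_per_group, p))
    yg = Xg @ (beta_common + beta_group[grp]) + 0.1 * rng.normal(size=n_per_group)
    X.append(Xg); y.append(yg); g.append(np.full(n_per_group, grp))
X, y, g = np.vstack(X), np.concatenate(y), np.concatenate(g)

# Stage 1 ("pre-training"): one lasso on the pooled data estimates the
# coefficients common to all groups.
common = Lasso(alpha=0.05).fit(X, y)

# Stage 2 ("fine-tuning"): within each group, a lasso on the residuals
# estimates the group-specific deviations from the common fit.
specific = {}
for grp in (0, 1):
    mask = g == grp
    resid = y[mask] - common.predict(X[mask])
    specific[grp] = Lasso(alpha=0.05).fit(X[mask], resid)

print(np.nonzero(common.coef_)[0])       # support of the common fit
print(np.nonzero(specific[1].coef_)[0])  # group-1-specific support
```

With enough pooled data, the stage-1 support should contain the truly common features, while the stage-2 fit isolates the feature active only in group 1.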
Learning to Interpret Satellite Images using Wikipedia
Despite recent progress in computer vision, fine-grained interpretation of satellite images remains challenging because of a lack of labeled training data. To overcome this limitation, we construct a novel dataset called WikiSatNet by pairing geo-referenced Wikipedia articles with satellite imagery of their corresponding locations. We then propose two strategies to learn representations of satellite images by predicting properties of the corresponding articles from the images. Leveraging this new multi-modal dataset, we can drastically reduce the quantity of human-annotated labels and the time required for downstream tasks. On the recently released fMoW dataset, our pre-training strategies can boost the performance of a model pre-trained on ImageNet by up to 4.5% in F1 score.
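The pre-training objective here can be caricatured as regressing a representation of the paired article from image features. A toy sketch with synthetic stand-ins (the real pipeline uses a learned CNN image encoder and article embeddings; everything below, including the linear model and least-squares fit, is an illustrative assumption):

```python
import numpy as np

rng = np.random.default_rng(1)
n, d_img, d_txt = 500, 64, 32

# Synthetic stand-ins for CNN image features and text embeddings of the
# paired Wikipedia articles (hypothetical; not WikiSatNet data).
W_true = rng.normal(size=(d_img, d_txt))
img_feats = rng.normal(size=(n, d_img))
txt_embeds = img_feats @ W_true + 0.1 * rng.normal(size=(n, d_txt))

# "Pre-training" objective: predict the article embedding from the image
# features; here solved in closed form with least squares.
W, *_ = np.linalg.lstsq(img_feats, txt_embeds, rcond=None)

# Measure alignment between predicted and true article embeddings.
pred = img_feats @ W
cos = np.sum(pred * txt_embeds, axis=1) / (
    np.linalg.norm(pred, axis=1) * np.linalg.norm(txt_embeds, axis=1))
print(round(float(cos.mean()), 3))
```

The point of the sketch is only the shape of the objective: image features on one side, article-derived targets on the other, with no human labels needed.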
- Award ID(s): 1651565
- PAR ID: 10136095
- Date Published:
- Journal Name: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
- Page Range / eLocation ID: 3620 to 3626
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
Abstract The rapid intensification (RI) of tropical cyclones (TCs), defined here as an intensity increase of ≥ 30 kt in 24 hours, is a difficult but important forecasting problem. Operational RI forecasts have considerably improved since the late 2000s, largely thanks to better statistical models, including machine learning (ML). Most ML applications use scalars from the Statistical Hurricane Intensity Prediction Scheme (SHIPS) development dataset as predictors, describing the TC history, near-TC environment, and satellite presentation of the TC. More recent ML applications use convolutional neural networks (CNN), which can ingest full satellite images (or time series of images) and freely “decide” which spatiotemporal features are important for RI. However, two questions remain unanswered: (1) Does image convolution significantly improve RI skill? (2) What strategies do CNNs use for RI prediction – and can we gain new insights from these strategies? We use an ablation experiment to answer the first question and explainable artificial intelligence (XAI) to answer the second. Convolution leads to only a small performance gain, likely because, as revealed by XAI, the CNN’s main strategy uses image features already well described in scalar predictors used by pre-existing RI models. This work makes three additional contributions to the literature: (1) NNs with SHIPS data outperform pre-existing models in some aspects; (2) NNs provide well calibrated uncertainty quantification (UQ), while pre-existing models have no UQ; (3) the NN without SHIPS data performs surprisingly well and is fairly independent of pre-existing models, suggesting its potential value in an operational ensemble.
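The RI definition stated above (intensity increase of ≥ 30 kt in 24 hours) translates directly into a labeling rule over a best-track intensity time series. A minimal sketch, assuming 6-hourly intensities; the function name and storm values are hypothetical:

```python
import numpy as np

def rapid_intensification_labels(intensity_kt, hours_per_step=6,
                                 window_h=24, threshold_kt=30):
    """Label each time step 1 if intensity rises by >= threshold_kt over
    the following window_h hours, else 0 (the RI definition above)."""
    steps = window_h // hours_per_step
    v = np.asarray(intensity_kt, dtype=float)
    labels = np.zeros(len(v) - steps, dtype=int)
    for t in range(len(labels)):
        labels[t] = int(v[t + steps] - v[t] >= threshold_kt)
    return labels

# 6-hourly intensities (kt) for a hypothetical storm:
vmax = [35, 40, 50, 65, 70, 72, 75]
print(rapid_intensification_labels(vmax).tolist())  # → [1, 1, 0]
```

The first two windows gain 35 kt and 32 kt (RI), while the third gains only 25 kt; these binary labels are what the statistical and CNN models are trained to predict.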
Global warming is one of the world’s most pressing issues. The study of its effects on the polar ice caps and other arctic environments, however, can be hindered by the often dangerous and difficult-to-navigate terrain found there. Multi-terrain autonomous vehicles can assist researchers by providing a mobile platform on which to collect data in these harsh environments while avoiding any risk to human life and speeding up the research process. The mechanical design and ultimate efficacy of these autonomous robotic vehicles depends largely on the specific missions they are deployed for, but terrain conditions can vary wildly geographically as well as seasonally, making mission planning for these unmanned vehicles more difficult. This paper proposes the use of various UNet-based neural network architectures to generate digital elevation maps from satellite images, and explores and compares their efficacy on a single set of training and validation datasets generated from satellite imagery. These digital elevation maps generated by the model could be used by researchers not only to track the change in arctic topography over time, but to quickly provide autonomous exploratory research rovers with the topographical information necessary to decide on optimal paths during the mission. This paper analyzes different model architectures and training schemes: a traditional UNet, a traditional UNet with data augmentation, a UNet with a single active skip-layer vision transformer (ViT), and a UNet with multiple active skip-layer ViTs. Each model was trained on a dataset of satellite images and corresponding digital elevation maps of Ellesmere Island, Canada. Utilizing ViTs did not yield a significant improvement in UNet performance, though this could change with longer training.
This paper proposes opportunities to improve performance for these neural networks, as well as next steps for further research, including improving the diversity of images in the dataset, generating a testing dataset from a completely different geographic location, and allowing the models more time to train.
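One subtlety in the "UNet with data augmentation" scheme mentioned above is that geometric augmentations must be applied identically to the satellite image and its digital elevation map, or the pixel-to-elevation pairing breaks. A minimal sketch of such paired augmentation, assuming NumPy arrays; the function and the specific flip/rotation choices are illustrative, not the paper's pipeline:

```python
import numpy as np

def augment_pair(image, dem, rng):
    """Apply the same random flip/rotation to a satellite image (H x W x C)
    and its digital elevation map (H x W), preserving their alignment."""
    k = rng.integers(0, 4)  # number of 90-degree rotations
    image, dem = np.rot90(image, k, axes=(0, 1)), np.rot90(dem, k)
    if rng.random() < 0.5:  # random horizontal flip, applied to both
        image, dem = image[:, ::-1], dem[:, ::-1]
    return np.ascontiguousarray(image), np.ascontiguousarray(dem)

rng = np.random.default_rng(0)
dem = np.arange(16).reshape(4, 4).astype(float)   # toy elevation map
img = np.stack([dem, dem, dem], axis=-1)           # toy 3-channel image
aug_img, aug_dem = augment_pair(img, dem, rng)
print(aug_img.shape, aug_dem.shape)  # → (4, 4, 3) (4, 4)
```

Because the toy image's channels were built from the DEM itself, checking that `aug_img[..., 0]` still matches `aug_dem` confirms the two received the same transform.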
Training a referring expression comprehension (ReC) model for a new visual domain requires collecting referring expressions, and potentially corresponding bounding boxes, for images in the domain. While large-scale pre-trained models are useful for image classification across domains, it remains unclear if they can be applied in a zero-shot manner to more complex tasks like ReC. We present ReCLIP, a simple but strong zero-shot baseline that repurposes CLIP, a state-of-the-art large-scale model, for ReC. Motivated by the close connection between ReC and CLIP’s contrastive pre-training objective, the first component of ReCLIP is a region-scoring method that isolates object proposals via cropping and blurring, and passes them to CLIP. However, through controlled experiments on a synthetic dataset, we find that CLIP is largely incapable of performing spatial reasoning off-the-shelf; ReCLIP’s second component is therefore a spatial relation resolver that handles several types of spatial relations. We reduce the gap between zero-shot baselines from prior work and supervised models by as much as 29% on RefCOCOg, and on RefGTA (video game imagery), ReCLIP’s relative improvement over supervised ReC models trained on real images is 8%.
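ReCLIP's crop-and-blur region scoring can be sketched as: blur the whole image, then paste the sharp proposal region back, so the scorer's attention is drawn to that region. A minimal NumPy sketch; the mean-filter blur is a stand-in for the actual blur, and the CLIP scoring step itself is omitted:

```python
import numpy as np

def box_blur(img, k=5):
    """Simple k x k mean filter; a stand-in for the blur used to
    de-emphasize everything outside a proposal."""
    pad = k // 2
    padded = np.pad(img, pad, mode="edge")
    out = np.zeros_like(img, dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return out / (k * k)

def isolate_proposal(img, box):
    """Blur the image, then restore the sharp proposal region,
    mimicking ReCLIP-style crop-and-blur region isolation."""
    y0, x0, y1, x1 = box
    out = box_blur(img)
    out[y0:y1, x0:x1] = img[y0:y1, x0:x1]
    return out

img = np.zeros((32, 32))
img[8:16, 8:16] = 1.0  # toy grayscale image with one bright object
iso = isolate_proposal(img, (8, 8, 16, 16))
# Inside the box the image is untouched; outside it is smoothed.
print(np.array_equal(iso[8:16, 8:16], img[8:16, 8:16]))  # → True
```

In the real method, each such isolated view would be encoded by CLIP and scored against the referring expression; the proposal with the highest score wins.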
Masked autoencoders employ random masking to effectively reconstruct input images using self-supervised techniques, which allows for efficient training on large datasets. However, the random masking strategy does not adequately tap into information encapsulated within high-dimensional hyperspectral satellite imagery that is used in several domains. We propose a novel masking strategy, HOGMAE, based on the Histogram of Oriented Gradients that incorporates rich information inherent within satellite images during the mask creation step. Our experiments, over a hyperspectral satellite dataset, demonstrate the effectiveness of our methodology.
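The HOG-informed masking idea can be sketched as: score each patch by its histogram-of-oriented-gradients energy, then select patches to mask based on those scores. A minimal single-band sketch in NumPy; the exact selection rule (here, masking the most structured patches) is an assumption, not necessarily HOGMAE's:

```python
import numpy as np

def hog_patch_scores(img, patch=8, n_bins=9):
    """Per-patch HOG energy: a proxy for how much oriented structure
    each patch contains."""
    gy, gx = np.gradient(img.astype(float))
    mag = np.hypot(gx, gy)
    ang = np.mod(np.arctan2(gy, gx), np.pi)  # unsigned orientations
    H, W = img.shape
    scores = np.zeros((H // patch, W // patch))
    for i in range(H // patch):
        for j in range(W // patch):
            sl = np.s_[i * patch:(i + 1) * patch, j * patch:(j + 1) * patch]
            hist, _ = np.histogram(ang[sl], bins=n_bins, range=(0, np.pi),
                                   weights=mag[sl])
            scores[i, j] = np.linalg.norm(hist)
    return scores

# Toy image: flat background with a textured square in one corner.
rng = np.random.default_rng(0)
img = np.zeros((32, 32))
img[:8, :8] = rng.random((8, 8))
scores = hog_patch_scores(img)

# Mask the patches with the most gradient structure (assumed rule).
n_mask = 2
flat = np.argsort(scores, axis=None)[::-1][:n_mask]
masked = set(map(tuple, np.array(np.unravel_index(flat, scores.shape)).T))
print(sorted(masked))
```

On this toy input the textured corner patch scores highest, so a structure-aware mask concentrates there rather than falling uniformly at random. A hyperspectral version would aggregate such scores across bands.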

