An Order Preserving Bilinear Model for Person Detection in Multi-Modal Data

Ulutan, Oytun; Riggan, Benjamin S.; Nasrabadi, Nasser M.; Manjunath, B. S.

doi:10.1109/WACV.2018.00132

Citation Details

An Order Preserving Bilinear Model for Person Detection in Multi-Modal Data

We propose a new order preserving bilinear framework that exploits low-resolution video for person detection in a multi-modal setting using deep neural networks. In this setting cameras are strategically placed such that less robust sensors, e.g. geophones that monitor seismic activity, are located within the field of views (FOVs) of cameras. The primary challenge is being able to leverage sufficient information from videos where there are less than 40 pixels on targets, while also taking advantage of less discriminative information from other modalities, e.g. seismic. Unlike state-of-the-art methods, our bilinear framework retains spatio-temporal order when computing the vector outer products between pairs of features. Despite the high dimensionality of these outer products, we demonstrate that our order preserving bilinear framework yields better performance than recent orderless bilinear models and alternative fusion methods. Code is available at https://github.com/oulutan/OP-Bilinear-Model. more »

Award ID(s):: 1650474

PAR ID:: 10091237

Author(s) / Creator(s):: Ulutan, Oytun; Riggan, Benjamin S.; Nasrabadi, Nasser M.; Manjunath, B. S.

Date Published:: 2018-03-01

Journal Name:: IEEE Winter Conference on Applications of Computer Vision (WACV)

Page Range / eLocation ID:: 1160 to 1169

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/WACV.2018.00132

More Like this