A fast algorithm to factorize high-dimensional Tensor Product matrices used in Genetic Models

Lopez-Cruz, Marco; Pérez-Rodríguez, Paulino; de los Campos, Gustavo

doi:10.1093/g3journal/jkae001

Citation Details

A fast algorithm to factorize high-dimensional Tensor Product matrices used in Genetic Models

Abstract Many genetic models (including models for epistatic effects as well as genetic-by-environment) involve covariance structures that are Hadamard products of lower rank matrices. Implementing these models require factorizing large Hadamard product matrices. The available algorithms for factorization do not scale well for big data, making the use of some of these models not feasible with large sample sizes. Here, based on properties of Hadamard products and (related) Kronecker products we propose an algorithm that produces an approximate decomposition that is orders of magnitude faster than the standard eigenvalue decomposition. In this article, we describe the algorithm, show how it can be used to factorize large Hadamard product matrices, present benchmarks, and illustrate the use of the method by presenting an analysis of data from the northern testing locations of the G×E project from the Genomes-to-Fields Initiative (n∼60,000). We implemented the proposed algorithm in the open-source ‘tensorEVD’ R-package. more »

Award ID(s):: 2035472

PAR ID:: 10489076

Author(s) / Creator(s):: Lopez-Cruz, Marco; Pérez-Rodríguez, Paulino; de los Campos, Gustavo

Editor(s):: Lipka, Alexander

Publisher / Repository:: OXFORD

Date Published:: 2024-01-05

Journal Name:: G3: Genes, Genomes, Genetics

ISSN:: 2160-1836

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1093/g3journal/jkae001

More Like this