This content will become publicly available on December 5, 2026

Title: Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention
Abstract: State-space models (SSMs) have recently emerged as a compelling alternative to Transformers for sequence modeling tasks. This paper presents a theoretical generalization analysis of selective SSMs, the core architectural component behind the Mamba model. We derive a novel covering number-based generalization bound for selective SSMs, building upon recent theoretical advances in the analysis of Transformer models. Using this result, we analyze how the spectral abscissa of the continuous-time state matrix influences the model's stability during training and its ability to generalize across sequence lengths. We empirically validate our findings on a synthetic majority task, the IMDb sentiment classification benchmark, and the ListOps task, demonstrating how our theoretical insights translate into practical model behavior.
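The bound above is stated in terms of the spectral abscissa of the continuous-time state matrix, i.e. the largest real part among its eigenvalues. As a purely illustrative aid (not the paper's code), the sketch below computes that quantity for a random matrix and shows why pushing it below zero yields a contracting discretized recurrence; the random matrix and the eigenvalue shift used to stabilize it are our own assumptions.

```python
# A minimal numpy sketch (not the paper's code) of the quantity the abstract studies:
# the spectral abscissa alpha(A) = max_i Re(lambda_i(A)) of the continuous-time state
# matrix A. Under zero-order-hold discretization with step dt, the discrete state
# matrix is exp(dt*A), whose spectral radius equals exp(dt*alpha(A)); alpha(A) < 0
# therefore gives contracting hidden-state dynamics, while alpha(A) > 0 lets the
# state grow with sequence length. The random matrix below is purely illustrative.
import numpy as np

def spectral_abscissa(A: np.ndarray) -> float:
    """Largest real part among the eigenvalues of A."""
    return float(np.real(np.linalg.eigvals(A)).max())

def discrete_spectral_radius(A: np.ndarray, dt: float) -> float:
    """Spectral radius of exp(dt*A); by the spectral mapping theorem it is exp(dt*alpha(A))."""
    return float(np.exp(dt * spectral_abscissa(A)))

rng = np.random.default_rng(0)
A = rng.standard_normal((8, 8))
# Shift the spectrum so the abscissa becomes -0.5 (an arbitrary stabilizing choice).
A_stable = A - (spectral_abscissa(A) + 0.5) * np.eye(8)

for name, M in [("raw", A), ("shifted", A_stable)]:
    print(name, spectral_abscissa(M), discrete_spectral_radius(M, dt=0.1))
```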
Award ID(s):
2038493 2208182
PAR ID:
10659914
Author(s) / Creator(s):
Publisher / Repository:
ArXiv; Openreview
Date Published:
Format(s):
Medium: X
Location:
San Diego, CA
Sponsoring Org:
National Science Foundation
More Like this
  1. State-space models (SSMs) have emerged as a potential alternative architecture for building large language models (LLMs) compared to the previously ubiquitous transformer architecture. One theoretical weakness of transformers is that they cannot express certain kinds of sequential computation and state tracking (Merrill & Sabharwal, 2023), which SSMs are explicitly designed to address via their close architectural similarity to recurrent neural networks (RNNs). But do SSMs truly have an advantage (over transformers) in expressive power for state tracking? Surprisingly, the answer is no. Our analysis reveals that the expressive power of SSMs is limited very similarly to transformers: SSMs cannot express computation outside the complexity class TC⁰. In particular, this means they cannot solve simple state-tracking problems like permutation composition. It follows that SSMs are provably unable to accurately track chess moves with certain notation, evaluate code, or track entities in a long narrative. To supplement our formal analysis, we report experiments showing that Mamba-style SSMs indeed struggle with state tracking. Thus, despite its recurrent formulation, the "state" in an SSM is an illusion: SSMs have similar expressiveness limitations to non-recurrent models like transformers, which may fundamentally limit their ability to solve real-world state-tracking problems.
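Permutation composition, the state-tracking problem singled out in the abstract above, is easy to state concretely: given a stream of permutations of a small set, output their running composition. The sketch below is our own illustration of that task over S_5 (the group, sequence length, and encoding are arbitrary choices), not the authors' benchmark; the paper's claim is that fixed-depth SSMs and transformers cannot compute the final composed permutation for arbitrarily long inputs.

```python
# A small illustration (not the authors' benchmark) of the permutation-composition
# state-tracking task: each example is a sequence of permutations of {0,...,4},
# and the target at every step is the composition of everything seen so far.
import itertools
import random

PERMS = list(itertools.permutations(range(5)))  # the 120 elements of S_5

def compose(p, q):
    """Apply q first, then p (composition of permutations represented as tuples)."""
    return tuple(p[q[i]] for i in range(len(q)))

def make_example(length, rng):
    seq = [rng.choice(PERMS) for _ in range(length)]
    state = tuple(range(5))          # start from the identity permutation
    targets = []
    for p in seq:
        state = compose(p, state)    # track the composed permutation so far
        targets.append(state)
    return seq, targets              # a model must predict targets[-1] from seq

rng = random.Random(0)
tokens, labels = make_example(length=16, rng=rng)
print(tokens[0], labels[-1])
```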
  2. State-space models (SSMs), such as Mamba (Gu & Dao, 2023), have been proposed as alternatives to Transformer networks in language modeling, incorporating gating, convolutions, and input-dependent token selection to mitigate the quadratic cost of multi-head attention. Although SSMs exhibit competitive performance, their in-context learning (ICL) capabilities, a remarkable emergent property of modern language models that enables task execution without parameter optimization, remain less explored compared to Transformers. In this study, we evaluate the ICL performance of SSMs, focusing on Mamba, against Transformer models across various tasks. Our results show that SSMs perform comparably to Transformers in standard regression ICL tasks, while outperforming them in tasks like sparse parity learning. However, SSMs fall short in tasks involving non-standard retrieval functionality. To address these limitations, we introduce a hybrid model, MambaFormer, that combines Mamba with attention blocks, surpassing individual models in tasks where they struggle independently. Our findings suggest that hybrid architectures offer promising avenues for enhancing ICL in language models. 
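Sparse parity, one of the in-context tasks on which the abstract reports SSMs doing well, can be phrased as a prompt of (x, y) demonstrations that all share a hidden parity rule, followed by a query the model must label. The snippet below is a hedged sketch of how such prompts might be generated; the input dimension, subset size, and prompt layout are our assumptions rather than the paper's protocol.

```python
# A hedged sketch of in-context sparse-parity prompt generation: every (x, y) pair in
# a prompt shares one hidden rule y = parity of x restricted to a secret index set;
# the model must infer the rule from the demonstrations and label the final query.
# Dimensions, subset size, and prompt layout are illustrative assumptions.
import numpy as np

def sparse_parity_prompt(n_pairs=32, dim=10, k=3, rng=None):
    rng = rng or np.random.default_rng()
    support = rng.choice(dim, size=k, replace=False)            # hidden relevant coordinates
    xs = rng.integers(0, 2, size=(n_pairs + 1, dim))            # last row is the query
    ys = xs[:, support].sum(axis=1) % 2                         # parity over the hidden subset
    context = np.concatenate([xs[:-1], ys[:-1, None]], axis=1)  # (x, y) demonstration pairs
    return context, xs[-1], ys[-1]                              # demos, query x, query label

context, query_x, query_y = sparse_parity_prompt(rng=np.random.default_rng(0))
print(context.shape, query_x, int(query_y))
```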
  3. Transformer models have been widely investigated in different domains by providing long-range dependency handling and global contextual awareness, driving the development of popular AI applications such as ChatGPT, Gemini, and Alexa. State Space Models (SSMs) have emerged as strong contenders in the field of sequential modeling, challenging the dominance of Transformers. SSMs incorporate a selective mechanism that allows for dynamic parameter adjustment based on input data, enhancing their performance. However, this mechanism also comes with increased computational complexity and bandwidth demands, posing challenges for deployment on resource-constrained mobile devices. To address these challenges without sacrificing the accuracy of the selective mechanism, we propose a sparse learning framework that integrates architecture-aware compiler optimizations. We introduce an end-to-end solution, C4n kernel sparsity, which prunes n elements from every four contiguous weights, and develop a compiler-based acceleration solution to ensure execution efficiency for this sparsity on mobile devices. Based on the kernel sparsity, our framework generates optimized sparse models targeting specific sparsity or latency requirements for various model sizes. We further leverage pruned weights to compensate for the remaining weights, enhancing downstream task performance. For practical hardware acceleration, we propose C4n-specific optimizations combined with a layout transformation elimination strategy. This approach mitigates inefficiencies arising from fine-grained pruning in linear layers and improves performance across other operations. Experimental results demonstrate that our method achieves superior task performance compared to other semi-structured pruning methods and achieves up to a 7× speedup over the llama.cpp framework on mobile devices.
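The C4n pattern described above keeps only some of every four contiguous weights, much like the familiar 2:4 semi-structured sparsity. Below is a minimal magnitude-based sketch of that selection step; the scoring rule and the handling of weight shapes are illustrative assumptions, and the compiler-side layout optimizations that the paper pairs with the sparsity are not shown.

```python
# A minimal sketch of the "prune n out of every 4 contiguous weights" pattern
# described above, using weight magnitude as the pruning score. The scoring rule
# and the assumption that the weight count is divisible by 4 are illustrative;
# the paper's framework additionally relies on compiler-level layout optimizations.
import torch

def prune_4_to_n(weight: torch.Tensor, n: int) -> torch.Tensor:
    """Zero the n smallest-magnitude entries in every group of 4 contiguous weights."""
    assert 0 <= n <= 4 and weight.numel() % 4 == 0
    w = weight.reshape(-1, 4)                  # groups of 4 contiguous weights
    idx = w.abs().argsort(dim=1)[:, :n]        # indices of the n smallest magnitudes per group
    mask = torch.ones_like(w)
    mask.scatter_(1, idx, 0.0)                 # drop them
    return (w * mask).reshape(weight.shape)

w = torch.randn(8, 16)
w_sparse = prune_4_to_n(w, n=2)                # 2:4-style sparsity as a special case
print((w_sparse == 0).float().mean().item())   # ~0.5 of the weights are zeroed
```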
  4. The demand for machine intelligence capable of processing continuous, long-context inputs on local devices is growing rapidly. However, the quadratic complexity and memory requirements of traditional Transformer architectures make them inefficient and often unusable for these tasks. This has spurred a paradigm shift towards new architectures like State Space Models (SSMs) and hybrids, which promise near-linear scaling. While most current research focuses on the accuracy and theoretical throughput of these models, a systematic performance characterization on practical consumer hardware is critically needed to guide system-level optimization and unlock new applications. To address this gap, we present a comprehensive, comparative benchmarking of carefully selected Transformer, SSM, and hybrid models specifically for long-context inference on consumer and embedded GPUs. Our analysis reveals that SSMs are not only viable but superior for this domain, capable of processing sequences of up to 220K tokens on a 24GB consumer GPU, approximately 4x longer than comparable Transformers. While Transformers may be up to 1.8x faster at short sequences, SSMs demonstrate a dramatic performance inversion, becoming up to 4x faster at very long contexts (~57K tokens). Our operator-level analysis reveals that custom, hardware-aware SSM kernels dominate the inference runtime, accounting for over 55% of latency on edge platforms, identifying them as a primary target for future hardware acceleration. We also provide detailed, device-specific characterization results to guide system co-design for the edge.
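The characterization above rests on latency sweeps over increasing context lengths. The snippet below sketches one standard way to run such a prefill-latency sweep in PyTorch with CUDA events; `load_model` is a placeholder for whichever Transformer, SSM, or hybrid model is under test, and the sequence lengths and iteration counts are arbitrary, so this is an illustration of the methodology rather than the authors' harness.

```python
# A hedged sketch of a long-context prefill-latency sweep of the kind described above.
# `load_model` is a placeholder for the Transformer/SSM/hybrid model under test; the
# vocabulary size, sequence lengths, batch size, and warm-up counts are arbitrary.
import torch

def prefill_latency_ms(model, seq_len, vocab_size=32000, warmup=3, iters=10):
    tokens = torch.randint(0, vocab_size, (1, seq_len), device="cuda")
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    with torch.no_grad():
        for _ in range(warmup):          # warm up kernels and allocator
            model(tokens)
        torch.cuda.synchronize()
        start.record()
        for _ in range(iters):
            model(tokens)
        end.record()
        torch.cuda.synchronize()
    return start.elapsed_time(end) / iters   # milliseconds per forward pass

# Example sweep toward the long-context regime (requires a real `load_model`):
# for seq_len in (1024, 8192, 57_000, 220_000):
#     print(seq_len, prefill_latency_ms(load_model(), seq_len))
```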
  5. Large language models have the ability to generate text that mimics patterns in their inputs. We introduce a simple Markov Chain sequence modeling task in order to study how this in-context learning (ICL) capability emerges. In our setting, each example is sampled from a Markov chain drawn from a prior distribution over Markov chains. Transformers trained on this task form statistical induction heads which compute accurate next-token probabilities given the bigram statistics of the context. During the course of training, models pass through multiple phases: after an initial stage in which predictions are uniform, they learn to sub-optimally predict using in-context single-token statistics (unigrams); then, there is a rapid phase transition to the correct in-context bigram solution. We conduct an empirical and theoretical investigation of this multi-phase process, showing how successful learning results from the interaction between the transformer's layers, and uncovering evidence that the presence of the simpler unigram solution may delay formation of the final bigram solution. We examine how learning is affected by varying the prior distribution over Markov chains, and consider the generalization of our in-context learning of Markov chains (ICL-MC) task to n-grams for n > 2.
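The ICL-MC setup above is straightforward to reproduce at toy scale: draw a transition matrix from a Dirichlet prior, sample a token sequence from the resulting Markov chain, and compare a trained model's next-token prediction against the in-context bigram estimate that a statistical induction head would compute. The sketch below follows that recipe under our own choices of vocabulary size, prior concentration, and sequence length.

```python
# A small sketch of the ICL-MC data distribution described above: each example is a
# sequence drawn from a Markov chain whose transition matrix is sampled from a
# Dirichlet prior, and the in-context bigram estimate is the target a "statistical
# induction head" should recover. Vocabulary size, concentration, and length are
# illustrative choices, not the paper's exact settings.
import numpy as np

def sample_chain_sequence(vocab=5, length=256, alpha=1.0, rng=None):
    rng = rng or np.random.default_rng()
    P = rng.dirichlet(alpha * np.ones(vocab), size=vocab)   # one transition row per token
    seq = [rng.integers(vocab)]
    for _ in range(length - 1):
        seq.append(rng.choice(vocab, p=P[seq[-1]]))
    return np.array(seq), P

def in_context_bigram_estimate(seq, vocab=5):
    """Empirical next-token distribution given the last token, from bigrams seen so far."""
    counts = np.zeros((vocab, vocab))
    for a, b in zip(seq[:-1], seq[1:]):
        counts[a, b] += 1
    row = counts[seq[-1]]
    return row / row.sum() if row.sum() > 0 else np.full(vocab, 1.0 / vocab)

seq, P = sample_chain_sequence(rng=np.random.default_rng(0))
print(in_context_bigram_estimate(seq))   # approaches P[seq[-1]] as the context grows
print(P[seq[-1]])
```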