Linear Relational Decoding of Morphology in Language Models

Xia, Eric; Kalita, Jugal

Citation Details

This content will become publicly available on March 1, 2026

Linear Relational Decoding of Morphology in Language Models

A two-part affine approximation has been found to be a good approximation for trans- former computations over certain subject- object relations. Adapting the Bigger Analogy Test Set, we show that the linear transforma- tion W s, where s is a middle layer representa- tion of a subject token and W is derived from model derivatives, is also able to accurately re- produce final object states for many relations. This linear technique is able to achieve 90% faithfulness on morphological relations, and we show similar findings multi-lingually and across models. Our findings indicate that some conceptual relationships in language models, such as morphology, are readily interpretable from latent space, and are sparsely encoded by cross-layer linear transformations. more »

Award ID(s):: 2349452

PAR ID:: 10655817

Author(s) / Creator(s):: Xia, Eric; Kalita, Jugal

Publisher / Repository:: NAACL, aclanthology.org

Date Published:: 2025-03-01

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on March 1, 2026
Conference Paper:
The DOI is not currently available.

More Like this