SSL-SurvFormer: A Self-Supervised Learning and Continuously Monotonic Transformer Network for Missing Values in Survival Analysis

Le, Quang-Hung; Patel, Brijesh; Adjeroh, Donald; Doretto, Gianfranco (ORCID:0000000289216646); Le, Ngan

doi:10.3390/informatics12010032

Citation Details

This content will become publicly available on March 1, 2026

SSL-SurvFormer: A Self-Supervised Learning and Continuously Monotonic Transformer Network for Missing Values in Survival Analysis

Survival analysis is a crucial statistical technique used to estimate the anticipated duration until a specific event occurs. However, current methods often involve discretizing the time scale and struggle with managing absent features within the data. This becomes especially pertinent since events can transpire at any given point, rendering event analysis a continuous concern. Additionally, the presence of missing attributes within tabular data is widespread. By leveraging recent developments of Transformer and Self-Supervised Learning (SSL), we introduce SSL-SurvFormer. This entails a continuously monotonic Transformer network, empowered by SSL pre-training, that is designed to address the challenges presented by continuous events and absent features in survival prediction. Our proposed continuously monotonic Transformer model facilitates accurate estimation of survival probabilities, thereby bypassing the need for temporal discretization. Additionally, our SSL pre-training strategy incorporates data transformation to adeptly manage missing information. The SSL pre-training encompasses two tasks: mask prediction, which identifies positions of absent features, and reconstruction, which endeavors to recover absent elements based on observed ones. Our empirical evaluations conducted across a variety of datasets, including FLCHAIN, METABRIC, and SUPPORT, consistently highlight the superior performance of SSL-SurvFormer in comparison to existing methods. Additionally, SSL-SurvFormer demonstrates effectiveness in handling missing values, a critical aspect often encountered in real-world datasets. more »

Award ID(s):: 2223793

PAR ID:: 10639061

Author(s) / Creator(s):: Le, Quang-Hung; Patel, Brijesh; Adjeroh, Donald; Doretto, Gianfranco; Le, Ngan

Publisher / Repository:: MDPI

Date Published:: 2025-03-01

Journal Name:: Informatics

Volume:: 12

Issue:: 1

ISSN:: 2227-9709

Page Range / eLocation ID:: 32

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on March 1, 2026
Journal Article:
https://doi.org/10.3390/informatics12010032

More Like this