NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Improved allele-specific single-cell copy number estimation in low-coverage DNA-sequencing

https://doi.org/10.1093/bioinformatics/btae506

Weiner, Samson; Li, Bingjun; Nabavi, Sheida; Mathelier, ed., Anthony (August 2024, Bioinformatics)

Abstract MotivationAdvances in whole-genome single-cell DNA sequencing (scDNA-seq) have led to the development of numerous methods for detecting copy number aberrations (CNAs), a key driver of genetic heterogeneity in cancer. While most of these methods are limited to the inference of total copy number, some recent approaches now infer allele-specific CNAs using innovative techniques for estimating allele-frequencies in low coverage scDNA-seq data. However, these existing allele-specific methods are limited in their segmentation strategies, a crucial step in the CNA detection pipeline. ResultsWe present SEACON (Single-cell Estimation of Allele-specific COpy Numbers), an allele-specific copy number profiler for scDNA-seq data. SEACON uses a Gaussian Mixture Model to identify latent copy number states and breakpoints between contiguous segments across cells, filters the segments for high-quality breakpoints using an ensemble technique, and adopts several strategies for tolerating noisy read-depth and allele frequency measurements. Using a wide array of both real and simulated datasets, we show that SEACON derives accurate copy numbers and surpasses existing approaches under numerous experimental conditions, and identify its strengths and weaknesses. Availability and implementationSEACON is implemented in Python and is freely available open-source from https://github.com/NabaviLab/SEACON and https://doi.org/10.5281/zenodo.12727008.
more » « less
A multimodal graph neural network framework for cancer molecular subtype classification

https://doi.org/10.1186/s12859-023-05622-4

Li, Bingjun; Nabavi, Sheida (January 2024, BMC Bioinformatics)

Abstract BackgroundThe recent development of high-throughput sequencing has created a large collection of multi-omics data, which enables researchers to better investigate cancer molecular profiles and cancer taxonomy based on molecular subtypes. Integrating multi-omics data has been proven to be effective for building more precise classification models. Most current multi-omics integrative models use either an early fusion in the form of concatenation or late fusion with a separate feature extractor for each omic, which are mainly based on deep neural networks. Due to the nature of biological systems, graphs are a better structural representation of bio-medical data. Although few graph neural network (GNN) based multi-omics integrative methods have been proposed, they suffer from three common disadvantages. One is most of them use only one type of connection, either inter-omics or intra-omic connection; second, they only consider one kind of GNN layer, either graph convolution network (GCN) or graph attention network (GAT); and third, most of these methods have not been tested on a more complex classification task, such as cancer molecular subtypes. ResultsIn this study, we propose a novel end-to-end multi-omics GNN framework for accurate and robust cancer subtype classification. The proposed model utilizes multi-omics data in the form of heterogeneous multi-layer graphs, which combine both inter-omics and intra-omic connections from established biological knowledge. The proposed model incorporates learned graph features and global genome features for accurate classification. We tested the proposed model on the Cancer Genome Atlas (TCGA) Pan-cancer dataset and TCGA breast invasive carcinoma (BRCA) dataset for molecular subtype and cancer subtype classification, respectively. The proposed model shows superior performance compared to four current state-of-the-art baseline models in terms of accuracy, F1 score, precision, and recall. The comparative analysis of GAT-based models and GCN-based models reveals that GAT-based models are preferred for smaller graphs with less information and GCN-based models are preferred for larger graphs with extra information.
more » « less
Single-cell classification using graph convolutional networks

https://doi.org/10.1186/s12859-021-04278-2

Wang, Tianyu; Bai, Jun; Nabavi, Sheida (July 2021, BMC Bioinformatics)

Abstract BackgroundAnalyzing single-cell RNA sequencing (scRNAseq) data plays an important role in understanding the intrinsic and extrinsic cellular processes in biological and biomedical research. One significant effort in this area is the identification of cell types. With the availability of a huge amount of single cell sequencing data and discovering more and more cell types, classifying cells into known cell types has become a priority nowadays. Several methods have been introduced to classify cells utilizing gene expression data. However, incorporating biological gene interaction networks has been proved valuable in cell classification procedures. ResultsIn this study, we propose a multimodal end-to-end deep learning model, named sigGCN, for cell classification that combines a graph convolutional network (GCN) and a neural network to exploit gene interaction networks. We used standard classification metrics to evaluate the performance of the proposed method on the within-dataset classification and the cross-dataset classification. We compared the performance of the proposed method with those of the existing cell classification tools and traditional machine learning classification methods. ConclusionsResults indicate that the proposed method outperforms other commonly used methods in terms of classification accuracy and F1 scores. This study shows that the integration of prior knowledge about gene interactions with gene expressions using GCN methodologies can extract effective features improving the performance of cell classification.
more » « less
Multi-modal Spatial Clustering for Spatial Transcriptomics Utilizing High-resolution Histology Images

https://doi.org/10.1109/BIBM62325.2024.10822051

Li, Bingjun; Karami, Mostafa; Junayed, Masum Shah; Nabavi, Sheida (December 2024, IEEE)

Free, publicly-accessible full text available December 3, 2025
DCCNV: Enhanced CNV Detection in Single-Cell Sequencing Using Diffusion Process and Contrastive Learning

https://doi.org/10.1145/3698587.3701395

Karami, Mostafa; Li, Bingjun; Weiner, Samson; Hamzehei, Sahand; Nabavi, Sheida (November 2024, ACM)

Free, publicly-accessible full text available November 22, 2025
scGEMOC, A Graph Embedded Contrastive Learning Single-cell Multiomics Clustering Model

https://doi.org/10.1109/BIBM58861.2023.10385267

Li, Bingjun; Nabavi, Sheida (December 2023, IEEE)

Full Text Available
Contrastive Learning in Single-cell Multiomics Clustering

https://doi.org/10.1145/3584371.3613010

Li, Bingjun; Nabavi, Sheida (September 2023, ACM)

Full Text Available
Semi-supervised classification of disease prognosis using CR images with clinical data structured graph

https://doi.org/10.1145/3535508.3545548

Bai, Jun; Li, Bingjun; Nabavi, Sheida (August 2022, Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics)

Full Text Available
Single-cell RNA sequencing data clustering using graph convolutional networks

https://doi.org/10.1109/BIBM52615.2021.9669529

Wang, Tianyu; Li, Bingjun; Nabavi, Sheida (December 2021, Proceeding of 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM))

Full Text Available
Copy number variation detection using single cell sequencing data

https://doi.org/10.1145/3459930.3469556

Zare, Fatima; Stark, Jacob; Nabavi, Sheida (August 2021, Proceedings of the 12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics)

Full Text Available

« Prev Next »

Search for: All records