Search for: All records

Creators/Authors contains: "Park, Haesun"

« Prev Next »

Total Resources

8

Resource Type
Conference Paper

5

Conference Proceeding

0

Dataset

0

Journal Article

3

Workshop Report

0

Availability
Full Text / Resource Available

7

Citation Only

1

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Distributed-Memory Parallel JointNMF

https://doi.org/10.1145/3577193.3593733

Eswar, Srinivas ; Cobb, Benjamin ; Hayashi, Koby ; Kannan, Ramakrishnan ; Ballard, Grey ; Vuduc, Richard ; Park, Haesun ( June 2023 , Proceedings of the 37th International Conference on Supercomputing)

Joint Nonnegative Matrix Factorization (JointNMF) is a hybrid method for mining information from datasets that contain both feature and connection information. We propose distributed-memory parallelizations of three algorithms for solving the JointNMF problem based on Alternating Nonnegative Least Squares, Projected Gradient Descent, and Projected Gauss-Newton. We extend well-known communication-avoiding algorithms using a single processor grid case to our coupled case on two processor grids. We demonstrate the scalability of the algorithms on up to 960 cores (40 nodes) with 60\% parallel efficiency. The more sophisticated Alternating Nonnegative Least Squares (ANLS) and Gauss-Newton variants outperform the first-order gradient descent method in reducing the objective on large-scale problems. We perform a topic modelling task on a large corpus of academic papers that consists of over 37 million paper abstracts and nearly a billion citation relationships, demonstrating the utility and scalability of the methods.
more » « less
Free, publicly-accessible full text available June 21, 2024
Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means

https://doi.org/10.3389/fgene.2021.763263

Oh, Sooyoun ; Park, Haesun ; Zhang, Xiuwei ( November 2021 , Frontiers in Genetics)

Advances in single cell transcriptomics have allowed us to study the identity of single cells. This has led to the discovery of new cell types and high resolution tissue maps of them. Technologies that measure multiple modalities of such data add more detail, but they also complicate data integration. We offer an integrated analysis of the spatial location and gene expression profiles of cells to determine their identity. We propose scHybridNMF (single-cell Hybrid Nonnegative Matrix Factorization), which performs cell type identification by combining sparse nonnegative matrix factorization (sparse NMF) with k-means clustering to cluster high-dimensional gene expression and low-dimensional location data. We show that, under multiple scenarios, including the cases where there is a small number of genes profiled and the location data is noisy, scHybridNMF outperforms sparse NMF, k-means, and an existing method that uses a hidden Markov random field to encode cell location and gene expression data for cell type identification.
more » « less
Full Text Available
PLANC: Parallel Low-rank Approximation with Nonnegativity Constraints

https://doi.org/10.1145/3432185

Eswar, Srinivas ; Hayashi, Koby ; Ballard, Grey ; Kannan, Ramakrishnan ; Matheson, Michael A. ; Park, Haesun ( June 2021 , ACM Transactions on Mathematical Software)
null (Ed.)
We consider the problem of low-rank approximation of massive dense nonnegative tensor data, for example, to discover latent patterns in video and imaging applications. As the size of data sets grows, single workstations are hitting bottlenecks in both computation time and available memory. We propose a distributed-memory parallel computing solution to handle massive data sets, loading the input data across the memories of multiple nodes, and performing efficient and scalable parallel algorithms to compute the low-rank approximation. We present a software package called Parallel Low-rank Approximation with Nonnegativity Constraints, which implements our solution and allows for extension in terms of data (dense or sparse, matrices or tensors of any order), algorithm (e.g., from multiplicative updating techniques to alternating direction method of multipliers), and architecture (we exploit GPUs to accelerate the computation in this work). We describe our parallel distributions and algorithms, which are careful to avoid unnecessary communication and computation, show how to extend the software to include new algorithms and/or constraints, and report efficiency and scalability results for both synthetic and real-world data sets.
more » « less
Full Text Available
SWIFT: Scalable Wasserstein Factorization for Sparse Nonnegative Tensors

Afshar, Ardavan ; Yin, Kejing ; Yan, Sherry ; Qian, Cheng ; Ho, Joyce ; Park, Haesun ; Sun, Jimeng ( May 2021 , Proceedings of the AAAI Conference on Artificial Intelligence)
null (Ed.)
Full Text Available
Parallel Hierarchical Clustering using Rank-Two Nonnegative Matrix Factorization

https://doi.org/10.1109/HiPC50609.2020.00028

Manning, Lawton ; Ballard, Grey ; Kannan, Ramakrishnan ; Park, Haesun ( December 2020 , 2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC))
null (Ed.)
Full Text Available
Distributed-Memory Parallel Symmetric Nonnegative Matrix Factorization

https://doi.org/10.1109/SC41405.2020.00078

Eswar, Srinivas ; Hayashi, Koby ; Ballard, Grey ; Kannan, Ramakrishnan ; Vuduc, Richard ; Park, Haesun ( November 2020 , SC20: International Conference for High Performance Computing, Networking, Storage and Analysis)
null (Ed.)
Full Text Available
TASTE: temporal and static tensor factorization for phenotyping electronic health records

https://doi.org/10.1145/3368555.3384464

Afshar, Ardavan ; Perros, Ioakeim ; Park, Haesun ; deFilippi, Christopher ; Yan, Xiaowei ; Stewart, Walter ; Ho, Joyce ; Sun, Jimeng ( April 2020 , Proceedings of the ACM Conference on Health, Inference, and Learning)

Full Text Available
MPI-FAUN: An MPI-Based Framework for Alternating-Updating Nonnegative Matrix Factorization

https://doi.org/10.1109/TKDE.2017.2767592

Kannan, Ramakrishnan ; Ballard, Grey ; Park, Haesun ( March 2018 , IEEE Transactions on Knowledge and Data Engineering)

Full Text Available