NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Do We Really Need Complicated Graph Learning Models? -- A Simple but Effective Baseline

Sancak, Kaan; Balın, MFatih; Çatalyürek, Ümit V (November 2024, The Third Learning on Graphs Conference)

Full Text Available
Do We Really Need Complicated Graph Learning Models? -- A Simple but Effective Baseline

Sancak, K; Balin, MF; Çatalyürek, ÜV (October 2024, The Third Learning on Graphs Conference)

Full Text Available
More Recent Advances in (Hyper)Graph Partitioning

https://doi.org/10.1145/3571808

Çatalyürek, Ümit; Devine, Karen; Faraj, Marcelo; Gottesbüren, Lars; Heuer, Tobias; Meyerhenke, Henning; Sanders, Peter; Schlag, Sebastian; Schulz, Christian; Seemaier, Daniel; et al (December 2023, ACM Computing Surveys)

In recent years, significant advances have been made in the design and evaluation of balanced (hyper)graph partitioning algorithms. We survey trends of the past decade in practical algorithms for balanced (hyper)graph partitioning together with future research directions. Our work serves as an update to a previous survey on the topic [29]. In particular, the survey extends the previous survey by also covering hypergraph partitioning and has an additional focus on parallel algorithms.
more » « less
Full Text Available
A Portable Sparse Solver Framework for Large Matrices on Heterogeneous Architectures

https://doi.org/10.1109/HiPC56025.2022.00030

Rabbi, Fazlay; Daley, Christopher S.; Çatalyürek, Ümit V.; Aktulga, Hasan Metin (December 2022, International Conference on High Performance Computing, Data, & Analytics)

Full Text Available
IMpart: A Partitioning-based Parallel Approach to Accelerate Influence Maximization

https://doi.org/10.1109/HiPC56025.2022.00028

Barik, Reet; Minutoli, Marco; Halappanavar, Mahantesh; Kalyanaraman, Ananth (December 2022, Proceedings of the International Conference on High Performance Computing, Data, and Analytics (HiPC))

Full Text Available
BOA: A partitioned view of genome assembly

https://doi.org/10.1016/j.isci.2022.105273

An, Xiaojing; Ghosh, Priyanka; Keppler, Patrick; Kurt, Sureyya Emre; Krishnamoorthy, Sriram; Sadayappan, Ponnuswamy; Rajam, Aravind Sukumaran; Çatalyürek, Ümit V.; Kalyanaraman, Ananth (November 2022, iScience)

Full Text Available
Accelerating Graph Computations on 3D NoC-enabled PIM Architectures

https://doi.org/10.1145/3564290

Choudhury, Dwaipayan; Xiang, Lizhi; Rajam, Aravind Sukumaran; Kalyanaraman, Ananth; Pande, Partha Pratim (October 2022, ACM Transactions on Design Automation of Electronic Systems)

Graph application workloads are dominated by random memory accesses with poor locality. To tackle the irregular and sparse nature of computation, ReRAM-based Processing-in-Memory (PIM) architectures have been proposed recently. Most of these ReRAM architecture designs have focused on mapping graph computations into a set of multiply-and-accumulate (MAC) operations. ReRAMs also offer a key advantage in reducing memory latency between cores and memory by allowing for processing-in-memory (PIM). However, when implemented on a ReRAM-based manycore architecture, graph applications still pose two key challenges – significant storage requirements (particularly due to wasted zero cell storage), and significant amount of on-chip traffic. To tackle these two challenges, in this paper we propose the design of a 3D NoC-enabled ReRAM-based manycore architecture. Our proposed architecture incorporates a novel crossbar-aware node reordering to reduce ReRAM storage requirements. Secondly, its 3D NoC-enabled design reduces on-chip communication latency. Our architecture outperforms the state-of-the-art in ReRAM-based graph acceleration by up to 5x in performance while consuming up to 10.3x less energy for a range of graph inputs and workloads.
more » « less
Full Text Available
Efficient Hierarchical State Vector Simulation of Quantum Circuits via Acyclic Graph Partitioning

https://doi.org/10.1109/CLUSTER51413.2022.00041

Fang, Bo; Ozkaya, M. Yusuf; Li, Ang; Catalyurek, Umit V.; Krishnamoorthy, Sriram (September 2022, International Conference on Cluster Computing)

Full Text Available
MG-GCN: A Scalable multi-GPU GCN Training Framework

https://doi.org/10.1145/3545008.3545082

Balin, Muhammed Fatih; Sancak, Kaan; Catalyurek, Umit V. (August 2022, Proceedings of the 51st International Conference on Parallel Processing (ICPP))

Full Text Available
Towards scaling community detection on distributed-memory heterogeneous systems

https://doi.org/10.1016/j.parco.2022.102898

Gawande, Nitin; Ghosh, Sayan; Halappanavar, Mahantesh; Tumeo, Antonino; Kalyanaraman, Ananth (July 2022, Parallel Computing)

Full Text Available

« Prev Next »

Search for: All records