NSF PAR Search | NSF Public Access Repository

HD2FPGA: Automated Framework for Accelerating Hyperdimensional Computing on FPGAs

https://doi.org/10.1109/ISQED57927.2023.10129332

Zhang, Tianqi; Salamat, Sahand; Khaleghi, Behnam; Morris, Justin; Aksanli, Baris; Rosing, Tajana Simunic (April 2023, 2023 24th International Symposium on Quality Electronic Design (ISQED))

Full Text Available
NASCENT2: Generic Near-Storage Sort Accelerator for Data Analytics on SmartSSD

https://doi.org/10.1145/3472769

Salamat, Sahand; Zhang, Hui; Ki, Yang Seok; Rosing, Tajana (June 2022, ACM Transactions on Reconfigurable Technology and Systems)

As the size of data generated every day grows dramatically, the computational bottleneck of computer systems has shifted toward storage devices. The interface between the storage and the computational platforms has become the main limitation due to its limited bandwidth, which does not scale when the number of storage devices increases. Interconnect networks do not provide simultaneous access to all storage devices and thus limit the performance of the system when executing independent operations on different storage devices. Offloading the computations to the storage devices eliminates the burden of data transfer from the interconnects. Near-storage computing offloads a portion of computations to the storage devices to accelerate big data applications. In this article, we propose a generic near-storage sort accelerator for data analytics, NASCENT2, which utilizes Samsung SmartSSD, an NVMe flash drive with an on-board FPGA chip that processes data in situ. NASCENT2 consists of dictionary decoder, sort, and shuffle FPGA-based accelerators to support sorting database tables based on a key column of any data type. It exploits the data partitioning applied by data processing management systems, such as SparkSQL, to break down the sort operations on colossal tables into multiple sort operations on smaller tables. NASCENT2 generic sort provides a 2× speedup and a 15.2× energy efficiency improvement compared to the CPU baseline. It also considers the specifications of the SmartSSD (e.g., the FPGA resources, interconnect network, and solid-state drive bandwidth) to increase the scalability of computer systems as the number of storage devices increases. With 12 SmartSSDs, NASCENT2 is 9.9× (137.2×) faster and 7.3× (119.2×) more energy efficient in sorting the largest tables of TPCC and TPCH benchmarks than the FPGA (CPU) baseline.
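The decode, sort, and shuffle stages named in the abstract can be illustrated in software. The following is only a minimal sketch under stated assumptions, not the NASCENT2 design: the column-oriented table layout, the function names `dictionary_decode` and `sort_partition`, and the sample data are all hypothetical, and Python's built-in sort stands in for the FPGA sort kernel.

```python
def dictionary_decode(encoded_keys, dictionary):
    """Map dictionary-encoded integer codes back to their original values."""
    return [dictionary[code] for code in encoded_keys]


def sort_partition(table, key_column, dictionary=None):
    """Sort one partition (a dict of column name -> list) by its key column."""
    keys = table[key_column]
    if dictionary is not None:
        keys = dictionary_decode(keys, dictionary)  # decode stage
    # Sort stage: compute the row order implied by the key column.
    order = sorted(range(len(keys)), key=keys.__getitem__)
    # Shuffle stage: rearrange every column to follow that row order.
    result = {name: [col[i] for i in order] for name, col in table.items()}
    result[key_column] = [keys[i] for i in order]
    return result


# Hypothetical partitions of one large table; each is sorted independently.
dictionary = ["apple", "banana", "cherry"]
partitions = [
    {"key": [2, 0, 1], "qty": [10, 20, 30]},
    {"key": [1, 1, 0], "qty": [5, 6, 7]},
]
sorted_partitions = [sort_partition(p, "key", dictionary) for p in partitions]
```

Because each partition is sorted on its own, the per-partition calls are independent of one another, which is the property that lets the work be spread across several storage devices.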
Full Text Available
Store-n-Learn: Classification and Clustering with Hyperdimensional Computing across Flash Hierarchy

https://doi.org/10.1145/3503541

Gupta, Saransh; Khaleghi, Behnam; Salamat, Sahand; Morris, Justin; Ramkumar, Ranganathan; Yu, Jeffrey; Tiwari, Aniket; Kang, Jaeyoung; Imani, Mohsen; Aksanli, Baris; et al (May 2022, ACM Transactions on Embedded Computing Systems)

Processing large amounts of data, especially in learning algorithms, poses a challenge for current embedded computing systems. Hyperdimensional (HD) computing (HDC) is a brain-inspired computing paradigm that works with high-dimensional vectors called hypervectors. HDC replaces several complex learning computations with bitwise and simpler arithmetic operations at the expense of an increased amount of data due to mapping the data into high-dimensional space. These hypervectors, more often than not, cannot be stored in memory, resulting in long data transfers from storage. In this article, we propose Store-n-Learn, an in-storage computing solution that performs HDC classification and clustering by implementing encoding, training, retraining, and inference across the flash hierarchy. To hide the latency of training and enable efficient computation, we introduce the concept of batching in HDC. We also present on-chip acceleration for HDC encoding in flash planes. This enables us to exploit the high parallelism provided by the flash hierarchy and encode multiple data points in parallel in both batched and non-batched fashion. Store-n-Learn also implements a single top-level FPGA accelerator with novel implementations for HDC classification training, retraining, inference, and clustering on the encoded data. Our evaluation over 10 popular datasets shows that Store-n-Learn is on average 222× (543×) faster than CPU and 10.6× (7.3×) faster than the state-of-the-art in-storage computing solution, INSIDER, for HDC classification (clustering).
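To make the encoding, training, and inference steps mentioned in the abstract concrete, here is a minimal software sketch of HDC classification. It is not the Store-n-Learn implementation: the ID-level encoding scheme, the dimensionality `DIM`, the bipolar hypervectors, cosine similarity, and the toy dataset are all assumptions chosen for illustration, and retraining, batching, and clustering are left out.

```python
import math
import random

DIM = 2048  # hypervector dimensionality; an assumed value for this sketch


def random_hv(rng):
    """Return a random bipolar hypervector with entries in {-1, +1}."""
    return [rng.choice((-1, 1)) for _ in range(DIM)]


def encode(features, id_hvs, level_hvs):
    """Bind each position HV to its level HV, then bundle by summing."""
    acc = [0] * DIM
    for pos, level in enumerate(features):
        pos_hv, lvl_hv = id_hvs[pos], level_hvs[level]
        for d in range(DIM):
            acc[d] += pos_hv[d] * lvl_hv[d]  # elementwise product = binding
    return acc


def train(samples, labels, id_hvs, level_hvs):
    """Single-pass training: add each encoded sample into its class HV."""
    class_hvs = {}
    for features, label in zip(samples, labels):
        hv = encode(features, id_hvs, level_hvs)
        acc = class_hvs.setdefault(label, [0] * DIM)
        for d in range(DIM):
            acc[d] += hv[d]
    return class_hvs


def cosine(a, b):
    """Cosine similarity between two integer vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def classify(features, class_hvs, id_hvs, level_hvs):
    """Inference: return the label whose class HV is most similar."""
    query = encode(features, id_hvs, level_hvs)
    return max(class_hvs, key=lambda label: cosine(query, class_hvs[label]))


# Toy data (hypothetical): four features, each quantized to levels 0..3.
rng = random.Random(0)
id_hvs = [random_hv(rng) for _ in range(4)]
level_hvs = [random_hv(rng) for _ in range(4)]
samples = [[0, 0, 0, 0], [0, 1, 0, 0], [3, 3, 3, 3], [3, 2, 3, 3]]
labels = ["low", "low", "high", "high"]
model = train(samples, labels, id_hvs, level_hvs)
```

Every operation here is an elementwise multiply, add, or dot product over `DIM` entries. That reflects the trade-off the abstract describes: the arithmetic is simple, but each data point expands into a much larger vector.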
Full Text Available
FPGA Acceleration of Pairwise Distance Calculation for Viral Transmission Clustering

Salamat, Sahand; Moshiri, Niema; Rosing, Tajana (January 2021, IEEE Biomedical Circuits and Systems Conference)

Full Text Available
Accelerating Hyperdimensional Computing on FPGAs by Exploiting Computational Reuse

https://doi.org/10.1109/TC.2020.2992662

Salamat, Sahand; Imani, Mohsen; Rosing, Tajana (August 2020, IEEE Transactions on Computers)
Full Text Available
FPGA Acceleration of Protein Back-Translation and Alignment

https://doi.org/10.23919/DATE51398.2021.9474103

Salamat, Sahand; Kang, Jaeyoung; Kim, Yeseong; Imani, Mohsen; Moshiri, Niema; Rosing, Tajana (February 2021, 2021 Design, Automation & Test in Europe Conference & Exhibition (DATE))

Full Text Available
NASCENT: Near-Storage Acceleration of Database Sort on SmartSSD

https://doi.org/10.1145/3431920.3439298

Salamat, Sahand; Haj Aboutalebi, Armin; Khaleghi, Behnam; Lee, Joo Hwan; Ki, Yang Seok; Rosing, Tajana (February 2021, 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays)

Full Text Available
Revisiting HyperDimensional Learning for FPGA and Low-Power Architectures

https://doi.org/10.1109/HPCA51647.2021.00028

Imani, Mohsen; Zou, Zhuowen; Bosch, Samuel; Rao, Sanjay Anantha; Salamat, Sahand; Kumar, Venkatesh; Kim, Yeseong; Rosing, Tajana (February 2021, 27th IEEE International Symposium on High-Performance Computer Architecture)

Full Text Available
F5-HD: Fast Flexible FPGA-based Framework for Refreshing Hyperdimensional Computing

https://doi.org/10.1145/3289602.3293913

Salamat, Sahand; Imani, Mohsen; Khaleghi, Behnam; Rosing, Tajana (January 2019, ACM/SIGDA International Symposium on Field-Programmable Gate Arrays)

Full Text Available
RNSnet: In-Memory Neural Network Acceleration Using Residue Number System

https://doi.org/10.1109/ICRC.2018.8638592

Salamat, Sahand; Imani, Mohsen; Gupta, Saransh; Rosing, Tajana (November 2018, IEEE International Conference on Rebooting Computing)

Full Text Available
