Search for: All records

Creators/Authors contains: "Cabrera, Anthony M"

« Prev Next »

Total Resources

9

Resource Type
Conference Paper

9

Conference Proceeding

0

Dataset

0

Journal Article

0

Workshop Report

0

Availability
Full Text / Resource Available

9

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Challenges Designing for FPGAs Using High-Level Synthesis

https://doi.org/10.1109/HPEC55821.2022.9926398

Faber, Clayton J. ; Harris, Steven D. ; Xiac, Zhili ; Chamberlain, Roger D. ; Cabrera, Anthony M. ( September 2022 , Proc. of IEEE High-Performance Extreme Computing Conference)

Full Text Available
HLS Portability from Intel to Xilinx: A Case Study

https://doi.org/10.1109/HPEC49654.2021.9622785

Xiao, Zhili ; Chamberlain, Roger D. ; Cabrera, Anthony M. ( September 2021 , Proc. of IEEE High-Performance Extreme Computing Conference (HPEC))

Full Text Available
Platform Agnostic Streaming Data Application Performance Models

https://doi.org/10.1109/RSDHA54838.2021.00008

Faber, Clayton J. ; Plano, Tom ; Kodali, Samatha ; Xiao, Zhili ; Dwaraki, Abhishek ; Buhler, Jeremy D. ; Chamberlain, Roger D. ; Cabrera, Anthony M. ( November 2021 , Proc. of IEEE/ACM Workshop on Redefining Scalability for Diversely Heterogeneous Architectures (RSDHA))

Full Text Available
Design and Performance Evaluation of Optimizations for OpenCL FPGA Kernels

https://doi.org/10.1109/HPEC43674.2020.9286221

Cabrera, Anthony M. ; Chamberlain, Roger D. ( September 2020 , Proc. of IEEE High-Performance Extreme Computing Conference)
null (Ed.)
Full Text Available
Designing Domain Specific Computing Systems

https://doi.org/10.1109/FCCM48280.2020.00052

Cabrera, Anthony M. ; Chamberlain, Roger D. ( May 2020 , IEEE 28th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM))

Domain specific computing is an idea that has been pro-posed as a path forward given the slowing of Moore’s Law and the breakdown of Dennard scaling. Two fundamental questions include: (1) how does one define a domain; and (2) how does one go about architecting hardware that performs well for that domain? We present our preliminary work towards answering these questions.
more » « less
Full Text Available
Toward Evaluating High-Level Synthesis Portability and Performance between Intel and Xilinx FPGAs

https://doi.org/10.1145/3456669.3456699

Cabrera, Anthony M ; Young, Aaron R ; Lambert, Jacob ; Xiao, Zhili ; An, Amy ; Lee, Seyong ; Jin, Zheming ; Kim, Jungwon ; Buhler, Jeremy ; Chamberlain, Roger D ; et al ( April 2021 , Proc. of 9th International Workshop on OpenCL)
null (Ed.)
Full Text Available
Multi-spectral Reuse Distance: Divining Spatial Information from Temporal Data

https://doi.org/10.1109/HPEC.2019.8916398

Cabrera, Anthony M. ; Chamberlain, Roger D. ; Beard, Jonathan C. ( September 2019 , Proceedings of IEEE High-Performance Extreme Computing Conference (HPEC))

The problem of efficiently feeding processing elements and finding ways to reduce data movement is pervasive in computing. Efficient modeling of both temporal and spatial locality of memory references is invaluable in identifying superfluous data movement in a given application. To this end, we present a new way to infer both spatial and temporal locality using reuse distance analysis. This is accomplished by performing reuse distance analysis at different data block granularities: specifically, 64B, 4KiB, and 2MiB sizes. This process of simultaneously observing reuse distance with multiple granularities is called multi-spectral reuse distance. This approach allows for a qualitative analysis of spatial locality, through observing the shifting of mass in an application's reuse signature at different granularities. Furthermore, the shift of mass is empirically measured by calculating the Earth Mover's Distance between reuse signatures of an application. From the characterization, it is possible to determine how spatially dense the memory references of an application are based on the degree to which the mass has shifted (or not shifted) and how close (or far) the Earth Mover's Distance is to zero as the data block granularity is increased. It is also possible to determine an appropriate page size from this information, and whether or not a given page is being fully utilized. From the applications profiled, it is observed that not all applications will benefit from having a larger page size. Additionally, larger data block granularities subsuming smaller ones suggest that larger pages will allow for more spatial locality exploitation, but examining the memory footprint will show whether those larger pages are fully utilized or not.
more » « less
Full Text Available
Exploring Portability and Performance of OpenCL FPGA Kernels on Intel HARPv2

https://doi.org/10.1145/3318170.3318180

Cabrera, Anthony M. ; Chamberlain, Roger D. ( May 2019 , Proc. of 7th International Workshop on OpenCL)

FPGAs offer a heterogenous compute solution to the continuous de- sire for increased performance by enabling the creation of application- specific hardware that accelerates computation. While the barrier to entry has historically been steep, advances in High Level Synthe- sis (HLS) are making FPGAs more accessible. Specifically, the Intel FPGA OpenCL SDK allows software designers to abstract away low level details of architecting hardware on an FPGA and allows them to author computational kernels in a higher level language. Furthermore, Intel has developed a system that incorporates both a multicore Xeon CPU and Arria 10 FPGA into the same chip package as part of the Heterogeneous Accelerator Research Program (HARP) that can be targeted by their SDK. In this work, we target the second iteration of the HARP platform (HARPv2) using HLS through porting of OpenCL kernels originally written for FPGAs connected via a PCIe bus. We evaluate the HARPv2 system’s performance against previously reported results, explore the portability of kernels through a hardware design space search, and empirically show the benefits of using the shared virtual memory (SVM) abstraction over explicit reads and writes.
more » « less
Full Text Available
Data Integration Tasks on Heterogeneous Systems Using OpenCL

https://doi.org/10.1145/3318170.3318187

Faber, Clayton J. ; Cabrera, Anthony M. ; Booker, Orondé ; Maayan, Gabe ; Chamberlain, Roger D. ( May 2019 , Proc. of 7th International Workshop on OpenCL)

In the era of big data, many new algorithms are developed to try and find the most efficient way to perform computations with massive amounts of data. However, what is often overlooked is the preprocessing step for many of these applications. The Data Integration Benchmark Suite (DIBS) was designed to understand the characteristics of dataset transformations in a hardware agnostic way. While on the surface these applications have a high amount of data parallelism, there are caveats in their specification that can potentially affect this characteristic. Even still, OpenCL can be an effective deployment environment for these applications. In this work we take a subset of the data transformations from each category presented in DIBS and implement them in OpenCL to evaluate their performance for heterogeneous systems. For targeting heterogeneous systems, we take a common application and attempt to deploy it to three platforms targetable by OpenCL (CPU, GPU, and FPGA). The applications are evaluated by their average transformation data rate. We illustrate the advantages of each compute device in the data integration space along with different communications schemes allowed for host/device communication in the OpenCL platform.
more » « less
Full Text Available