<?xml version="1.0" encoding="UTF-8"?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcq="http://purl.org/dc/terms/"><records count="1" morepages="false" start="1" end="1"><record rownumber="1"><dc:product_type>Journal Article</dc:product_type><dc:title>TAPA: A Scalable Task-parallel Dataflow Programming Framework for Modern FPGAs with Co-optimization of HLS and Physical Design</dc:title><dc:creator>Guo, Licheng; Chi, Yuze; Lau, Jason; Song, Linghao; Tian, Xingyu; Khatti, Moazin; Qiao, Weikang; Wang, Jie; Ustun, Ecenur; Fang, Zhenman; Zhang, Zhiru; Cong, Jason</dc:creator><dc:corporate_author/><dc:editor/><dc:description>&lt;p&gt;In this article, we propose TAPA, an end-to-end framework that compiles a C++ task-parallel dataflow program into a high-frequency FPGA accelerator. Compared to existing solutions, TAPA has two major advantages. First, TAPA provides a set of convenient APIs that allows users to easily express flexible and complex inter-task communication structures. Second, TAPA adopts a coarse-grained floorplanning step during HLS compilation for accurate pipelining of potential critical paths. In addition, TAPA implements several optimization techniques specifically tailored for modern HBM-based FPGAs. In our experiments with a total of 43 designs, we improve the average frequency from 147 MHz to 297 MHz (a 102% improvement) with no loss of throughput and a negligible change in resource utilization. Notably, in 16 experiments, we make the originally unroutable designs achieve 274 MHz, on average. The framework is available at&lt;ext-link ext-link-type='url' href='https://github.com/UCLA-VAST/tapa'&gt;https://github.com/UCLA-VAST/tapa&lt;/ext-link&gt;and the core floorplan module is available at&lt;ext-link ext-link-type='url' href='https://github.com/UCLA-VAST/AutoBridge'&gt;https://github.com/UCLA-VAST/AutoBridge&lt;/ext-link&gt;&lt;/p&gt;</dc:description><dc:publisher>ACM Transactions on Reconfigurable Technology and Systems</dc:publisher><dc:date>2023-12-31</dc:date><dc:nsf_par_id>10550474</dc:nsf_par_id><dc:journal_name>ACM Transactions on Reconfigurable Technology and Systems</dc:journal_name><dc:journal_volume>16</dc:journal_volume><dc:journal_issue>4</dc:journal_issue><dc:page_range_or_elocation>1 to 31</dc:page_range_or_elocation><dc:issn>1936-7406</dc:issn><dc:isbn/><dc:doi>https://doi.org/10.1145/3609335</dc:doi><dcq:identifierAwardId>1937599</dcq:identifierAwardId><dc:subject/><dc:version_number/><dc:location/><dc:rights/><dc:institution/><dc:sponsoring_org>National Science Foundation</dc:sponsoring_org></record></records></rdf:RDF>