<?xml version="1.0" encoding="UTF-8"?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcq="http://purl.org/dc/terms/"><records count="1" morepages="false" start="1" end="1"><record rownumber="1"><dc:product_type>Journal Article</dc:product_type><dc:title>SCALES: SCALable and Area-Efficient Systolic Accelerator for Ternary Polynomial Multiplication</dc:title><dc:creator>Coulon, Samuel; Bao, Tianyou; Xie, Jiafeng</dc:creator><dc:corporate_author/><dc:editor/><dc:description>Polynomial multiplication is a key component in many post-quantum cryptography and homomorphic encryption schemes. One recurring variation, ternary polynomial multiplication over ring Zq/(xn+1) where one input polynomial has ternary coefficients {−1,0,1} and the other has large integer coefficients {0, q−1}, has recently drawn significant attention from various communities. Following this trend, this paper presents a novel SCALable and area-Efficient Systolic (SCALES) accelerator for ternary polynomial multiplication. In total, we have carried out three layers of coherent interdependent efforts. First, we have rigorously derived a novel block-processing strategy and algorithm based on the schoolbook method for polynomial multiplication. Then, we have innovatively implemented the proposed algorithm as the SCALES accelerator with the help of a number of field-programmable gate array (FPGA)-oriented optimization techniques. Lastly, we have conducted a thorough implementation analysis to showcase the efficiency of the proposed accelerator. The comparison demonstrated that the SCALES accelerator has at least 19.0% and 23.8% less equivalent area-time product (eATP) than the state-of-the-art designs. We hope this work can stimulate continued research in the field.</dc:description><dc:publisher>IEEE</dc:publisher><dc:date>2024-07-01</dc:date><dc:nsf_par_id>10568341</dc:nsf_par_id><dc:journal_name>IEEE Computer Architecture Letters</dc:journal_name><dc:journal_volume>23</dc:journal_volume><dc:journal_issue>2</dc:journal_issue><dc:page_range_or_elocation>243 to 246</dc:page_range_or_elocation><dc:issn>1556-6056</dc:issn><dc:isbn/><dc:doi>https://doi.org/10.1109/LCA.2024.3505872</dc:doi><dcq:identifierAwardId>2020625</dcq:identifierAwardId><dc:subject>Area-efficient</dc:subject><dc:subject>block-processing</dc:subject><dc:subject>FPGA</dc:subject><dc:subject>scalable</dc:subject><dc:subject>systolic hardware accelerator</dc:subject><dc:subject>ternary polynomial multiplication.</dc:subject><dc:version_number/><dc:location/><dc:rights/><dc:institution/><dc:sponsoring_org>National Science Foundation</dc:sponsoring_org></record></records></rdf:RDF>