NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Polynomial-Time Program Equivalence for Machine Knitting

https://doi.org/10.1145/3747517

Hurtig, Nathan; Lin, Jenny Han; Price, Thomas S; Schulz, Adriana; McCann, James; Bernstein, Gilbert Louis (August 2025, Proceedings of the ACM on Programming Languages)

We present an algorithm that canonicalizes the algebraic representations of the topological semantics of machine knitting programs. Machine knitting is a staple technology of modern textile production where hundreds of mechanical needles are manipulated to form yarn into interlocking loop structures. Our semantics are defined using a variant of a monoidal category, and they closely correspond to string diagrams. We formulate our canonicalization as an Abstract Rewriting System (ARS) over words in our category, and prove that our algorithm is correct and runs in polynomial time.
more » « less
Free, publicly-accessible full text available August 5, 2026
Exo 2: Growing a Scheduling Language

https://doi.org/10.1145/3669940.3707218

Ikarashi, Yuka; Qian, Kevin; Droubi, Samir; Reinking, Alex; Bernstein, Gilbert Louis; Ragan-Kelley, Jonathan (March 2025, ACM)

Free, publicly-accessible full text available March 30, 2026
High-level Programming for Application Networks

Zhu, Xiangfeng; Wang, Yuyao; Liu, Banruo; Wu, Yongtong; Bojanic, Nikola; Chen, Jingrong; Bernstein, Gilbert; Krishnamurthy, Arvind; Kumar, Sam; Mahajan, Ratul; et al (April 2025, USENIX)

Application networks facilitate communication between the microservices of cloud applications. They are built today using service meshes with low-level specifications that make it difficult to express application-specific functionality (e.g., access control based on RPC fields), and they can more than double the RPC latency. We develop AppNet, a framework that makes it easy to build expressive and high-performance application networks. Developers specify rich RPC processing in a high-level language with generalized match-action rules and built-in state management. We compile the specifications to high-performance code after optimizing where (e.g., client, server) and how (e.g., RPC library, proxy) each RPC processing element runs. The optimization uses symbolic abstraction and execution to judge if different runtime configurations of possibly-stateful RPC processing elements are semantically equivalent for arbitrary RPC streams. Our experiments show that AppNet can express common application network function in only 7-28 lines of code. Its optimizations lower RPC processing latency by up to 82%.
more » « less
Free, publicly-accessible full text available April 28, 2026
High-level Programming for Application Networks.

Zhu, Xiangfeng Zhu; Wang, Yuyao; Liu, Banruo; Wu, Yongtong Wu; Bojanic, Nikola; Chen, Jingrong; Bernstein, Gilbert L; Krishnamurthy, Arvind; Kumar, Sam; Mahajan, Ratul; et al (April 2025, USENIX NSDI)

Free, publicly-accessible full text available April 28, 2026
Understanding and Supporting Debugging Workflows in CAD

https://doi.org/10.1145/3654777.3676353

Hähnlein, Felix; Bernstein, Gilbert; Schulz, Adriana (October 2024, ACM)

Full Text Available
A Verified Compiler for a Functional Tensor Language

https://doi.org/10.1145/3656390

Liu, Amanda; Bernstein, Gilbert; Chlipala, Adam; Ragan-Kelley, Jonathan (June 2024, Proceedings of the ACM on Programming Languages)

Producing efficient array code is crucial in high-performance domains like image processing and machine learning. It requires the ability to control factors like compute intensity and locality by reordering computations into different stages and granularities with respect to where they are stored. However, traditional pure, functional tensor languages struggle to do so. In a previous publication, we introduced ATL as a pure, functional tensor language capable of systematically decoupling compute and storage order via a set of high-level combinators known as reshape operators. Reshape operators are a unique functional-programming construct since they manipulate storage location in the generated code by modifying the indices that appear on the left-hand sides of storage expressions. We present a formal correctness proof for an implementation of the compilation algorithm, marking the first verification of a lowering algorithm targeting imperative loop nests from a source functional language that enables separate control of compute and storage ordering. One of the core difficulties of this proof required properly formulating the complex invariants to ensure that these storage-index remappings were well-formed. Notably, this exercise revealed asoundness bugin the original published compilation algorithm regarding the truncation reshape operators. Our fix is a new type system that captures safety conditions that were previously implicit and enables us to prove compiler correctness for well-typed source programs. We evaluate this type system and compiler implementation on a range of common programs and optimizations, including but not limited to those previously studied to demonstrate performance comparable to established compilers like Halide.
more » « less
FPGA Technology Mapping Using Sketch-Guided Program Synthesis

https://doi.org/10.1145/3620665.3640387

Smith, Gus Henry; Kushigian, Benjamin; Canumalla, Vishal; Cheung, Andrew; Lyubomirsky, Steven; Porncharoenwase, Sorawee; Just, René; Bernstein, Gilbert Louis; Tatlock, Zachary (April 2024, ACM)

Full Text Available
Distributions for Compositionally Differentiating Parametric Discontinuities

https://doi.org/10.1145/3649843

Michel, Jesse; Mu, Kevin; Yang, Xuanda; Bangaru, Sai Praveen; Collins, Elias Rojas; Bernstein, Gilbert; Ragan-Kelley, Jonathan; Carbin, Michael; Li, Tzu-Mao (April 2024, Proceedings of the ACM on Programming Languages)

Computations in physical simulation, computer graphics, and probabilistic inference often require the differentiation of discontinuous processes due to contact, occlusion, and changes at a point in time. Popular differentiable programming languages, such as PyTorch and JAX, ignore discontinuities during differentiation. This is incorrect forparametric discontinuities—conditionals containing at least one real-valued parameter and at least one variable of integration. We introduce Potto, the first differentiable first-order programming language to soundly differentiate parametric discontinuities. We present a denotational semantics for programs and program derivatives and show the two accord. We describe the implementation of Potto, which enables separate compilation of programs. Our prototype implementation overcomes previous compile-time bottlenecks achieving an 88.1x and 441.2x speed up in compile time and a 2.5x and 7.9x speed up in runtime, respectively, on two increasingly large image stylization benchmarks. We showcase Potto by implementing a prototype differentiable renderer with separately compiled shaders.
more » « less
Full Text Available
SLANG.D: Fast, Modular and Differentiable Shader Programming

Bangaru, Sai; Wu, Lifan; Munkberg, Jacob; Bernstein, Gilbert; Ragan-Kelley, Jonathan; Durand, Fredo; Lefohn, Aaron; He, Yong (December 2023, Transactions on Graphics)

Full Text Available
Semantics and Scheduling for Machine Knitting Compilers

https://doi.org/10.1145/3592449

Lin, Jenny; Narayanan, Vidya; Ikarashi, Yuka; Ragan-Kelley, Jonathan; Bernstein, Gilbert; McCann, James (August 2023, ACM Transactions on Graphics)

Machine knitting is a well-established fabrication technique for complex soft objects, and both companies and researchers have developed tools for generating machine knitting patterns. However, existing representations for machine knitted objects are incomplete (do not cover the complete domain of machine knittable objects) or overly specific (do not account for symmetries and equivalences among knitting instruction sequences). This makes it difficult to define correctness in machine knitting, let alone verify the correctness of a given program or program transformation. The major contribution of this work is a formal semantics for knitout, a low-level Domain Specific Language for knitting machines. We accomplish this by using what we call the "fenced tangle," which extends concepts from knot theory to allow for a mathematical definition of knitting program equivalence that matches the intuition behind knit objects. Finally, using this formal representation, we prove the correctness of a sequence of rewrite rules; and demonstrate how these rewrite rules can form the foundation for higher-level tasks such as compiling a program for a specific machine and optimizing for time/reliability, all while provably generating the same knit object under our proposed semantics. By establishing formal definitions of correctness, this work provides a strong foundation for compiling and optimizing knit programs.
more » « less
Full Text Available

« Prev Next »

Search for: All records