NSF PAR Search | NSF Public Access Repository


Toward a taxonomy of trust for probabilistic machine learning

https://doi.org/10.1126/sciadv.abn3999

Broderick, Tamara; Gelman, Andrew; Meager, Rachael; Smith, Anna L.; Zheng, Tian (February 2023, Science Advances)

A taxonomy delineates where trust can break down in a probabilistic machine learning workflow that informs critical decisions.
Full Text Available
Semi-symbolic inference for efficient streaming probabilistic programming

https://doi.org/10.1145/3563347

Atkinson, Eric; Yuan, Charles; Baudart, Guillaume; Mandel, Louis; Carbin, Michael (October 2022, Proceedings of the ACM on Programming Languages)

A streaming probabilistic program receives a stream of observations and produces a stream of distributions that are conditioned on these observations. Efficient inference is often possible in a streaming context using Rao-Blackwellized particle filters (RBPFs), which exactly solve inference problems when possible and fall back on sampling approximations when necessary. While RBPFs can be implemented by hand to provide efficient inference, the goal of streaming probabilistic programming is to automatically generate such efficient inference implementations given input probabilistic programs. In this work, we propose semi-symbolic inference, a technique for executing probabilistic programs using a runtime inference system that automatically implements Rao-Blackwellized particle filtering. To perform exact and approximate inference together, the semi-symbolic inference system manipulates symbolic distributions to perform exact inference when possible and falls back on approximate sampling when necessary. This approach enables the system to implement the same RBPF a developer would write by hand. To ensure this, we identify closed families of distributions – such as linear-Gaussian and finite discrete models – on which the inference system guarantees exact inference. We have implemented the runtime inference system in the ProbZelus streaming probabilistic programming language. Despite an average 1.6× slowdown compared to the state of the art on existing benchmarks, our evaluation shows that speedups of 3×-87× are obtainable on a new set of challenging benchmarks we have designed to exploit closed families.
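The "exact inference when possible" half of this description can be illustrated with a linear-Gaussian model, one of the closed families the abstract names. The sketch below is not the ProbZelus runtime or the paper's algorithm. It assumes a scalar latent value with a Gaussian prior observed through Gaussian noise, and the names `Gaussian`, `observe`, and `stream` are hypothetical. It shows only the closed-form update; the sampling fallback is omitted.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass(frozen=True)
class Gaussian:
    """Symbolic stand-in for a normal distribution N(mean, var)."""
    mean: float
    var: float


def observe(prior: Gaussian, y: float, obs_var: float) -> Gaussian:
    """Exact posterior over x given x ~ prior and y ~ N(x, obs_var).

    This is the standard conjugate (scalar Kalman) update, so no
    sampling is needed for this model family.
    """
    gain = prior.var / (prior.var + obs_var)
    return Gaussian(
        mean=prior.mean + gain * (y - prior.mean),
        var=(1.0 - gain) * prior.var,
    )


def stream(prior: Gaussian, ys: Iterable[float], obs_var: float) -> Iterator[Gaussian]:
    """Turn a stream of observations into a stream of posteriors."""
    belief = prior
    for y in ys:
        belief = observe(belief, y, obs_var)
        yield belief
```

Each call to `observe` keeps the belief inside the Gaussian family, which is what makes the update exact. A model outside such a family would need the approximate sampling path that the abstract describes.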
Full Text Available
Sparseloop: An Analytical Approach To Sparse Tensor Accelerator Modeling

https://doi.org/10.1109/MICRO56248.2022.00096

Wu, Yannan Nellie; Tsai, Po-An; Parashar, Angshuman; Sze, Vivienne; Emer, Joel S. (October 2022, 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO))

Full Text Available
Statically bounded-memory delayed sampling for probabilistic streams

https://doi.org/10.1145/3485492

Atkinson, Eric; Baudart, Guillaume; Mandel, Louis; Yuan, Charles; Carbin, Michael (October 2021, Proceedings of the ACM on Programming Languages)

Probabilistic programming languages aid developers performing Bayesian inference. These languages provide programming constructs and tools for probabilistic modeling and automated inference. Prior work introduced a probabilistic programming language, ProbZelus, to extend probabilistic programming functionality to unbounded streams of data. This work demonstrated that the delayed sampling inference algorithm could be extended to work in a streaming context. ProbZelus showed that while delayed sampling could be effectively deployed on some programs, depending on the probabilistic model under consideration, delayed sampling is not guaranteed to use a bounded amount of memory over the course of the execution of the program. In this paper, we present the conditions on a probabilistic program’s execution under which delayed sampling will execute in bounded memory. The two conditions are dataflow properties of the core operations of delayed sampling: the m-consumed property and the unseparated paths property. A program executes in bounded memory under delayed sampling if, and only if, it satisfies the m-consumed and unseparated paths properties. We propose a static analysis that abstracts over these properties to soundly ensure that any program that passes the analysis satisfies these properties, and thus executes in bounded memory under delayed sampling.
Full Text Available
Simplifying dependent reductions in the polyhedral model

https://doi.org/10.1145/3434301

Yang, Cambridge; Atkinson, Eric; Carbin, Michael (January 2021, Proceedings of the ACM on Programming Languages)

A reduction – an accumulation over a set of values, using an associative and commutative operator – is common in many numerical computations, including scientific computations, machine learning, computer vision, and financial analytics. Contemporary polyhedral-based compilation techniques make it possible to optimize reductions, such as prefix sums, in which each component of the reduction’s output potentially shares computation with another component in the reduction. An optimizing compiler can therefore identify the computation shared between multiple components and generate code that computes the shared computation only once. These techniques, however, do not support reductions that – when phrased in the language of the polyhedral model – span multiple dependent statements. In such cases, existing approaches can generate incorrect code that violates the data dependences of the original, unoptimized program. In this work, we identify and formalize the optimization of dependent reductions as an integer bilinear program. We present a heuristic optimization algorithm that uses an affine sequential schedule of the program to determine how to simplify reductions yet still preserve the program’s dependences. We demonstrate that the algorithm provides optimal complexity for a set of benchmark programs from the literature on probabilistic inference algorithms, whose performance critically relies on simplifying these reductions. The complexities for 10 of the 11 programs improve significantly, by factors of at least the size of the input data, which is in the range of 10^4 to 10^6 for typical real application inputs. We also confirm the significance of the improvement by showing speedups in wall-clock time that range from 1.1x to over 10^6x.
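The prefix-sum case mentioned in this abstract makes the idea of shared computation concrete. The sketch below is a hand-written illustration, not the paper's compiler transformation, and the function names are hypothetical. The first version recomputes every output's reduction from scratch; the second reuses the previous output, which is the kind of sharing a simplifying compiler aims to find automatically.

```python
from typing import List


def prefix_sums_naive(xs: List[int]) -> List[int]:
    """Each output is an independent reduction: O(n^2) additions overall."""
    return [sum(xs[: i + 1]) for i in range(len(xs))]


def prefix_sums_shared(xs: List[int]) -> List[int]:
    """Each output reuses the previous one: O(n) additions overall."""
    out: List[int] = []
    running = 0
    for x in xs:
        running += x  # out[i] = out[i - 1] + xs[i]
        out.append(running)
    return out
```

For an input of length n, the second version saves a factor of about n in additions, which is consistent with the abstract's statement that complexities improve by factors of at least the size of the input data.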
Full Text Available
