Search for: All records

Creators/Authors contains: "Payne, Ethan"

« Prev Next »

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Visualizing and Generalizing Integrated Attributions

https://doi.org/10.1007/978-3-031-78189-6_29

Payne, Ethan; Patrick, David; Fernandez, Amanda S (December 2024, Springer Nature Switzerland)

Explainability and attribution for deep neural networks remains an open area of study due to the importance of adequately interpreting the behavior of such ubiquitous learning models. The method of expected gradients [10] reduced the baseline dependence of integrated gradients [27] and allowed for improved interpretability of attributions as representative of the broader gradient landscape, however both methods are visualized using an ambiguous transformation which obscures attribution information and neglects to distinguish between color channels. While expected gradients takes an expectation over the entire dataset, this is only one possible domain in which an explanation can be contextualized. In order to generalize the larger family of attribution methods containing integrated gradients and expected gradients, we instead frame each attribution as a volume integral over a set of interest within the input space, allowing for new levels of specificity and revealing novel sources of attribution information. Additionally, we demonstrate these new unique sources of feature attribution information using a refined visualization method which allows for both signed and unsigned attributions to be visually salient for each color channel. This new formulation provides a framework for developing and explaining a much broader family of attribution measures, and for computing attributions relevant to diverse contexts such as local and non-local neighborhoods. We evaluate our novel family of attribution measures and our improved visualization method using qualitative and quantitative approaches with the CIFAR10 and ImageNet datasets and the Quantus XAI library.
more » « less
Full Text Available
Visualizing and Generalizing Integrated Attributions

Payne, Ethan; Patrick, David; Fernandez, Amanda S (December 2024, Springer-Verlag, ACM Digital Library)

Explainability and attribution for deep neural networks remains an open area of study due to the importance of adequately interpreting the behavior of such ubiquitous learning models. The method of expected gradients [10] reduced the baseline dependence of integrated gradients [27] and allowed for improved interpretability of attributions as representative of the broader gradient landscape, however both methods are visualized using an ambiguous transformation which obscures attribution information and neglects to distinguish between color channels. While expected gradients takes an expectation over the entire dataset, this is only one possible domain in which an explanation can be contextualized. In order to generalize the larger family of attribution methods containing integrated gradients and expected gradients, we instead frame each attribution as a volume integral over a set of interest within the input space, allowing for new levels of specificity and revealing novel sources of attribution information. Additionally, we demonstrate these new unique sources of feature attribution information using a refined visualization method which allows for both signed and unsigned attributions to be visually salient for each color channel. This new formulation provides a framework for developing and explaining a much broader family of attribution measures, and for computing attributions relevant to diverse contexts such as local and non-local neighborhoods. We evaluate our novel family of attribution measures and our improved visualization method using qualitative and quantitative approaches with the CIFAR10 and ImageNet datasets and the Quantus XAI library.
more » « less
Full Text Available
Explaining Model Parameters Using the Product Space

Payne, Ethan; Patrick, David; Fernandez, Amanda S (December 2024, Springer-Verlag, ACM Digital Library)

With the increasing interest in explainable attribution for deep neural networks, it is important to consider not only the importance of individual inputs, but also the model parameters themselves. Existing methods, such as Neuron Integrated Gradients [18] and Conductance [6], attempt model attribution by applying attribution methods, such as Integrated Gradients, to the inputs of each model parameter. While these methods seem to map attributions to individual parameters, these are actually aggregated feature attributions which completely ignore the parameter space and also suffer from the same underlying limitations of Integrated Gradients. In this work, we compute parameter attributions by leveraging the recent family of measures proposed by Generalized Integrated Attributions, by instead computing integrals over the product space of inputs and parameters. This usage of the product space allows us to now explain individual neurons from varying perspectives and interpret them with the same intuition as inputs. To the best of our knowledge, ours is the first method which actually utilizes the gradient landscape of the parameter space to explain each individual weight and bias. We confirm the utility of our parameter attributions by computing exploratory statistics for a wide variety of image classification datasets and by performing pruning analyses on a standard architecture, which demonstrate that our attribution measures are able to identify both important and unimportant neurons in a convolutional neural network.
more » « less
Full Text Available
Explaining Model Parameters Using the Product Space

https://doi.org/10.1007/978-3-031-78192-6_1

Payne, Ethan; Patrick, David; Fernandez, Amanda S (December 2024, Springer Nature Switzerland)

With the increasing interest in explainable attribution for deep neural networks, it is important to consider not only the importance of individual inputs, but also the model parameters themselves. Existing methods, such as Neuron Integrated Gradients [18] and Conductance [6], attempt model attribution by applying attribution methods, such as Integrated Gradients, to the inputs of each model parameter. While these methods seem to map attributions to individual parameters, these are actually aggregated feature attributions which completely ignore the parameter space and also suffer from the same underlying limitations of Integrated Gradients. In this work, we compute parameter attributions by leveraging the recent family of measures proposed by Generalized Integrated Attributions, by instead computing integrals over the product space of inputs and parameters. This usage of the product space allows us to now explain individual neurons from varying perspectives and interpret them with the same intuition as inputs. To the best of our knowledge, ours is the first method which actually utilizes the gradient landscape of the parameter space to explain each individual weight and bias. We confirm the utility of our parameter attributions by computing exploratory statistics for a wide variety of image classification datasets and by performing pruning analyses on a standard architecture, which demonstrate that our attribution measures are able to identify both important and unimportant neurons in a convolutional neural network.
more » « less
Full Text Available
Curvature Dependence of Gravitational-Wave Tests of General Relativity

https://doi.org/10.1103/PhysRevLett.133.251401

Payne, Ethan; Isi, Maximiliano; Chatziioannou, Katerina; Lehner, Luis; Chen, Yanbei; Farr, Will M (December 2024, Physical Review Letters)

Full Text Available
Are all models wrong? Falsifying binary formation models in gravitational-wave astronomy using exceptional events

https://doi.org/10.1093/mnras/stae2521

Passenger, Lachlan; Thrane, Eric; Lasky, Paul; Payne, Ethan; Stevenson, Simon; Farr, Ben (November 2024, Monthly Notices of the Royal Astronomical Society)

ABSTRACT As the catalogue of gravitational-wave transients grows, several entries appear ‘exceptional’ within the population. Tipping the scales with a total mass of $$\sim 150 \,{\rm M}_\odot$$, GW190521 likely contained black holes in the pair-instability mass gap. The event GW190814, meanwhile, is unusual for its extreme mass ratio and the mass of its secondary component. A growing model-building industry has emerged to provide explanations for such exceptional events, and Bayesian model selection is frequently used to determine the most informative model. However, Bayesian methods can only take us so far. They provide no answer to the question: does our model provide an adequate explanation for exceptional events in the data? If none of the models we are testing provide an adequate explanation, then it is not enough to simply rank our existing models – we need new ones. In this paper, we introduce a method to answer this question with a frequentist p-value. We apply the method to different models that have been suggested to explain the unusually massive event GW190521: hierarchical mergers in active galactic nuclei and globular clusters. We show that some (but not all) of these models provide adequate explanations for exceptionally massive events like GW190521.
more » « less
Spin Doctors: How to Diagnose a Hierarchical Merger Origin

https://doi.org/10.3847/2041-8213/ad3e82

Payne, Ethan; Kremer, Kyle; Zevin, Michael (April 2024, The Astrophysical Journal Letters)

Abstract Gravitational-wave observations provide the unique opportunity of studying black hole formation channels and histories—but only if we can identify their origin. One such formation mechanism is the dynamical synthesis of black hole binaries in dense stellar systems. Given the expected isotropic distribution of component spins of binary black holes in gas-free dynamical environments, the presence of antialigned or in-plane spins with respect to the orbital angular momentum is considered a tell-tale sign of a merger’s dynamical origin. Even in the scenario where birth spins of black holes are low, hierarchical mergers attain large component spins due to the orbital angular momentum of the prior merger. However, measuring such spin configurations is difficult. Here, we quantify the efficacy of the spin parameters encoding aligned-spin (χ_eff) and in-plane spin (χ_p) at classifying such hierarchical systems. Using Monte Carlo cluster simulations to generate a realistic distribution of hierarchical merger parameters from globular clusters, we can infer mergers’χ_effandχ_p. The cluster populations are simulated using Advanced LIGO-Virgo sensitivity during the detector network’s third observing period and projections for design sensitivity. Using a “likelihood-ratio”-based statistic, we find that ∼2% of the recovered population by the current gravitational-wave detector network has a statistically significantχ_pmeasurement, whereas noχ_effmeasurement was capable of confidently determining a system to be antialigned with the orbital angular momentum at current detector sensitivities. These results indicate that measuring spin-precession throughχ_pis a more detectable signature of hierarchical mergers and dynamical formation than antialigned spins.
more » « less
Full Text Available
Transdimensional Inference for Gravitational-wave Astronomy with Bilby

https://doi.org/10.3847/1538-4365/ad9deb

Tong, Hui; Guttman, Nir; Clarke, Teagan_A; Lasky, Paul_D; Thrane, Eric; Payne, Ethan; Nathan, Rowina; Farr, Ben; Fishbach, Maya; Ashton, Gregory; et al (January 2025, The Astrophysical Journal Supplement Series)

Abstract It has become increasingly useful to answer questions in gravitational-wave astronomy usingtransdimensionalmodels, where the number of free parameters can be varied depending on the complexity required to fit the data. Given the growing interest in transdimensional inference, we introduce a new package for the Bayesian inference Library (Bilby), calledtBilby. ThetBilbypackage allows users to set up transdimensional inference calculations using the existingBilbyarchitecture with off-the-shelf nested samplers and/or Markov Chain Monte Carlo algorithms. Transdimensional models are particularly helpful when seeking to test theoretically uncertain predictions described by phenomenological models. For example, bursts of gravitational waves can be modeled using a superposition ofNwavelets, whereNis itself a free parameter. Short pulses are modeled with small values ofN, whereas longer, more complicated signals are represented with a large number of wavelets stitched together. Other transdimensional models have been used to describe instrumental noise and the population properties of gravitational-wave sources. We provide a few demonstrations oftBilby, including fitting the gravitational-wave signal GW150914 with a superposition ofNsine-Gaussian wavelets. We outline our plans to further develop thetBilbycode suite for a broader range of transdimensional problems.
more » « less
Fortifying gravitational-wave tests of general relativity against astrophysical assumptions

https://doi.org/10.1103/PhysRevD.108.124060

Payne, Ethan; Isi, Maximiliano; Chatziioannou, Katerina; Farr, Will M (December 2023, Physical Review D)

Full Text Available
Transdimensional inference for gravitational-wave astronomy with Bilby

Tong, Hui; Guttman, Nir; Clarke, Teagan A; Lasky, Paul D; Thrane, Eric; Payne, Ethan; Nathan, Rowina; Farr, Ben; Fishbach, Maya; Ashton, Gregory; et al (April 2024, The Astrophysical journal Supplement series)

It has become increasingly useful to answer questions in gravitational-wave astronomy using transdimensional models where the number of free parameters can be varied depending on the complexity required to fit the data. Given the growing interest in transdimensional inference, we introduce a new package for the Bayesian inference Library (Bilby) called tBilby. The tBilby package allows users to set up transdimensional inference calculations using the existing Bilby architecture with off-the-shelf nested samplers and/or Markov Chain Monte Carlo algorithms. Transdimensional models are particularly helpful when we seek to test theoretically uncertain predictions described by phenomenological models. For example, bursts of gravitational waves can be modelled using a superposition of N wavelets where N is itself a free parameter. Short pulses are modelled with small values of N whereas longer, more complicated signals are represented with a large number of wavelets stitched together. Other transdimensional models have found use describing instrumental noise and the population properties of gravitational-wave sources. We provide a few demonstrations of tBilby, including fitting the gravitational-wave signal GW150914 with a superposition of N sine-Gaussian wavelets. We outline our plans to further develop the tbilby code suite for a broader range of transdimensional problems.
more » « less
Full Text Available

« Prev Next »