skip to main content

Title: Distributed Principal Subspace Analysis for Partitioned Big Data: Algorithms, Analysis, and Implementation
Award ID(s):
1453073 1907658 1940074
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
IEEE Transactions on Signal and Information Processing over Networks
Page Range / eLocation ID:
699 to 715
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Ruis, Andrew R. ; Lee, Seung B. (Ed.)
    While there has been much growth in the use of microblogging platforms (e.g., Twitter) to share information on a range of topics, researchers struggle to analyze the large volumes of data produced on such platforms. Established methods such as Sentiment Analysis (SA) have been criticized over their inaccuracy and limited analytical depth. In this exploratory methodological paper, we propose a combination of SA with Epistemic Network Analysis (ENA) as an alternative approach for providing richer qualitative and quantitative insights into Twitter discourse. We illustrate the application and potential use of these approaches by visualizing the differences between tweets directed or discussing Democrats and Republicans after the COVID-19 Stimulus Package announcement in the US. SA was integrated into ENA models in two ways: as a part of the blocking variable and as a set of codes. Our results suggest that incorporating SA into ENA allowed for a better understanding of how groups viewed the components of the stimulus issue by splitting them by sentiment and enabled a meaningful inclusion of data with singular subject focus into the ENA models. 
    more » « less
  2. ABSTRACT We present Bayesian active galactic nucleus (AGN) Decomposition Analysis for Sloan Digital Sky Survey (SDSS) Spectra, an open source spectral analysis code designed for automatic detailed deconvolution of AGN and host galaxy spectra, implemented in python, and designed for the next generation of large-scale surveys. The code simultaneously fits all spectral components, including power-law continuum, stellar line-of-sight velocity distribution, Fe ii emission, as well as forbidden (narrow), permitted (broad), and outflow emission line features, all performed using Markov chain Monte Carlo to obtain robust uncertainties and autocorrelation analysis to assess parameter convergence. Our code also utilizes multiprocessing for batch fitting large samples of spectra while efficiently managing memory and computation resources and is currently being used in a cluster environment to fit thousands of SDSS spectra. We use our code to perform a correlation analysis of 63 SDSS type 1 AGNs with evidence of strong non-gravitational outflow kinematics in the [O iii] λ5007 emission feature. We confirm findings from previous studies that show the core of the [O iii] profile is a suitable surrogate for stellar velocity dispersion σ*, however there is evidence that the core experiences broadening that scales with outflow velocity. We find sufficient evidence that σ*, [O iii] core dispersion, and the non-gravitational outflow dispersion of the [O iii] profile form a plane whose fit results in a scatter of ∼0.1 dex. Finally, we discuss the implications, caveats, and recommendations when using the [O iii] dispersion as a surrogate for σ* for the MBH−σ* relation. 
    more » « less