NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Controlled Sequential Information Fusion with Social Sensors

https://doi.org/10.1109/TAC.2020.3046024

Bhatt, Sujay; Krishnamurthy, Vikram (January 2021, IEEE Transactions on Automatic Control)
null (Ed.)
Full Text Available
Quickest Change Detection of Time Inconsistent Anticipatory Agents. Human-Sensor and Cyber-Physical Systems

https://doi.org/10.1109/TSP.2021.3050977

Krishnamurthy, Vikram (January 2021, IEEE Transactions on Signal Processing)
null (Ed.)
Full Text Available
Friendship paradox biases perceptions in directed networks

https://doi.org/10.1038/s41467-020-14394-x

Alipourfard, Nazanin; Nettasinghe, Buddhika; Abeliuk, Andrés; Krishnamurthy, Vikram; Lerman, Kristina (December 2020, Nature Communications)

Full Text Available
Rationally Inattentive Inverse Reinforcement Learning Explains YouTube Commenting Behavior

Hoiles, William; Krishnamurthy, Vikram; Pattanayak, Kunal (September 2020, Journal of machine learning research)
null (Ed.)
We consider a novel application of inverse reinforcement learning with behavioral economics constraints to model, learn and predict the commenting behavior of YouTube viewers. Each group of users is modeled as a rationally inattentive Bayesian agent which solves a contextual bandit problem. Our methodology integrates three key components. First, to identify distinct commenting patterns, we use deep embedded clustering to estimate framing information (essential extrinsic features) that clusters users into distinct groups. Second, we present an inverse reinforcement learning algorithm that uses Bayesian revealed preferences to test for rationality: does there exist a utility function that rationalizes the given data, and if yes, can it be used to predict commenting behavior? Finally, we impose behavioral economics constraints stemming from rational inattention to characterize the attention span of groups of users. The test imposes a Rényi mutual information cost constraint which impacts how the agent can select attention strategies to maximize their expected utility. After a careful analysis of a massive YouTube dataset, our surprising result is that in most YouTube user groups, the commenting behavior is consistent with optimizing a Bayesian utility with rationally inattentive constraints. The paper also highlights how the rational inattention model can accurately predict commenting behavior. The massive YouTube dataset and analysis used in this paper are available on GitHub and completely reproducible
more » « less
Full Text Available
Convex Stochastic Dominance in Bayesian Localization, Filtering, and Controlled Sensing POMDPs

https://doi.org/10.1109/TIT.2019.2948598

Krishnamurthy, Vikram (May 2020, IEEE Transactions on Information Theory)

Full Text Available
Identifying Cognitive Radars - Inverse Reinforcement Learning Using Revealed Preferences

https://doi.org/10.1109/TSP.2020.3013516

Krishnamurthy, Vikram; Angley, Daniel; Evans, Robin; Moran, Bill (January 2020, IEEE Transactions on Signal Processing)
null (Ed.)
Full Text Available
Inverse Filtering for Hidden Markov Models With Applications to Counter-Adversarial Autonomous Systems

https://doi.org/10.1109/TSP.2020.3019177

Mattila, Robert; Rojas, Cristian R.; Krishnamurthy, Vikram; Wahlberg, Bo (January 2020, IEEE Transactions on Signal Processing)
null (Ed.)
Full Text Available
How to Calibrate Your Adversary's Capabilities? Inverse Filtering for Counter-Autonomous Systems

https://doi.org/10.1109/TSP.2019.2956676

Krishnamurthy, Vikram; Rangaswamy, Muralidhar (December 2019, IEEE Transactions on Signal Processing)

Full Text Available
Policy Gradient using Weak Derivatives for Reinforcement Learning

https://doi.org/10.1109/CDC40024.2019.9029403

Bhatt, Sujay; Koppel, Alec; Krishnamurthy, Vikram (December 2019, 2019 IEEE 58th Conference on Decision and Control)

This paper considers policy search in continuous state-action reinforcement learning problems. Typically, one computes search directions using a classic expression for the policy gradient called the Policy Gradient Theorem, which decomposes the gradient of the value function into two factors: the score function and the Q-function. This paper presents four results: (i) an alternative policy gradient theorem using weak (measure-valued) derivatives instead of score-function is established; (ii) the stochastic gradient estimates thus derived are shown to be unbiased and to yield algorithms that converge almost surely to stationary points of the non-convex value function of the reinforcement learning problem; (iii) the sample complexity of the algorithm is derived and is shown to be O(1/ k); (iv) finally, the expected variance of the gradient estimates obtained using weak derivatives is shown to be lower than those obtained using the popular score-function approach. Experiments on OpenAI gym pendulum environment illustrate the superior performance of the proposed algorithm.
more » « less
Full Text Available
"What Do Your Friends Think?": Efficient Polling Methods for Networks Using Friendship Paradox

https://doi.org/10.1109/TKDE.2019.2940914

Nettasinghe, Buddhika; Krishnamurthy, Vikram (September 2019, IEEE Transactions on Knowledge and Data Engineering)

This paper deals with randomized polling of a social network. In the case of forecasting the outcome of an election between two candidates A and B, classical intent polling asks randomly sampled individuals: who will you vote for? Expectation polling asks: who do you think will win? In this paper, we propose a novel neighborhood expectation polling (NEP) strategy that asks randomly sampled individuals: what is your estimate of the fraction of votes for A? Therefore, in NEP, sampled individuals will naturally look at their neighbors (defined by the underlying social network graph) when answering this question. Hence, the mean squared error (MSE) of NEP methods rely on selecting the optimal set of samples from the network. To this end, we propose three NEP algorithms for the following cases: (i) the social network graph is not known but, random walks (sequential exploration) can be performed on the graph (ii) the social network graph is unknown. For case (i) and (ii), two algorithms based on a graph theoretic consequence called friendship paradox are proposed. Theoretical results on the dependence of the MSE of the algorithms on the properties of the network are established. Numerical results on real and synthetic data sets are provided to illustrate the performance of the algorithms.
more » « less
Full Text Available

« Prev Next »

Search for: All records