NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Boosting Soft Q-Learning by Bounding

Adamczyk, Jacob; Makarenko, Volodymyr; Tiomkin, Stas; Kulkarni, Rahul (September 2024, Reinforcement Learning Journal)

Full Text Available
Bayesian inference approach for entropy regularized reinforcement learning with stochastic dynamics

Argenis Arriojas, Jacob Adamczyk (July 2023, Proceedings of Machine Learning Research)

Full Text Available
Bounding the optimal value function in compositional reinforcement learning

Jacob Adamczyk, Volodymyr Makarenko (July 2023, Proceedings of Machine Learning Research)

Full Text Available
Bayesian inference approach for entropy regularized reinforcement learning with stochastic dynamics

Argenis Arriojas, Jacob Adamczyk (July 2023, Proceedings of Machine Learning Research)

Full Text Available
Utilizing Prior Solutions for Reward Shaping and Composition in Entropy-Regularized Reinforcement Learning

https://doi.org/10.1609/aaai.v37i6.25817

Adamczyk, Jacob; Arriojas, Argenis; Tiomkin, Stas; Kulkarni, Rahul V. (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

In reinforcement learning (RL), the ability to utilize prior knowledge from previously solved tasks can allow agents to quickly solve new problems. In some cases, these new problems may be approximately solved by composing the solutions of previously solved primitive tasks (task composition). Otherwise, prior knowledge can be used to adjust the reward function for a new problem, in a way that leaves the optimal policy unchanged but enables quicker learning (reward shaping). In this work, we develop a general framework for reward shaping and task composition in entropy-regularized RL. To do so, we derive an exact relation connecting the optimal soft value functions for two entropy-regularized RL problems with different reward functions and dynamics. We show how the derived relation leads to a general result for reward shaping in entropy-regularized RL. We then generalize this approach to derive an exact relation connecting optimal value functions for the composition of multiple tasks in entropy-regularized RL. We validate these theoretical contributions with experiments showing that reward shaping and task composition lead to faster learning in various settings.
more » « less
Full Text Available
Entropy regularized reinforcement learning using large deviation theory

https://doi.org/10.1103/PhysRevResearch.5.023085

Arriojas, Argenis; Adamczyk, Jacob; Tiomkin, Stas; Kulkarni, Rahul V. (May 2023, Physical Review Research)

Full Text Available
Modulation of stochastic gene expression by nuclear export processes

https://doi.org/10.1109/CDC45484.2021.9683294

Smith, Madeline; Soltani, Mohammad; Kulkarni, Rahul; Singh, Abhyudai (December 2021, 2021 60th IEEE Conference on Decision and Control (CDC))

Inside mammalian cells, single genes are known to be transcribed in stochastic bursts leading to the synthesis of nuclear RNAs that are subsequently exported to the cytoplasm to create mRNAs. We systematically characterize the role of export processes in shaping the extent of random fluctuations (i.e. noise) in the mRNA level of a given gene. Using the method of Partitioning of Poisson arrivals, we derive an exact analytical expression for the noise in mRNA level assuming that the nuclear retention time of each RNA is an independent and identically distributed random variable following an arbitrary distribution. These results confirm recent experimental/theoretical findings that decreasing the nuclear export rate buffers the noise in mRNA level, and counterintuitively, decreasing the noise in the nuclear retention time enhances the noise in the mRNA level. Next, we further generalize the model to consider a dynamic extrinsic disturbance that affects the nuclear-to-cytoplasm export. Our results show that noise in the mRNA level varies non-monotonically with the disturbance timescale. More specifically, high- and low-frequency external disturbances have little impact on the mRNA noise level, while noise is amplified at intermediate frequencies. In summary, our results systematically uncover how the coupling of bursty transcription with nuclear export can both attenuate or amplify noise in mRNA levels depending on the nuclear retention time distribution and the presence of extrinsic fluctuations.
more » « less
Full Text Available
Constraining the complexity of promoter dynamics using fluctuations in gene expression

https://doi.org/10.1088/1478-3975/ab4e57

Kumar, Niraj; Kulkarni, Rahul V (January 2020, Physical Biology)

Full Text Available

Search for: All records