- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources4
- Resource Type
-
0002000001010000
- More
- Availability
-
40
- Author / Contributor
- Filter by Author / Creator
-
-
Sarkar, Soumik (4)
-
Tan, Sin Yong (4)
-
Balu, Aditya (2)
-
Jiang, Zhanhong (2)
-
Lee, Young M (2)
-
Chakraborty, Subhadeep (1)
-
Feng, Jiale (1)
-
Harris, Laura (1)
-
Hedge, Chinmay (1)
-
Hegde, Chinmay (1)
-
Katanbaf, Mohamad (1)
-
Lee, Xian Yeow (1)
-
Saffari, Ali (1)
-
Saha, Homagni (1)
-
Smith, Joshua R. (1)
-
Tan, Kai Liang (1)
-
Tavassoli, Riley (1)
-
Waite, Joshua R (1)
-
#Tyler Phillips, Kenneth E. (0)
-
#Willis, Ciara (0)
-
- Filter by Editor
-
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
(submitted - in Review for IEEE ICASSP-2024) (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Jiang, Zhanhong; Lee, Xian Yeow; Tan, Sin Yong; Tan, Kai Liang; Balu, Aditya; Lee, Young M; Hegde, Chinmay; Sarkar, Soumik (, Proceedings of the AAAI Conference on Artificial Intelligence)We propose a novel policy gradient method for multi-agent reinforcement learning, which leverages two different variance-reduction techniques and does not require large batches over iterations. Specifically, we propose a momentum-based decentralized policy gradient tracking (MDPGT) where a new momentum-based variance reduction technique is used to approximate the local policy gradient surrogate with importance sampling, and an intermediate parameter is adopted to track two consecutive policy gradient surrogates. MDPGT provably achieves the best available sample complexity of O(N -1 e -3) for converging to an e-stationary point of the global average of N local performance functions (possibly nonconcave). This outperforms the state-of-the-art sample complexity in decentralized model-free reinforcement learning and when initialized with a single trajectory, the sample complexity matches those obtained by the existing decentralized policy gradient methods. We further validate the theoretical claim for the Gaussian policy function. When the required error tolerance e is small enough, MDPGT leads to a linear speed up, which has been previously established in decentralized stochastic optimization, but not for reinforcement learning. Lastly, we provide empirical results on a multi-agent reinforcement learning benchmark environment to support our theoretical findings.more » « less
-
Balu, Aditya; Jiang, Zhanhong; Tan, Sin Yong; Hedge, Chinmay; Lee, Young M; Sarkar, Soumik (, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))
-
Saffari, Ali; Tan, Sin Yong; Katanbaf, Mohamad; Saha, Homagni; Smith, Joshua R.; Sarkar, Soumik (, EMDL 2021: 5th International Workshop on Embedded and Mobile Deep Learning)Occupancy detection systems are commonly equipped with high quality cameras and a processor with high computational power to run detection algorithms. This paper presents a human occupancy detection system that uses battery-free cameras and a deep learning model implemented on a low-cost hub to detect human presence. Our low-resolution camera harvests energy from ambient light and transmits data to the hub using backscatter communication. We implement the state-of-the-art YOLOv5 network detection algorithm that offers high detection accuracy and fast inferencing speed on a Raspberry Pi 4 Model B. We achieve an inferencing speed of ∼100ms per image and an overall detection accuracy of >90% with only 2GB CPU RAM on the Raspberry Pi. In the experimental results, we also demonstrate that the detection is robust to noise, illuminance, occlusion, and angle of depression.more » « less
An official website of the United States government

Full Text Available