Saving energy for latency-critical applications like web search is challenging because of their strict tail latency constraints. State-of-the-art power management frameworks use Dynamic Voltage and Frequency Scaling (DVFS) and sleep-state techniques to slow down request processing and finish the search just in time. However, accurately predicting the compute demand of a request can be difficult. In this paper, we present Gemini, a novel power management framework for latency-critical search engines. Gemini has two unique features to capture per-query service time variation. First, at light loads without request queuing, a two-step DVFS scheme is used to manage CPU power. Our two-step DVFS selects the initial CPU frequency based on the query-specific service time prediction and then judiciously boosts that frequency at the right time to catch up to the deadline. The determination of the boosting time further relies on estimating the error in the prediction of each individual query's service time. At high loads, where there is request queuing, only the request currently being executed and the critical request in the queue adopt two-step DVFS; all the other requests in between use the same frequency to reduce the frequency transition overhead. Second, we develop two separate neural network models, one for predicting the service time and the other for the error in that prediction. The combination of these two predictors significantly improves the power saving and tail latency results of our two-step DVFS. Gemini is implemented on the Solr search engine. Evaluations on three representative query traces show that Gemini saves 41% of the CPU power and outperforms other state-of-the-art techniques.
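To make the two-step DVFS schedule concrete, the sketch below computes an initial frequency from the predicted service time and the latest safe boost time from the predicted error, assuming service time scales linearly with the inverse of CPU frequency. The frequency range, the scaling model, and the example numbers are illustrative assumptions, not Gemini's actual implementation.

```python
# A minimal sketch of the two-step DVFS idea described in the abstract above:
# pick an initial frequency from the predicted service time, then compute the
# latest moment at which boosting to the maximum frequency still meets the
# deadline even if the query runs longer than predicted by the estimated error.
# F_MAX, F_MIN, and the linear time-vs-frequency model are assumptions.

F_MAX = 3.0e9   # Hz, assumed maximum core frequency
F_MIN = 1.2e9   # Hz, assumed minimum core frequency

def initial_frequency(pred_service_s, deadline_s):
    """Slowest frequency that finishes the predicted work by the deadline,
    assuming pred_service_s is measured at F_MAX and service time scales
    linearly with 1/frequency."""
    needed = F_MAX * pred_service_s / deadline_s
    return min(F_MAX, max(F_MIN, needed))

def boost_time(pred_service_s, pred_error_s, deadline_s, f_init):
    """Latest elapsed time at which switching from f_init to F_MAX still
    meets the deadline if the query takes pred_error_s longer than predicted."""
    worst_case_s = pred_service_s + pred_error_s   # pessimistic service time at F_MAX
    rate = f_init / F_MAX                          # work retired per second, in F_MAX-seconds
    if rate >= 1.0:                                # already at full speed: no boost needed
        return deadline_s
    # Solve rate * t + (deadline_s - t) = worst_case_s for t.
    t_boost = (deadline_s - worst_case_s) / (1.0 - rate)
    return max(0.0, min(deadline_s, t_boost))

# Example: a query predicted to take 40 ms at F_MAX, with a 10 ms error bound
# and a 100 ms deadline. The query starts slow and is boosted to F_MAX only if
# it is still running at t_boost.
f0 = initial_frequency(0.040, 0.100)
tb = boost_time(0.040, 0.010, 0.100, f0)
print(f"initial frequency: {f0 / 1e9:.2f} GHz, boost at t = {tb * 1000:.1f} ms")
```

In this toy setting the query starts at the lowest frequency and is boosted only around 83 ms into the 100 ms budget, so a correctly predicted query never pays for the boost at all.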
Power management in data centers is challenging because of fluctuating workloads and strict task completion time requirements. Recent resource provisioning systems, such as Borg and RC-Informed, pack tasks onto servers to save power. However, current packing-based power optimization frameworks leave very little headroom for load spikes, and task completion times are compromised. In this paper, we design Goldilocks, a novel resource provisioning system that optimizes both power and task completion time by allocating tasks to servers in groups. Tasks hosted in containers are grouped together by running a graph partitioning algorithm: containers that communicate frequently are placed together, which improves task completion times. We also leverage new findings on the power consumption of modern-day servers to ensure that their utilizations stay in a range where they are power-proportional. Both testbed implementation measurements and large-scale trace-driven simulations show that Goldilocks outperforms all previous works on data center power saving. Goldilocks saves 11.7%-26.2% of power depending on the workload, whereas the best of the implemented alternatives, Borg, saves 8.9%-22.8%. The energy per request for the Twitter content caching workload under Goldilocks is only 33% of that under RC-Informed. Finally, the best alternative in terms of task completion time, E-PVM, has 1.17-3.29 times higher task completion times than Goldilocks across workloads.
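The sketch below illustrates, under stated assumptions, the two steps described in the abstract: a greedy, balanced grouping of containers by pairwise traffic (a stand-in for Goldilocks' graph partitioning) and a first-fit packing of group demands that keeps each server inside an assumed power-proportional utilization band. The traffic matrix, band limits, and demand values are hypothetical.

```python
# A minimal sketch (not Goldilocks' actual algorithm) of grouping containers by
# communication volume and packing the groups onto servers so that utilization
# stays inside an assumed power-proportional band.

import math

UTIL_LOW, UTIL_HIGH = 0.4, 0.7   # assumed power-proportional utilization band
SERVER_CAPACITY = 1.0            # normalized CPU capacity per server

def greedy_groups(traffic, num_groups):
    """Assign each container to the non-full group whose members it exchanges
    the most traffic with (a greedy stand-in for the graph partitioning step).
    traffic maps (container_a, container_b) -> bytes exchanged."""
    containers = sorted({c for pair in traffic for c in pair})
    cap = math.ceil(len(containers) / num_groups)   # keep the groups balanced
    groups = [set() for _ in range(num_groups)]
    for c in containers:
        candidates = [g for g in groups if len(g) < cap]
        best = max(candidates,
                   key=lambda g: sum(traffic.get((c, o), 0) + traffic.get((o, c), 0)
                                     for o in g))
        best.add(c)
    return groups

def place_groups(group_demands):
    """First-fit-decreasing packing of per-group CPU demands onto servers,
    capping every server at the top of the utilization band."""
    loads = []                                      # current utilization per server
    for demand in sorted(group_demands, reverse=True):
        for i, load in enumerate(loads):
            if load + demand <= UTIL_HIGH * SERVER_CAPACITY:
                loads[i] += demand
                break
        else:
            loads.append(demand)                    # open a new server
    return loads

# Example: six containers with a skewed communication pattern are split into
# two groups ({a, b, c} and {d, e, f}); then four groups' assumed CPU demands
# are packed onto servers and checked against the utilization band.
traffic = {("a", "b"): 90, ("b", "c"): 80, ("c", "d"): 5, ("d", "e"): 70, ("e", "f"): 60}
print(greedy_groups(traffic, num_groups=2))
loads = place_groups([0.35, 0.30, 0.25, 0.20])
print(loads, all(UTIL_LOW <= u <= UTIL_HIGH for u in loads))
```

Keeping chatty containers in one group cuts cross-server traffic, while the cap at the top of the band leaves headroom for spikes and the consolidation onto fewer servers keeps each one busy enough to stay power-proportional.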