Search for: All records

Award ID contains: 1717179

« Prev Next »

Total Resources

9

Resource Type
Conference Paper

6

Conference Proceeding

0

Dataset

0

Journal Article

3

Workshop Report

0

Availability
Full Text / Resource Available

8

Citation Only

1

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

AggFirstJoin: Optimizing Geo-Distributed Joins using Aggregation-Based Transformations

https://doi.org/10.1109/CCGrid57682.2023.00046

Kumar, Dhruv ; Ahmad, Sohaib ; Chandra, Abhishek ; Sitaraman, Ramesh K. ( May 2023 , IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid))

Free, publicly-accessible full text available May 1, 2024
Towards WAN-aware join sampling over geo-distributed data

https://doi.org/10.1145/3517206.3526268

Kumar, Dhruv ; Wolfrath, Joel ; Chandra, Abhishek ; Sitaraman, Ramesh K. ( April 2022 , EdgeSys '22: Proceedings of the 5th International Workshop on Edge Systems, Analytics and Networking)

Full Text Available
AggNet: Cost-Aware Aggregation Networks for Geo-distributed Streaming Analytics

https://doi.org/10.1145/3453142.3491276

Dhruv Kumar ; Sohaib Ahmad ; Abhishek Chandra ; Ramesh K. Sitaraman ( December 2021 , IEEE/ACM Symposium on Edge Computing (SEC))

Large-scale real-time analytics services continuously collect and analyze data from end-user applications and devices distributed around the globe. Such analytics requires data to be transferred over the wide-area network (WAN) to data centers (DCs) capable of processing the data. Since WAN bandwidth is expensive and scarce, it is beneficial to reduce WAN traffic by partially aggregating the data closer to end-users. We propose aggregation networks for performing aggregation on a geo-distributed edge-cloud infrastructure consisting of edge servers, transit and destination DCs. We identify a rich set of research questions aimed at reducing the traffic costs in an aggregation network. We present an optimization formulation for solving these questions in a principled manner, and use insights from the optimization solutions to propose an efficient, near-optimal practical heuristic. We implement the heuristic in AggNet, built on top of Apache Flink. We evaluate our approach using a geo-distributed deployment on Amazon EC2 as well as a WAN-emulated local testbed. Our evaluation using real-world traces from Twitter and Akamai shows that our approach is able to achieve 47% to 83% reduction in traffic cost over existing baselines without any compromise in timeliness.
more » « less
Full Text Available
AggNet: Cost-Aware Aggregation Networks for Geo-distributed Streaming Analytics

Kumar, Dhruv ; Ahmad, Sohaib ; Chandra, Abhishek ; Sitaraman, Ramesh K. ( January 2021 , ACM/IEEE Symposium on Edge Computing (SEC'21))
null (Ed.)
Large-scale real-time analytics services continuously collect and analyze data from end-user applications and devices distributed around the globe. Such analytics requires data to be transferred over the wide-area network (WAN) to data centers (DCs) capable of processing the data. Since WAN bandwidth is expensive and scarce, it is beneficial to reduce WAN traffic by partially aggregating the data closer to end-users. We propose aggregation networks for per- forming aggregation on a geo-distributed edge-cloud infrastructure consisting of edge servers, transit and destination DCs. We identify a rich set of research questions aimed at reducing the traffic costs in an aggregation network. We present an optimization formula- tion for solving these questions in a principled manner, and use insights from the optimization solutions to propose an efficient, near-optimal practical heuristic. We implement the heuristic in AggNet, built on top of Apache Flink. We evaluate our approach using a geo-distributed deployment on Amazon EC2 as well as a WAN-emulated local testbed. Our evaluation using real-world traces from Twitter and Akamai shows that our approach is able to achieve 47% to 83% reduction in traffic cost over existing baselines without any compromise in timeliness.
more » « less
Full Text Available
RL-Cache: Learning-Based Cache Admission for Content Delivery

https://doi.org/10.1109/JSAC.2020.3000415

Kirilin, Vadim ; Sundarrajan, Aditya ; Gorinsky, Sergey ; Sitaraman, Ramesh K. ( June 2020 , IEEE Journal on Selected Areas in Communications)

Content delivery networks (CDNs) distribute much of the Internet content by caching and serving the objects requested by users. A major goal of a CDN is to maximize the hit rates of its caches, thereby enabling faster content downloads to the users. Content caching involves two components: an admission algorithm to decide whether to cache an object and an eviction algorithm to determine which object to evict from the cache when it is full. In this paper, we focus on cache admission and propose a novel algorithm called RL-Cache that uses model-free reinforcement learning (RL) to decide whether or not to admit a requested object into the CDN’s cache. Unlike prior approaches that use a small set of criteria for decision making, RL-Cache weights a large set of features that include the object size, recency, and frequency of access. We develop a publicly available implementation of RL-Cache and perform an evaluation using production traces for the image, video, and web traffic classes from Akamai’s CDN. The evaluation shows that RL-Cache improves the hit rate in comparison with the state of the art and imposes only a modest resource overhead on the CDN servers. Further, RL-Cache is robust enough that it can be trained in one location and executed on request traces of the same or different traffic classes in other locations of the same geographic region. The paper also reports extensive analyses of the RL-Cache sensitivity to its features and hyperparameter values. The analyses validate the made design choices and reveal interesting insights into the RL-Cache behavior.
more » « less
Full Text Available
A TTL-based Approach for Data Aggregation in Geo-distributed Streaming Analytics

https://doi.org/10.1145/3309697.3331491

Kumar, Dhruv ; Li, Jian ; Chandra, Abhishek ; Sitaraman, Ramesh K. ( June 2019 , ACM SIGMETRICS 2019)

Full Text Available
A TTL-based Approach for Data Aggregation in Geo-distributed Streaming Analytics

https://doi.org/10.1145/3341617.3326144

Kumar, Dhruv ; Li, Jian ; Chandra, Abhishek ; Sitaraman, Ramesh ( June 2019 , Proceedings of the ACM on Measurement and Analysis of Computing Systems)

Full Text Available
RL-Cache: Learning-Based Cache Admission for Content Delivery

https://doi.org/10.1145/3341216.3342214

Kirilin, Vadim ; Sundarrajan, Aditya ; Gorinsky, Sergey ; Sitaraman, Ramesh K. ( January 2019 , NetAI'19: Proceedings of the 2019 Workshop on Network Meets AI & ML)

Content delivery networks (CDNs) distribute much of the Internet content by caching and serving the objects requested by users. A major goal of a CDN is to maximize the hit rates of its caches, thereby enabling faster content downloads to the users. Content caching involves two components: an admission algorithm to decide whether to cache an object and an eviction algorithm to decide which object to evict from the cache when it is full. In this paper, we focus on cache admission and propose a novel algorithm called RL-Cache that uses model-free reinforcement learning (RL) to decide whether or not to admit a requested object into the CDN's cache. Unlike prior approaches that use a small set of criteria for decision making, RL-Cache weights a large set of features that include the object size, recency, and frequency of access. We develop a publicly available implementation of RL-Cache and perform an evaluation using production traces for the image, video, and web traffic classes from Akamai's CDN. The evaluation shows that RL-Cache improves the hit rate in comparison with the state of the art and imposes only a modest resource overhead on the CDN servers. Further, RL-Cache is robust enough that it can be trained in one location and executed on request traces of the same or different traffic classes in other locations of the same geographic region.
more » « less
Full Text Available
Adaptive TTL-Based Caching for Content Delivery

https://doi.org/10.1109/TNET.2018.2818468

Basu, Soumya ; Sundarrajan, Aditya ; Ghaderi, Javad ; Shakkottai, Sanjay ; Sitaraman, Ramesh ( June 2018 , IEEE/ACM Transactions on Networking)

Full Text Available