NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Efficient Scheduling Policies for Microsecond-Scale Tasks

Sarah McClure, Amy Ousterhout (April 2022, USENIX Symposium on Networked Systems Design and Implementation)

Full Text Available
Remote Memory Calls

https://doi.org/10.1145/3422604.3425923

Amaro, Emmanuel; Luo, Zhihong; Ousterhout, Amy; Krishnamurthy, Arvind; Panda, Aurojit; Ratnasamy, Sylvia; Shenker, Scott (January 2021, HotNets 2021)
null (Ed.)
Full Text Available
On the Use of ML for Blackbox System Performance Prediction

Fu, Silvery; Gupta, Saurabh; Mittal, Radhika; Ratnasamy, Sylvia (January 2021, NSDI)
null (Ed.)
Full Text Available
Bertha: Tunneling through the Network API

https://doi.org/10.1145/3422604.3425927

Narayan, Akshay; Panda, Aurojit; Alizadeh, Mohammad; Balakrishnan, Hari; Krishnamurthy, Arvind; Shenker, Scott (November 2020, HotNets 2020)
null (Ed.)
Full Text Available
Fast and Efficient Container Startup at the Edge via Dependency Scheduling

Fu, Silvery; Mittal, Radhika; Zhang, Lei; Ratnasamy, Sylvia (June 2020, HotEdge)

Full Text Available
Can far memory improve job throughput?

https://doi.org/10.1145/3342195.3387522

Amaro, Emmanuel; Branner-Augmon, Christopher; Luo, Zhihong; Ousterhout, Amy; Aguilera, Marcos K.; Panda, Aurojit; Ratnasamy, Sylvia; Shenker, Scott (April 2020, EuroSys ’20)

As memory requirements grow, and advances in memory technology slow, the availability of sufficient main memory is increasingly the bottleneck in large compute clusters. One solution to this is memory disaggregation, where jobs can remotely access memory on other servers, or far memory. This paper first presents faster swapping mechanisms and a far memory-aware cluster scheduler that make it possible to support far memory at rack scale. Then, it examines the conditions under which this use of far memory can increase job throughput. We find that while far memory is not a panacea, for memory-intensive workloads it can provide performance improvements on the order of 10% or more even without changing the total amount of memory available.
more » « less
Full Text Available
Datacenter Congestion Control: Identifying what is essential and making it practical

Mushtaq, Aisha; Mittal, Radhika; McCauley, James; Alizadeh, Mohammad; Ratnasamy, Sylvia; Shenker, Scott (January 2019, Computer communication review)

Recent years have seen a slew of papers on datacenter congestion control mechanisms. In this editorial, we ask whether the bulk of this research is needed for the common case where congestion control involves hosts responding to simple congestion signals from the network and the performance goal is reducing some average measure of Flow Completion Time. We raise this question because we find that, out of all the possible variations one could make in congestion control algorithms, the most essential feature is the switch scheduling algorithm. More specifically, we find that congestion control mechanisms that use Shortest-Remaining-Processing-Time (SRPT) achieve superior performance as long as the rate-setting algorithm at the host is reasonable. We further find that while SRPT’s performance is quite robust to host behaviors, the performance of schemes that use scheduling algorithms like FIFO or Fair Queuing depend far more crucially on the rate-setting algorithm, and their performance is typically worse than what can be achieved with SRPT. Given these findings, we then ask whether it is practical to realize SRPT in switches without requiring custom hardware. We observe that approximate and deployable SRPT (ADS) designs exist, which leverage the small number of priority queues supported in almost all commodity switches, and require only software changes in the host and the switches. Our evaluations with one very simple ADS design shows that it can achieve performance close to true SRPT and is significantly better than FIFO. Thus, the answer to our basic question – whether the bulk of recent research on datacenter congestion control algorithms is needed for the common case – is no.
more » « less
Full Text Available
THOUGHTS ON LOAD DISTRIBUTION AND THE ROLE OF PROGRAMMABLE SWITCHES

McCauley, James; Panda, Aurojit; Krishnamurthy, Arvind; Shenker, Scott (January 2019, Computer communication review)

The trend towards powerfully programmable network switching hardware has led to much discussion of the exciting new ways in which it can be used. In this paper, we take a step back, and examine how it should be used.
more » « less
Full Text Available
Revisiting network support for RDMA

https://doi.org/10.1145/3230543.3230557

Mittal, Radhika; Shpiner, Alexander; Panda, Aurojit; Zahavi, Eitan; Krishnamurthy, Arvind; Ratnasamy, Sylvia; Shenker, Scott (August 2018, SIGCOMM 2018)

Full Text Available
Monotasks: Architecting for Performance Clarity in Data Analytics Frameworks

https://doi.org/10.1145/3132747.3132766

Ousterhout, Kay; Canel, Christopher; Ratnasamy, Sylvia; Shenker, Scott (October 2017, SOSP 2017)

Full Text Available

Search for: All records