Search for: All records

Creators/Authors contains: "Kim, Miryung"

« Prev Next »

Total Resources

32

Resource Type
Conference Paper

30

Conference Proceeding

0

Dataset

0

Journal Article

2

Workshop Report

0

Availability
Full Text / Resource Available

31

Citation Only

1

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Co-dependence Aware Fuzzing for Dataflow-Based Big Data Analytics

https://doi.org/10.1145/3611643.3616298

Humayun, Ahmad ; Kim, Miryung ; Gulzar, Muhammad Ali ( November 2023 , Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering)

Free, publicly-accessible full text available November 30, 2024
Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory

Wang, Chenxi ; Qiao, Yifan ; Ma, Haoran ; Liu, Shi ; Chen, Wenguang ; Netravali, Ravi ; Kim, Miryung ; Xu, Harry ( April 2023 , 20th USENIX Symposium on Networked Systems Design and Implementation)

Full Text Available
Hermit: Low-Latency, High-Throughput, and Transparent Remote Memory via Feedback-Directed Asynchrony

Qiao, Yifan ; Wang, Chenxi ; Ruan, Zhenyuan ; Belay, Adam ; Lu, Qingda ; Zhang, Yiying ; Kim, Miryung ; Xu, Harry ( April 2023 , 20th USENIX Symposium on Networked Systems Design and Implementation)

Full Text Available
Hermit: Low-Latency, High-Throughput, and Transparent Remote Memory via Feedback-Directed Asynchrony

Qiao, Yifan ; Wang, Chenxi ; Ruan, Zhenyuan ; Belay, Adam ; Lu, Qingda ; Zhang, Yiying ; Kim, Miryung ; Xu, Guoqing Harry ( January 2023 , NSDI)
Concept-Annotated Examples for Library Comparison

https://doi.org/10.1145/3526113.3545647

Yan, Litao ; Kim, Miryung ; Hartmann, Bjoern ; Zhang, Tianyi ; Glassman, Elena L. ( October 2022 , UIST '22: Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology)

Programmers often rely on online resources—such as code examples, documentation, blogs, and Q&A forums—to compare similar libraries and select the one most suitable for their own tasks and contexts. However, this comparison task is often done in an ad-hoc manner, which may result in suboptimal choices. Inspired by Analogical Learning and Variation Theory, we hypothesize that rendering many concept-annotated code examples from different libraries side-by-side can help programmers (1) develop a more comprehensive understanding of the libraries’ similarities and distinctions and (2) make more robust, appropriate library selections. We designed a novel interactive interface, ParaLib, and used it as a technical probe to explore to what extent many side-by-side concepted-annotated examples can facilitate the library comparison and selection process. A within-subjects user study with 20 programmers shows that, when using ParaLib, participants made more consistent, suitable library selections and provided more comprehensive summaries of libraries’ similarities and differences.
more » « less
Full Text Available
Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory

Wang, Chenxi ; Qiao, Yifan ; Ma, Haoran ; Liu, Shi ; Zhang, Yiying ; Chen, Wenguang ; Netravali, Ravi ; Kim, Miryung ; Xu, Guoqing Harry ( January 2023 , NSDI)
HeteroGen: transpiling C to heterogeneous HLS code with automated test generation and program repair

https://doi.org/10.1145/3503222.3507748

Zhang, Qian ; Wang, Jiyuan ; Xu, Guoqing Harry ; Kim, Miryung ( February 2022 , ASPLOS 2022: Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems)

Despite the trend of incorporating heterogeneity and specialization in hardware, the development of heterogeneous applications is limited to a handful of engineers with deep hardware expertise. We propose HeteroGen that takes C/C++ code as input and automatically generates an HLS version with test behavior preservation and better performance. Key to the success of HeteroGen is adapting the idea of search-based program repair to the heterogeneous computing domain, while addressing two technical challenges. First, the turn-around time of HLS compilation and simulation is much longer than the usual C/C++ compilation and execution time; therefore, HeteroGen applies pattern-oriented program edits guided by common fix patterns and their dependences. Second, behavior and performance checking requires testing, but test cases are often unavailable. Thus, HeteroGen auto-generates test inputs suitable for checking C to HLS-C conversion errors, while providing high branch coverage for the original C code. An evaluation of HeteroGen shows that it produces an HLS-compatible version for nine out of ten real-world heterogeneous applications fully automatically, applying up to 438 lines of edits to produce an HLS version 1.63x faster than the original version.
more » « less
Full Text Available
Mako: a low-pause, high-throughput evacuating collector for memory-disaggregated datacenters

https://doi.org/10.1145/3519939.3523441

Ma, Haoran ; Liu, Shi ; Wang, Chenxi ; Qiao, Yifan ; Bond, Michael D. ; Blackburn, Stephen M. ; Kim, Miryung ; Xu, Guoqing Harry ( June 2022 , ACM SIGPLAN International Conference on Programming Language Design and Implementation (PLDI))

Full Text Available
OptDebug: Fault-Inducing Operation Isolation for Dataflow Applications

https://doi.org/10.1145/3472883.3487016

Gulzar, Muhammad Ali ; Kim, Miryung ( November 2021 , ACM Symposium on Cloud Computing 2021)

Fault-isolation is extremely challenging in large scale data processing in cloud environments. Data provenance is a dominant existing approach to isolate data records responsible for a given output. However, data provenance concerns fault isolation only in the data-space, as opposed to fault isolation in the code-space---how can we precisely localize operations or APIs responsible for a given suspicious or incorrect result? We present OptDebug that identifies fault-inducing operations in a dataflow application using three insights. First, debugging is easier with a small-scale input than a large-scale input. So it uses data provenance to simplify the original input records to a smaller set leading to test failures and test successes. Second, keeping track of operation provenance is crucial for debugging. Thus, it leverages automated taint analysis to propagate the lineage of operations downstream with individual records. Lastly, each operation may contribute to test failures to a different degree. Thus OptDebug ranks each operation's spectra---the relative participation frequency in failing vs. passing tests. In our experiments, OptDebug achieves 100% recall and 86% precision in terms of detecting faulty operations and reduces the debugging time by 17x compared to a naïve approach. Overall, OptDebug shows great promise in improving developer productivity in today's complex data processing pipelines by obviating the need to re-execute the program repetitively with different inputs and manually examine program traces to isolate buggy code.
more » « less
Full Text Available
Sibylvariant Transformations for Robust Text Classification

https://doi.org/10.18653/v1/2022.findings-acl.140

Harel-Canada, Fabrice ; Gulzar, Muhammad Ali ; Peng, Nanyun ; Kim, Miryung ( January 2022 , Findings of the Association for Computational Linguistics: ACL 2022)

The vast majority of text transformation techniques in NLP are inherently limited in their ability to expand input space coverage due to an implicit constraint to preserve the original class label. In this work, we propose the notion of sibylvariance (SIB) to describe the broader set of transforms that relax the label-preserving constraint, knowably vary the expected class, and lead to significantly more diverse input distributions. We offer a unified framework to organize all data transformations, including two types of SIB: (1) Transmutations convert one discrete kind into another, (2) Mixture Mutations blend two or more classes together. To explore the role of sibylvariance within NLP, we implemented 41 text transformations, including several novel techniques like Concept2Sentence and SentMix. Sibylvariance also enables a unique form of adaptive training that generates new input mixtures for the most confused class pairs, challenging the learner to differentiate with greater nuance. Our experiments on six benchmark datasets strongly support the efficacy of sibylvariance for generalization performance, defect detection, and adversarial robustness.
more » « less
Full Text Available

« Prev Next »