Search for: All records

Creators/Authors contains: "Jin, Di"

Note: Clicking a Digital Object Identifier (DOI) takes you to an external site maintained by the publisher. Some full-text articles may not be available free of charge during the publisher's embargo period.

Some links on this page may take you to non-federal websites. Their policies may differ from those of this site.

  1. The OS kernel is at the forefront of a system's security, so its own security is crucial to the correctness and integrity of user applications. With a plethora of bugs continuously discovered in OS kernel code, defenses and mitigations are essential for practical kernel security. One important defense strategy is to isolate user-controlled memory from kernel-accessible memory, in order to mitigate attacks like ret2usr and ret2dir. We present EPF (Evil Packet Filter): a new method for bypassing various (both deployed and proposed) kernel isolation techniques by abusing the BPF infrastructure of the Linux kernel, i.e., by leveraging BPF code, provided by unprivileged users/programs, as attack payloads. We demonstrate two different EPF instances, namely BPF-Reuse and BPF-ROP, which utilize malicious BPF payloads to mount privilege-escalation attacks on both 32- and 64-bit x86 platforms. We also present the design, implementation, and evaluation of a set of defenses that enforce the isolation of BPF instructions from benign kernel data and the integrity of BPF program execution, effectively providing protection against EPF-based attacks. Our implemented defenses show minimal overhead (<3%) in BPF-heavy tasks.
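The channel EPF abuses is the kernel's willingness to accept BPF programs from unprivileged code. Purely to illustrate that channel (none of the paper's payloads are shown), the following sketch attaches a one-instruction classic-BPF filter to a socket via the Linux SO_ATTACH_FILTER socket option; the constant's value (26) is Linux-specific and is spelled out here because Python's socket module does not export it.

```python
import ctypes
import socket
import struct

# Linux-specific setsockopt option; not exported by Python's socket module.
SO_ATTACH_FILTER = 26

# One classic-BPF instruction, struct sock_filter { u16 code; u8 jt; u8 jf; u32 k; }:
#   ret #0   -> accept 0 bytes of every packet, i.e. drop everything
BPF_RET, BPF_K = 0x06, 0x00
insn = struct.pack("HBBI", BPF_RET | BPF_K, 0, 0, 0)

# struct sock_fprog { u16 len; struct sock_filter *filter; } -- native
# struct alignment inserts the required padding before the pointer.
buf = ctypes.create_string_buffer(insn)  # must stay alive while attached
fprog = struct.pack("HL", 1, ctypes.addressof(buf))

s = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
s.setsockopt(socket.SOL_SOCKET, SO_ATTACH_FILTER, fprog)
# At this point an unprivileged process has handed BPF instructions to the
# kernel; EPF builds its attack payloads on exactly this kind of channel.
```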
  2. While most network embedding techniques model the proximity between nodes in a network, there has recently been significant interest in structural embeddings that are based on node equivalences, a notion rooted in sociology: equivalences or positions are collections of nodes that have similar roles (i.e., similar functions, ties, or interactions with nodes in other positions) irrespective of their distance or reachability in the network. Unlike the proximity-based methods that are rigorously evaluated in the literature, the evaluation of structural embeddings is less mature. It relies on small synthetic or real networks with labels that are not perfectly defined, and its connection to sociological equivalences has hitherto been vague and tenuous. With new node embedding methods being developed at a breakneck pace, proper evaluation and systematic characterization of existing approaches will be essential to progress. To fill this gap, we set out to understand what types of equivalences structural embeddings capture. We are the first to contribute a rigorous intrinsic and extrinsic evaluation methodology for structural embeddings, along with carefully designed, diverse datasets of varying sizes. We identify a number of evaluation variables that can lead to different results (e.g., the choice of similarity measure, classifier, and label definitions). We find that degree distributions within nodes' local neighborhoods can lead to simple yet effective baselines in their own right and can guide the future development of structural embeddings. We hope that our findings will influence the design of future node embedding methods and also pave the way for more comprehensive and fair evaluation of structural embedding methods.
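The abstract does not spell out the exact form of the degree-based baseline it alludes to. A minimal sketch of one plausible form, assuming each node is embedded as a normalized histogram of the degrees seen in its r-hop neighborhood (the function name and parameters are illustrative, not the authors'), could look like this:

```python
import networkx as nx
import numpy as np

def degree_histogram_embedding(G, radius=1, bins=10):
    """Hypothetical baseline: embed each node as a normalized histogram
    of the degrees observed in its radius-hop ego network."""
    max_deg = max(dict(G.degree()).values())
    edges = np.linspace(0, max_deg, bins + 1)   # shared histogram bins
    emb = {}
    for v in G.nodes():
        hood = nx.ego_graph(G, v, radius=radius).nodes()
        degs = [G.degree(u) for u in hood]
        hist, _ = np.histogram(degs, bins=edges)
        emb[v] = hist / max(hist.sum(), 1)      # normalize to a distribution
    return emb

# Nodes with similar local degree profiles (similar structural roles)
# get similar vectors, regardless of where they sit in the graph.
G = nx.barbell_graph(5, 2)
emb = degree_histogram_embedding(G)
```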
  3. Multi-source entity linkage focuses on integrating knowledge from multiple sources by linking the records that represent the same real-world entity. This is critical in high-impact applications such as data cleaning and user stitching. State-of-the-art entity linkage pipelines mainly depend on supervised learning, which requires abundant training data. However, collecting well-labeled training data becomes expensive when data from many sources arrives incrementally over time. Moreover, the trained models can easily overfit to specific data sources and thus fail to generalize to new sources, due to significant differences in data and label distributions. To address these challenges, we present AdaMEL, a deep transfer-learning framework that learns generic high-level knowledge to perform multi-source entity linkage. AdaMEL models the attribute importance that is used to match entities through an attribute-level self-attention mechanism, and leverages massive unlabeled data from new data sources through domain adaptation to make it generic and data-source agnostic. In addition, AdaMEL can incorporate an additional set of labeled data to more accurately integrate data sources with different attribute importance. Extensive experiments show that our framework achieves state-of-the-art results, with an 8.21% improvement on average over methods based on supervised learning. Moreover, it is more stable when handling different sets of data sources and requires less runtime.
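AdaMEL's exact architecture is not given in the abstract; the numpy sketch below only illustrates the idea of attribute-level attention, scoring each aligned attribute pair of two records and weighting its comparison features accordingly. All names and functional forms here are assumptions, not the paper's model.

```python
import numpy as np

def attribute_attention(a_left, a_right, W, q):
    """Sketch of attribute-level self-attention in the spirit of AdaMEL.
    a_left, a_right: (n_attrs, d) embeddings of a record pair's aligned
    attribute values (e.g., name, address, email), one row per attribute."""
    # Score each attribute by how informative its comparison is.
    diff = np.abs(a_left - a_right)        # (n_attrs, d) comparison features
    scores = np.tanh(diff @ W) @ q         # (n_attrs,) unnormalized scores
    alpha = np.exp(scores - scores.max())
    alpha /= alpha.sum()                   # softmax attention over attributes
    # The pair representation is the attention-weighted comparison.
    return alpha, alpha @ diff             # weights, (d,) pair embedding

rng = np.random.default_rng(0)
d, n_attrs = 8, 3
W, q = rng.normal(size=(d, d)), rng.normal(size=d)
left, right = rng.normal(size=(n_attrs, d)), rng.normal(size=(n_attrs, d))
alpha, pair_vec = attribute_attention(left, right, W, q)
```

A downstream matcher would then classify `pair_vec` as match/non-match; the learned weights `alpha` are what makes attribute importance explicit and transferable across sources.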
  4. Following the significant coastal changes caused by Hurricane Sandy in 2012, engineered berm-dunes were constructed along the New Jersey coastline to enhance protection from future storms. After construction, property values on Long Beach Island, NJ, increased in three beachfront communities. The projects were financed entirely through federal disaster assistance, but a percentage of future maintenance costs must be covered by local communities. Whether communities are willing or able to contribute financially to maintenance remains unclear because (i) some homeowners prefer ocean views over the protection afforded by the berm-dune structures, and (ii) stakeholder risk perceptions can change over time. To investigate the relationships between berm-dune geometries, the value of coastal protection, and the value of ocean views, we developed a geo-economic model of the natural and anthropogenic processes that shape beach and dune morphology. The model results suggest that coastal communities may differ significantly in their ability to maintain engineered dunes, depending on stakeholder wealth and risk perception. In particular, communities with strong preferences for ocean views are less likely to maintain large-scale berm-dune structures over the long term. If these structures are abandoned, the vulnerability of the coast to future storms will increase.
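The authors' geo-economic model is not reproduced here; the toy below merely illustrates the tension the abstract describes, with assumed functional forms: taller dunes add protection value but subtract view value, and a community prefers the height that maximizes its net benefit after maintenance costs.

```python
import numpy as np

def community_net_benefit(h, view_weight, protect_weight, maint_cost_per_m=1.0):
    """Illustrative-only utility of dune height h (all forms are assumptions)."""
    view = view_weight * np.exp(-h)                 # ocean views fade as the dune rises
    protection = protect_weight * (1 - np.exp(-h))  # expected storm damage avoided
    return view + protection - maint_cost_per_m * h

heights = np.linspace(0.0, 5.0, 51)
for w_view, w_prot, label in [(1.0, 6.0, "risk-averse"),
                              (6.0, 1.0, "view-loving")]:
    best = heights[np.argmax(community_net_benefit(heights, w_view, w_prot))]
    print(f"{label} community prefers dune height ~{best:.1f} m")
# The view-loving community's optimum sits at zero: it will tend not to
# maintain a large berm-dune, consistent with the abstract's conclusion.
```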
  5. Liane Lewin-Eytan, David Carmel (Eds.)
    Graph convolutional networks (GCNs), which obtain node embeddings by integrating high-order neighborhood information through stacked graph convolution layers, have demonstrated great power in many network analysis tasks such as node classification and link prediction. However, a fundamental weakness of GCNs, namely topological limitations such as over-smoothing and the local homophily of topology, limits their ability to represent networks. Existing studies addressing these topological limitations typically focus only on the convolution of features over the network topology, which inevitably relies heavily on network structure. Moreover, most networks are text-rich, so it is important to integrate not only document-level information but also local text information, which is particularly significant yet often ignored by existing methods. To overcome these limitations, we propose BiTe-GCN, a novel GCN architecture based on bidirectional convolution of topology and features on text-rich networks. Specifically, we first transform the original text-rich network into an augmented bi-typed heterogeneous network that captures both the global document-level information and the local text-sequence information from the texts. We then introduce discriminative convolution mechanisms that perform convolution on this augmented bi-typed network, realizing the convolutions of topology and features together in the same system and automatically learning the contributions of these two parts (i.e., the network part and the text part) for the given learning objectives. Extensive experiments on text-rich networks demonstrate that our new architecture outperforms the state of the art by a clear margin. Moreover, this architecture can also be applied to e-commerce search scenarios such as JD search, and experiments on the JD dataset show the superiority of the proposed architecture over the baseline methods.
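As a rough illustration of the augmentation step only (not the actual BiTe-GCN convolution, which the abstract describes just at a high level), the sketch below builds a bi-typed graph of document and word nodes and runs one standard GCN propagation over the joint adjacency; all identifiers are assumed.

```python
import numpy as np

def augment(doc_adj, doc_words, n_words):
    """Joint adjacency over n_docs document nodes + n_words word nodes."""
    n_docs = doc_adj.shape[0]
    A = np.zeros((n_docs + n_words, n_docs + n_words))
    A[:n_docs, :n_docs] = doc_adj                 # original network topology
    for d, words in enumerate(doc_words):         # document--word edges
        for w in words:
            A[d, n_docs + w] = A[n_docs + w, d] = 1.0
    return A

def gcn_layer(A, H, W):
    """One symmetric-normalized GCN propagation: relu(D^-1/2 A_hat D^-1/2 H W)."""
    A_hat = A + np.eye(A.shape[0])                # add self-loops
    d = A_hat.sum(1)
    A_norm = A_hat / np.sqrt(np.outer(d, d))
    return np.maximum(A_norm @ H @ W, 0.0)

doc_adj = np.array([[0, 1], [1, 0]], dtype=float)  # two linked documents
doc_words = [[0, 1], [1, 2]]                       # word ids occurring in each doc
A = augment(doc_adj, doc_words, n_words=3)
rng = np.random.default_rng(0)
H = rng.normal(size=(5, 4))                        # initial doc + word features
H1 = gcn_layer(A, H, rng.normal(size=(4, 4)))      # embeddings after one layer
```

In this augmented graph, information flows both through document-document links (topology) and through shared words (text), which is the intuition behind convolving the two jointly.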