Skyway: Connecting Managed Heaps in Distributed Big Data Systems

Nguyen, Khanh; Fang, Lu; Navasca, Christian; Xu, Guoqing; Demsky, Brian; Lu, Shan

doi:10.1145/3173162.3173200

Citation Details

Skyway: Connecting Managed Heaps in Distributed Big Data Systems

Managed languages such as Java and Scala are prevalently used in development of large-scale distributed systems. Under the managed runtime, when performing data transfer across machines, a task frequently conducted in a Big Data system, the system needs to serialize a sea of objects into a byte sequence before sending them over the network. The remote node receiving the bytes then deserializes them back into objects. This process is both performance-inefficient and labor-intensive: (1) object serialization/deserialization makes heavy use of reflection, an expensive runtime operation and/or (2) serialization/deserialization functions need to be hand-written and are error-prone. This paper presents Skyway, a JVM-based technique that can directly connect managed heaps of different (local or remote) JVM processes. Under Skyway, objects in the source heap can be directly written into a remote heap without changing their formats. Skyway provides performance benefits to any JVM-based system by completely eliminating the need (1) of invoking serialization/deserialization functions, thus saving CPU time, and (2) of requiring developers to hand-write serialization functions. more »

Award ID(s):: 1703598

PAR ID:: 10079573

Author(s) / Creator(s):: Nguyen, Khanh; Fang, Lu; Navasca, Christian; Xu, Guoqing; Demsky, Brian; Lu, Shan

Date Published:: 2018-03-01

Journal Name:: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems

Page Range / eLocation ID:: 56 to 69

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3173162.3173200

More Like this