Title: Progressive and Punctuated Magnetic Mineral Diagenesis: The Rock Magnetic Record of Multiple Fluid Inputs and Progressive Pyritization in a Volcano‐Bounded Basin, IODP Site U1437, Izu Rear Arc
Award ID(s):
1642268
NSF-PAR ID:
10129473
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Journal of Geophysical Research: Solid Earth
Volume:
124
Issue:
6
ISSN:
2169-9313
Page Range / eLocation ID:
5357 to 5378
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. With the growing need for interactive and efficient data analytics and exploration, progressive data processing, and progressive join in particular, has become essential to data science. Join queries are particularly challenging because correlation between the input datasets biases the results towards some join keys. Existing methods carefully control which parts of the input to process in order to improve the quality of progressive results; if the quality is not satisfactory, they process more data to improve it. In this paper, we propose an alternative approach that initially seems counter-intuitive but works surprisingly well. After query processing, we intentionally report fewer results to the user with the goal of improving quality. The key idea is that if the output deviates from the correct distribution, we temporarily hide some results to correct the bias. As we process more data, the hidden results are inserted back until the full dataset is processed. The main challenge is that the correct output distribution is unknown while the progressive query is running. In this work, we formally define the progressive join problem with quality and progressive result rate constraints. We propose an input and output quality-aware progressive join framework (QPJ) that (1) provides input control that decides which parts of the input to process; (2) estimates the final result distribution progressively; (3) automatically controls the quality of the progressive output rate; and (4) combines input and output control to enable quality control of the progressive results. We compare QPJ with existing methods and show that QPJ provides progressive output that represents the final answer better than existing methods.