Author / Contributor: Aliakbarpour, M. (2); Diakonikolas, I. (1); Kumar, R. (1); Macke, S. (1); Parameswaran, A. (1); Rubinfeld, R. (2)
Aggregating data is fundamental to data analytics, data exploration, and OLAP. Approximate query processing (AQP) techniques are often used to accelerate computation of aggregates using samples, for which confidence intervals (CIs) are widely used to quantify the associated error. CIs used in practice fall into two categories: techniques that are tight but not correct, i.e., they yield tight intervals but only offer asymptotic guarantees, making them unreliable, or techniques that are correct but not tight, i.e., they offer rigorous guarantees but are overly conservative, leading to confidence intervals that are too loose to be useful. In this paper, we develop a CI technique …
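The two categories of CI described above can be illustrated with a minimal sketch. It is not the technique developed in the paper (the abstract is truncated before describing it). It compares a CLT-based interval, which is tight but only asymptotically valid, with a Hoeffding interval, which holds for any sample size but is conservative. The function names, the error probability `delta`, and the assumption that values lie in a known range `[lo, hi]` and are sampled i.i.d. are all illustrative choices, not taken from the source.

```python
import math
from statistics import NormalDist, mean, stdev


def clt_interval(sample, delta):
    """CLT-based CI for the mean: tight, but coverage is only asymptotic."""
    n = len(sample)
    z = NormalDist().inv_cdf(1 - delta / 2)  # two-sided normal quantile
    half_width = z * stdev(sample) / math.sqrt(n)
    center = mean(sample)
    return center - half_width, center + half_width


def hoeffding_interval(sample, delta, lo, hi):
    """Hoeffding CI for the mean: valid for every n, assuming i.i.d.
    values known to lie in [lo, hi] (an assumption of this sketch)."""
    n = len(sample)
    half_width = (hi - lo) * math.sqrt(math.log(2 / delta) / (2 * n))
    center = mean(sample)
    return center - half_width, center + half_width


if __name__ == "__main__":
    # Hypothetical sample: values cluster in 0..9 although the
    # declared range of the column is 0..100.
    sample = [i % 10 for i in range(1000)]
    print("CLT:      ", clt_interval(sample, delta=0.05))
    print("Hoeffding:", hoeffding_interval(sample, delta=0.05, lo=0, hi=100))
```

On data like this, the Hoeffding interval depends only on the declared range and the sample size, so it stays wide even when the observed values vary little. This is the "correct but not tight" behavior the abstract refers to.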
-
Aliakbarpour, M.; Kumar, R.; Rubinfeld, R. (Proceedings of the Thirty-Second Conference on Learning Theory (COLT 2019))
There has been significant study on the sample complexity of testing properties of distributions over large domains. For many properties, it is known that the sample complexity can be substantially smaller than the domain size. For example, over a domain of size n, distinguishing the uniform distribution from distributions that are far from uniform in ℓ1-distance uses only O(√n) samples. However, the picture is very different in the presence of arbitrary noise, even when the amount of noise is quite small. In this case, one must distinguish the case where samples come from a distribution that is ϵ-close to uniform from the case where the distribution is (1−ϵ)-far from uniform. The latter task requires a number of samples nearly linear in n (Valiant, 2008; Valiant and Valiant, 2017a). In this work, we present a noise model that on one hand is more tractable for the testing problem, and on the other hand represents a rich class of noise families. In our model, the noisy distribution is a mixture of the original distribution and noise, where the latter is known to the tester either explicitly or via sample access; the form of the noise is also known a priori. Focusing on the identity and closeness testing problems leads to the following mixture testing question: given samples of distributions p, q1, q2, can we test if p is a mixture of q1 and q2? We consider this general question in various scenarios that differ in terms of how the tester can access the distributions, and show that indeed this problem is more tractable. Our results show that the sample complexities of our testers are exactly the same as for the classical non-mixture case.
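The mixture question can be made concrete with a small sketch. It works on explicitly given probability vectors and is not one of the sample-based testers from the paper. It only illustrates what "p is a mixture of q1 and q2" means, by searching for a mixing weight that minimizes the ℓ1-distance. The function names, the weight `alpha`, and the grid search are assumptions of this example.

```python
def mixture(q1, q2, alpha):
    """Return the distribution alpha * q1 + (1 - alpha) * q2."""
    return [alpha * a + (1 - alpha) * b for a, b in zip(q1, q2)]


def l1_distance(p, q):
    """l1-distance between two distributions on the same domain."""
    return sum(abs(a - b) for a, b in zip(p, q))


def closest_mixture(p, q1, q2, steps=1000):
    """Grid-search the weight alpha in [0, 1] whose mixture of q1 and q2
    is closest to p in l1-distance. Returns (alpha, distance)."""
    best_alpha, best_dist = 0.0, float("inf")
    for i in range(steps + 1):
        alpha = i / steps
        dist = l1_distance(p, mixture(q1, q2, alpha))
        if dist < best_dist:
            best_alpha, best_dist = alpha, dist
    return best_alpha, best_dist


if __name__ == "__main__":
    uniform = [0.25, 0.25, 0.25, 0.25]  # hypothetical original distribution
    noise = [1.0, 0.0, 0.0, 0.0]        # hypothetical known noise
    noisy = mixture(uniform, noise, 0.7)
    print(closest_mixture(noisy, uniform, noise))
    print(closest_mixture([0.0, 0.0, 0.0, 1.0], uniform, noise))
```

A minimum distance near zero indicates that p is (close to) a mixture of q1 and q2, while a large minimum distance indicates that no mixing weight explains p. The testers in the paper address the same question when the distributions are accessible only through samples.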