NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Nerve-to-cancer transfer of mitochondria during cancer metastasis

https://doi.org/10.1038/s41586-025-09176-8

Hoover, Gregory; Gilbert, Shila; Curley, Olivia; Obellianne, Clémence; Lin, Mike T; Hixson, William; Pierce, Terry W; Andrews, Joel F; Alexeyev, Mikhail F; Ding, Yi; et al (June 2025, Nature)

Free, publicly-accessible full text available June 25, 2026
SmallMap: Low-cost Community Road Map Sensing with Uncertain Delivery Behavior

https://doi.org/10.1145/3659596

Hong, Zhiqing; Wang, Haotian; Ding, Yi; Wang, Guang; He, Tian; Zhang, Desheng (May 2024, Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies)

Accurate road networks play a crucial role in modern mobile applications such as navigation and last-mile delivery. Most existing studies primarily focus on generating road networks in open areas like main roads and avenues, but little attention has been given to the generation of community road networks in closed areas such as residential areas, which becomes more and more significant due to the growing demand for door-to-door services such as food delivery. This lack of research is primarily attributed to challenges related to sensing data availability and quality. In this paper, we design a novel framework called SmallMap that leverages ubiquitous multi-modal sensing data from last-mile delivery to automatically generate community road networks with low costs. Our SmallMap consists of two key modules: (1) a Trajectory of Interest Detection module enhanced by exploiting multi-modal sensing data collected from the delivery process; and (2) a Dual Spatio-temporal Generative Adversarial Network module that incorporates Trajectory of Interest by unsupervised road network adaptation to generate road networks automatically. To evaluate the effectiveness of SmallMap, we utilize a two-month dataset from one of the largest logistics companies in China. The extensive evaluation results demonstrate that our framework significantly outperforms state-of-the-art baselines, achieving a precision of 90.5%, a recall of 87.5%, and an F1-score of 88.9%, respectively. Moreover, we conduct three case studies in Beijing City for courier workload estimation, Estimated Time of Arrival (ETA) in last-mile delivery, and fine-grained order assignment.
more » « less
Full Text Available
Statistical Learning for Individualized Asset Allocation

https://doi.org/10.1080/01621459.2022.2139265

Ding, Yi; Li, Yingying; Song, Rui (January 2024, Journal of the American Statistical Association)

Full Text Available
Turaco: Complexity-Guided Data Sampling for Training Neural Surrogates of Programs

https://doi.org/10.1145/3622856

Renda, Alex; Ding, Yi; Carbin, Michael (October 2023, Proceedings of the ACM on Programming Languages)

Programmers and researchers are increasingly developing surrogates of programs, models of a subset of the observable behavior of a given program, to solve a variety of software development challenges. Programmers train surrogates from measurements of the behavior of a program on a dataset of input examples. A key challenge of surrogate construction is determining what training data to use to train a surrogate of a given program. We present a methodology for sampling datasets to train neural-network-based surrogates of programs. We first characterize the proportion of data to sample from each region of a program's input space (corresponding to different execution paths of the program) based on the complexity of learning a surrogate of the corresponding execution path. We next provide a program analysis to determine the complexity of different paths in a program. We evaluate these results on a range of real-world programs, demonstrating that complexity-guided sampling results in empirical improvements in accuracy.
more » « less
Impact of Annotator Demographics on Sentiment Dataset Labeling

https://doi.org/10.1145/3555632

Ding, Yi; You, Jacob; Machulla, Tonja-Katrin; Jacobs, Jennifer; Sen, Pradeep; Höllerer, Tobias (November 2022, Proceedings of the ACM on Human-Computer Interaction)

As machine learning methods become more powerful and capture more nuances of human behavior, biases in the dataset can shape what the model learns and is evaluated on. This paper explores and attempts to quantify the uncertainties and biases due to annotator demographics when creating sentiment analysis datasets. We ask >1000 crowdworkers to provide their demographic information and annotations for multimodal sentiment data and its component modalities. We show that demographic differences among annotators impute a significant effect on their ratings, and that these effects also occur in each component modality. We compare predictions of different state-of-the-art multimodal machine learning algorithms against annotations provided by different demographic groups, and find that changing annotator demographics can cause >4.5 in accuracy difference when determining positive versus negative sentiment. Our findings underscore the importance of accounting for crowdworker attributes, such as demographics, when building datasets, evaluating algorithms, and interpreting results for sentiment analysis.
more » « less
Full Text Available
NURD: Negative-Unlabeled Learning for Online Datacenter Straggler Prediction

Ding, Yi; Rao, Avinash; Song, Hyebin; Willett, Rebecca; Hoffmann, Henry (August 2022, Conference on Machine Learning and Systems)

Full Text Available
NURD: Negative-Unlabeled Learning for Online Datacenter Straggler Prediction

Ding, Yi; Rao, Avinash; Song, Hyebin; Willett, Rebecca; Hoffmann, Henry (July 2022, Proceedings of Machine Learning and Systems 4 (MLSys 2022))

Full Text Available
CoMiner: nationwide behavior-driven unsupervised spatial coordinate mining from uncertain delivery events

https://doi.org/10.1145/3557915.3560944

Hong, Zhiqing; Wang, Guang; Lyu, Wenjun; Guo, Baoshen; Ding, Yi; Wang, Haotian; Wang, Shuai; Liu, Yunhuai; Zhang, Desheng (November 2022, ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems 2022 (ACM SIGSPATIAL 2022))

Full Text Available
CAFQA: A Classical Simulation Bootstrap for Variational Quantum Algorithms

https://doi.org/10.1145/3567955.3567958

Ravi, Gokul Subramanian; Gokhale, Pranav; Ding, Yi; Kirby, William; Smith, Kaitlin; Baker, Jonathan M.; Love, Peter J.; Hoffmann, Henry; Brown, Kenneth R.; Chong, Frederic T. (December 2022, Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1 (ASPLOS 2023))

Classical computing plays a critical role in the advancement of quantum frontiers in the NISQ era. In this spirit, this work uses classical simulation to bootstrap Variational Quantum Algorithms (VQAs). VQAs rely upon the iterative optimization of a parameterized unitary circuit (ansatz) with respect to an objective function. Since quantum machines are noisy and expensive resources, it is imperative to classically choose the VQA ansatz initial parameters to be as close to optimal as possible to improve VQA accuracy and accelerate their convergence on today’s devices. This work tackles the problem of finding a good ansatz initialization, by proposing CAFQA, a Clifford Ansatz For Quantum Accuracy. The CAFQA ansatz is a hardware-efficient circuit built with only Clifford gates. In this ansatz, the parameters for the tunable gates are chosen by searching efficiently through the Clifford parameter space via classical simulation. The resulting initial states always equal or outperform traditional classical initialization (e.g., Hartree-Fock), and enable high-accuracy VQA estimations. CAFQA is well-suited to classical computation because: a) Clifford-only quantum circuits can be exactly simulated classically in polynomial time, and b) the discrete Clifford space is searched efficiently via Bayesian Optimization. For the Variational Quantum Eigensolver (VQE) task of molecular ground state energy estimation (up to 18 qubits), CAFQA’s Clifford Ansatz achieves a mean accuracy of nearly 99% and recovers as much as 99.99% of the molecular correlation energy that is lost in Hartree-Fock initialization. CAFQA achieves mean accuracy improvements of 6.4x and 56.8x, over the state-of-the-art, on different metrics. The scalability of the approach allows for preliminary ground state energy estimation of the challenging chromium dimer (Cr2) molecule. With CAFQA’s high-accuracy initialization, the convergence of VQAs is shown to accelerate by 2.5x, even for small molecules. Furthermore, preliminary exploration of allowing a limited number of non-Clifford (T) gates in the CAFQA framework, shows that as much as 99.9% of the correlation energy can be recovered at bond lengths for which Clifford-only CAFQA accuracy is relatively limited, while remaining classically simulable.
more » « less
Full Text Available
Direct detection of molecular hydrogen upon p- and n-doping of organic semiconductors with complex oxidants or reductants

https://doi.org/10.1039/D3TA00231D

Pallini, Francesca; Mattiello, Sara; Manfredi, Norberto; Mecca, Sara; Fedorov, Alexey; Sassi, Mauro; Al Kurdi, Khaled; Ding, Yi-Fan; Pan, Chen-Kai; Pei, Jian; et al (April 2023, Journal of Materials Chemistry A)

Molecular doping can increase the conductivity of organic semiconductors and plays an increasingly important role in emerging and established plastic electronics applications. 4-(1,3-Dimethyl-2,3-dihydro-1 H -benzimidazol-2-yl)- N , N -dimethylaniline (N-DMBI-H) and tris(pentafluorophenyl)borane (BCF) are established n- and p-dopants, respectively, but neither functions as a simple one-electron redox agent. Molecular hydrogen has been suggested to be a byproduct in several proposed mechanisms for doping using both N-DMBI-H and BCF. In this paper we show for the first time the direct detection of molecular hydrogen in the uncatalysed doping of a variety of polymeric and molecular semiconductors using these dopants. Our results provide insight into the doping mechanism, providing information complementary to that obtained from more commonly applied methods such as optical, electron spin resonance, and electrical measurements.
more » « less
Full Text Available

« Prev Next »

Search for: All records