Chest X-rays are commonly used for diagnosing and characterizing lung diseases, but the complex morphological patterns in radiographic appearances can challenge clinicians in making accurate diagnoses. To address this challenge, various learning methods have been developed for algorithm-aided disease detection and automated diagnosis. However, most existing methods fail to account for the heterogeneous variability in longitudinal imaging records and the presence of missing or inconsistent temporal data. In this paper, we propose a novel longitudinal learning framework that enriches inconsistent imaging data over sequential time points by leveraging 2D Principal Component Analysis (2D-PCA) and a robust adaptive loss function. We also derive an efficient solution algorithm that ensures both objective and sequence convergence for the non-convex optimization problem. Our experiments on the CheXpert dataset demonstrate improved performance in capturing indicative abnormalities in medical images and achieving satisfactory diagnoses. We believe that our method will be of significant interest to the research community working on medical image analysis.
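The abstract does not spell out the 2D-PCA step itself. As background, classical 2D-PCA (the building block the framework extends) projects each image matrix onto the top eigenvectors of an image covariance matrix, avoiding the vectorization used by ordinary PCA. A minimal NumPy sketch of that baseline follows; the paper's robust adaptive loss and longitudinal enrichment are not reproduced here.

```python
import numpy as np

def two_d_pca(images, k):
    """Classical 2D-PCA: project each image onto the top-k eigenvectors
    of the image covariance matrix G = E[(X - M)^T (X - M)]."""
    mean = images.mean(axis=0)
    centered = images - mean
    # Image covariance: average of X^T X over the centered image stack
    G = np.einsum('nij,nik->jk', centered, centered) / len(images)
    vals, vecs = np.linalg.eigh(G)   # eigenvalues in ascending order
    W = vecs[:, ::-1][:, :k]         # top-k projection directions
    return np.stack([x @ W for x in images]), W
```

Each h-by-w image is reduced to an h-by-k feature matrix; the robust variants in the paper replace the implicit squared-Frobenius loss with an adaptive one.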
-
The COVID-19 pandemic caused by SARS-CoV-2 has emphasized the importance of studying virus-host protein-protein interactions (PPIs) and drug-target interactions (DTIs) to discover effective antiviral drugs. While several computational algorithms have been developed for this purpose, most of them overlook the interplay pathways during infection along PPIs and DTIs. In this paper, we present a novel multipartite graph learning approach to uncover hidden binding affinities in PPIs and DTIs. Our method leverages a comprehensive biomolecular mechanism network that integrates protein-protein, genetic, and virus-host interactions, enabling us to learn a new graph that accurately captures the underlying connected components. Notably, our method identifies clustering structures directly from the new graph, eliminating the need for post-processing steps. To mitigate the detrimental effects of noisy or outlier data in sparse networks, we propose a robust objective function that incorporates the L2,p-norm and a constraint based on the pth-order Ky-Fan norm applied to the graph Laplacian matrix. Additionally, we present an efficient optimization method tailored to our framework. Experimental results demonstrate the superiority of our approach over existing state-of-the-art techniques, as it successfully identifies potential repurposable drugs for SARS-CoV-2, offering promising therapeutic options for COVID-19 treatment.
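The two robustness ingredients named here have compact definitions worth making explicit: the L2,p-norm sums row-wise L2 norms raised to the power p (down-weighting outlier rows for p < 2), and the sum of the k smallest eigenvalues of a graph Laplacian (a Ky-Fan-style quantity) is zero exactly when the graph has at least k connected components, which is what lets clusters be read off the learned graph without post-processing. A sketch of both quantities, assuming a symmetric Laplacian; this is illustrative background, not the paper's full objective:

```python
import numpy as np

def l2p_norm(E, p=0.5):
    # sum over rows of ||e_i||_2^p; p < 2 suppresses outlier rows
    row_norms = np.linalg.norm(E, axis=1)
    return np.sum(row_norms ** p)

def kyfan_smallest(L, k):
    # sum of the k smallest eigenvalues of a symmetric graph Laplacian;
    # equals zero iff the graph has at least k connected components
    vals = np.linalg.eigvalsh(L)
    return vals[:k].sum()
```

Constraining `kyfan_smallest(L, k)` to zero during graph learning forces the learned similarity graph into exactly k clusters.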
-
Graphical representations are essential for comprehending high-dimensional data across diverse fields, yet their construction often presents challenges due to the limitations of traditional methods. This paper introduces a novel methodology, Beyond Simplex Sparse Representation (BSSR), which addresses critical issues such as parameter dependencies, scale inconsistencies, and biased data interpretation in constructing similarity graphs. BSSR leverages the robustness of sparse representation to noise and outliers, while incorporating deep learning techniques to enhance scalability and accuracy. Furthermore, we tackle optimization over the standard simplex, a pervasive problem, by introducing an approach that converts the constraint set into a smooth manifold using the Hadamard parametrization. Our proposed Tangent Perturbed Riemannian Gradient Descent (T-PRGD) algorithm provides an efficient and scalable solution for optimization problems with standard simplex or L1-norm sphere constraints. These contributions, including the BSSR methodology, robustness and scalability through deep representation, shift-invariant sparse representation, and optimization on the unit sphere, represent major advancements in the field. Our work offers novel perspectives on data representation challenges and sets the stage for more accurate analysis in the era of big data.
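The Hadamard parametrization referred to here is the substitution x = u ⊙ u: if u lies on the unit L2 sphere, then x automatically lies on the probability simplex (nonnegative, summing to one), turning a constrained problem into smooth optimization on a sphere. A minimal sketch of the parametrization and a plain Riemannian gradient step (tangent-space projection followed by retraction); the tangent perturbation that distinguishes T-PRGD is omitted:

```python
import numpy as np

def simplex_via_hadamard(u):
    # x = u ⊙ u lies on the probability simplex whenever ||u||_2 = 1
    u = u / np.linalg.norm(u)
    return u * u

def riemannian_grad_step(u, egrad, lr=0.1):
    # project the Euclidean gradient onto the sphere's tangent space at u,
    # take a descent step, then retract back onto the unit sphere
    rgrad = egrad - (u @ egrad) * u
    v = u - lr * rgrad
    return v / np.linalg.norm(v)
```

Optimizing over u on the sphere and reading the solution off as u ⊙ u sidesteps the projection-onto-simplex subroutine that traditional methods require.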
-
Linear discriminant analysis (LDA) is widely used for dimensionality reduction under supervised learning settings. The traditional LDA objective minimizes a ratio of squared Euclidean distances, which may not perform optimally on noisy data sets. Multiple robust LDA objectives have been proposed to address this problem, but their implementations have two major limitations. One is that their mean calculations use the squared l2-norm distance to center the data, which is not valid when the objective does not use the Euclidean distance. The second is that there is no generalized optimization algorithm to solve different robust LDA objectives. In addition, most existing algorithms can only guarantee the solution to be locally optimal, rather than globally optimal. In this paper, we review multiple robust loss functions and propose a new and generalized robust objective for LDA. In addition, to better remove the mean from the data, our objective learns an optimal centering rather than fixing it in advance. As one important algorithmic contribution, we derive an efficient iterative algorithm to optimize the resulting non-smooth and non-convex objective function. We theoretically prove that our solution algorithm guarantees that both the objective and the solution sequences converge to globally optimal solutions at a sub-linear convergence rate. The experimental results demonstrate the effectiveness of our new method, achieving significant improvements compared to the other competing methods.
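For context, the classical LDA baseline that the robust objectives generalize seeks directions maximizing the between-class scatter relative to the within-class scatter, solved via a generalized eigenproblem. A compact sketch of that baseline (with a small ridge term for numerical stability); the paper's robust losses and learned centering replace the squared-L2 scatters used below:

```python
import numpy as np

def lda_directions(X, y, k, reg=1e-6):
    """Classical LDA: top-k eigenvectors of Sw^{-1} Sb, where Sw/Sb are the
    within- and between-class scatter matrices under squared-L2 distance."""
    classes = np.unique(y)
    d = X.shape[1]
    mean = X.mean(axis=0)
    Sw = np.zeros((d, d))
    Sb = np.zeros((d, d))
    for c in classes:
        Xc = X[y == c]
        mc = Xc.mean(axis=0)
        Sw += (Xc - mc).T @ (Xc - mc)
        diff = (mc - mean)[:, None]
        Sb += len(Xc) * (diff @ diff.T)
    vals, vecs = np.linalg.eig(np.linalg.solve(Sw + reg * np.eye(d), Sb))
    order = np.argsort(vals.real)[::-1]
    return vecs.real[:, order[:k]]
```

On noisy data the squared distances in Sw and Sb let outliers dominate, which is exactly the failure mode the robust objectives in the paper target.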
-
Abstract. During the mid-Pliocene warm period (mPWP; 3.264–3.025 Ma), atmospheric CO2 concentrations were approximately 400 ppm, and the Antarctic Ice Sheet was substantially reduced compared to today. Antarctica is surrounded by the Southern Ocean, which plays a crucial role in the global oceanic circulation and climate regulation. Using results from the Pliocene Model Intercomparison Project (PlioMIP2), we investigate Southern Ocean conditions during the mPWP with respect to the pre-industrial period. We find that the mean sea surface temperature (SST) warming in the Southern Ocean is 2.8 °C, while global mean SST warming is 2.4 °C. The enhanced warming is strongly tied to a dramatic decrease in sea ice cover over the mPWP Southern Ocean. We also see a freshening of the ocean (sub)surface, driven by an increase in precipitation over the Southern Ocean and Antarctica. The warmer and fresher surface leads to a highly stratified Southern Ocean that can be related to weakening of the deep abyssal overturning circulation. Sensitivity simulations show that the decrease in sea ice cover and the enhanced warming are largely a consequence of the reduction in the Antarctic Ice Sheet. In addition, the mPWP geographic boundary conditions are responsible for approximately half of the mPWP SST warming, sea ice loss, precipitation increase, and stratification increase over the Southern Ocean. From these results, we conclude that a strongly reduced Antarctic Ice Sheet during the mPWP has a substantial influence on the state of the Southern Ocean and exacerbates the changes that are induced by a higher CO2 concentration alone. This is relevant for the long-term future of the Southern Ocean, as we expect melting of the western Antarctic Ice Sheet in the future, an effect that is not currently taken into account in future projections by Coupled Model Intercomparison Project (CMIP) ensembles.
-
Accurate understanding of permafrost dynamics is critical for evaluating and mitigating impacts that may arise as permafrost degrades in the future; however, existing projections have large uncertainties. Studies of how permafrost responded historically during Earth's past warm periods are helpful in exploring potential future permafrost behavior and in evaluating the uncertainty of future permafrost change projections. Here, we combine a surface frost index model with outputs from the second phase of the Pliocene Model Intercomparison Project to simulate the near-surface (~3 to 4 m depth) permafrost state in the Northern Hemisphere during the mid-Pliocene warm period (mPWP, ~3.264 to 3.025 Ma). This period shares similarities with the projected future climate. Constrained by proxy-based surface air temperature records, our simulations demonstrate that near-surface permafrost was highly spatially restricted during the mPWP and was 93 ± 3% smaller than the preindustrial extent. Near-surface permafrost was present only in the eastern Siberian uplands, Canadian high Arctic Archipelago, and northernmost Greenland. The simulations are similar to near-surface permafrost changes projected for the end of this century under the SSP5-8.5 scenario and provide a perspective on the potential permafrost behavior that may be expected in a warmer world.
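The abstract does not specify which frost index variant is used; the classical surface frost number that such models build on is F = √FDD / (√FDD + √TDD), where FDD and TDD are the annual freezing and thawing degree-day sums, with permafrost conventionally mapped where F exceeds a threshold around 0.5. A minimal sketch under that assumption:

```python
import math

def frost_index(fdd, tdd):
    """Surface frost number: F = sqrt(FDD) / (sqrt(FDD) + sqrt(TDD)).
    fdd, tdd: annual freezing / thawing degree-day sums (degC * day).
    F > ~0.5 (colder than an equal freeze/thaw balance) is a common
    criterion for permafrost presence; the exact threshold is model-specific."""
    return math.sqrt(fdd) / (math.sqrt(fdd) + math.sqrt(tdd))
```

Driving this index with PlioMIP2 monthly air temperatures at each grid cell is the kind of computation that yields the permafrost extent maps described above.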
-
Abstract. Understanding the dominant climate forcings in the Pliocene is crucial to assessing the usefulness of the Pliocene as an analogue for our warmer future. Here, we implement a novel yet simple linear factorisation method to assess the relative influence of CO2 forcing in seven models of the Pliocene Model Intercomparison Project Phase 2 (PlioMIP2) ensemble. Outputs are termed "FCO2" and show the fraction of Pliocene climate change driven by CO2. The accuracy of the FCO2 method is first assessed through comparison to an energy balance analysis previously used to assess drivers of surface air temperature in the PlioMIP1 ensemble. After this assessment, the FCO2 method is applied to achieve an understanding of the drivers of Pliocene sea surface temperature and precipitation for the first time. CO2 is found to be the most important forcing in the ensemble for Pliocene surface air temperature (global mean FCO2 = 0.56), sea surface temperature (global mean FCO2 = 0.56), and precipitation (global mean FCO2 = 0.51). The range between individual models is found to be consistent between these three climate variables, and the models generally show good agreement on the sign of the most important forcing. Our results provide the most spatially complete view of the drivers of Pliocene climate to date and have implications for both data–model comparison and the use of the Pliocene as an analogue for the future. That CO2 is found to be the most important forcing reinforces the Pliocene as a good palaeoclimate analogue, but the significant effect of non-CO2 forcing at a regional scale (e.g. orography and ice sheet forcing at high latitudes) reminds us that it is not perfect, and these additional influencing factors must not be overlooked. This comparison is further complicated when considering the Pliocene as a state in quasi-equilibrium with CO2 forcing compared to the transient warming being experienced at present.
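The abstract does not state the factorisation formula. One plausible form, consistent with "the fraction of Pliocene climate change driven by CO2" and with standard PlioMIP2 experiment naming (E280 pre-industrial control, E400 CO2-only at 400 ppm, Eoi400 full Pliocene with ice sheet and orography changes), is a ratio of anomalies; the experiment names here are assumptions, not quoted from the paper:

```python
def fco2(e280, e400, eoi400):
    """Assumed linear factorisation: fraction of the total Pliocene anomaly
    (eoi400 - e280) attributable to CO2 alone (e400 - e280), computed for
    any climate field (SAT, SST, precipitation) at a point or globally."""
    return (e400 - e280) / (eoi400 - e280)
```

Values near 1 indicate CO2-dominated change; values well below 1 flag regions (e.g. high latitudes) where ice sheet and orography forcing dominate.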
-
Nonnegative Matrix Factorization (NMF) is broadly used to determine class membership in a variety of clustering applications. From movie recommendations and image clustering to visual feature extractions, NMF has applications to solve a large number of knowledge discovery and data mining problems. Traditional optimization methods, such as the Multiplicative Updating Algorithm (MUA), solve the NMF problem by utilizing an auxiliary function to ensure that the objective monotonically decreases. Although the objective in MUA converges, there exists no proof that the learned matrix factors converge as well. Without this rigorous analysis, the clustering performance and stability of the NMF algorithms cannot be guaranteed. To address this knowledge gap, in this article, we study the factor-bounded NMF problem and provide a solution algorithm with proven convergence by rigorous mathematical analysis, which ensures that both the objective and matrix factors converge. In addition, we show the relationship between MUA and our solution followed by an analysis of the convergence of MUA. Experiments on both toy data and real-world datasets validate the correctness of our proposed method and its utility as an effective clustering algorithm.
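The MUA baseline discussed here is the Lee–Seung multiplicative update for the Frobenius-norm NMF objective min ||V − WH||_F² with W, H ≥ 0: each factor is scaled elementwise by a ratio of nonnegative terms, which keeps the iterates nonnegative and makes the objective nonincreasing. A minimal sketch (a small epsilon guards against division by zero); the factor-bounded formulation and convergence proof from the paper are not reproduced:

```python
import numpy as np

def nmf_mua(V, k, iters=200, eps=1e-10, seed=0):
    """Lee-Seung multiplicative updates for min ||V - W H||_F^2, W, H >= 0."""
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, k))
    H = rng.random((k, m))
    for _ in range(iters):
        # elementwise ratios of nonnegative terms preserve nonnegativity
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H
```

Note that while the objective value decreases monotonically under these updates, nothing in this scheme forces the sequence (W, H) itself to converge, which is precisely the gap the factor-bounded analysis closes.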
-
Abstract Despite tectonic conditions and atmospheric CO2 levels (pCO2) similar to those of the present day, geological reconstructions from the mid-Pliocene (3.3-3.0 Ma) document high lake levels in the Sahel and mesic conditions in subtropical Eurasia, suggesting drastic reorganizations of subtropical terrestrial hydroclimate during this interval. Here, using a compilation of proxy data and multi-model paleoclimate simulations, we show that the mid-Pliocene hydroclimate state is not driven by direct CO2 radiative forcing but by a loss of northern high-latitude ice sheets and continental greening. These ice sheet and vegetation changes are long-term Earth system feedbacks to elevated pCO2. Further, the moist conditions in the Sahel and subtropical Eurasia during the mid-Pliocene are a product of enhanced tropospheric humidity and a stationary wave response to the surface warming pattern, which varies strongly with land cover changes. These findings highlight the potential for amplified terrestrial hydroclimate responses over long timescales to a sustained CO2 forcing.