NSF PAR Search | NSF Public Access Repository


Counting manatee aggregations using deep neural networks and Anisotropic Gaussian Kernel

https://doi.org/10.1038/s41598-023-45507-3

Wang, Zhiqiang; Pang, Yiran; Ulus, Cihan; Zhu, Xingquan (December 2023, Scientific Reports)

Abstract: Manatees are aquatic mammals with voracious appetites. They rely on sea grass as their main food source and often spend up to eight hours a day grazing. They move slowly and frequently stay in groups (i.e., aggregations) in shallow water to search for food, making them vulnerable to environmental change and other risks. Accurately counting manatee aggregations within a region is not only biologically meaningful for observing their habits, but also crucial for designing safety rules for boaters, divers, etc., as well as scheduling nursing, intervention, and other plans. In this paper, we propose a deep learning based crowd counting approach to automatically count the number of manatees within a region, using low quality images as input. Because manatees have a unique shape and often stay in shallow water in groups, water surface reflection, occlusion, camouflage, etc. make it difficult to count them accurately. To address these challenges, we propose to use an Anisotropic Gaussian Kernel (AGK), with tunable rotation and variances, to ensure that density functions can maximally capture the shapes of individual manatees in different aggregations. We then apply the AGK to different types of deep neural networks primarily designed for crowd counting, including VGG, SANet, Congested Scene Recognition network (CSRNet), MARUNet, etc., to learn manatee densities and calculate the number of manatees in the scene. Using generic low quality images extracted from surveillance videos, our experimental results and comparisons show that AGK based manatee counting achieves the minimum Mean Absolute Error (MAE) and Root Mean Square Error (RMSE). The proposed method works particularly well for counting manatee aggregations in environments with complex backgrounds.
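To illustrate the idea of an anisotropic Gaussian kernel with tunable rotation and variances, the following is a minimal sketch and not the authors' implementation. The function name `agk_density_map`, the per-animal annotation format (center, two standard deviations, rotation angle), and the per-blob normalization are all assumptions made for illustration. Each annotated animal contributes an elongated, rotated Gaussian blob that sums to 1, so the whole density map sums to the animal count.

```python
import math

def agk_density_map(height, width, points):
    """Build a density map from hypothetical annotations.

    points: list of (row, col, sigma_long, sigma_short, theta), where theta is
    the rotation of the long axis in radians. Centers are assumed to lie
    inside the image.
    """
    density = [[0.0] * width for _ in range(height)]
    for r0, c0, s_long, s_short, theta in points:
        cos_t, sin_t = math.cos(theta), math.sin(theta)
        blob = [[0.0] * width for _ in range(height)]
        total = 0.0
        for r in range(height):
            for c in range(width):
                dy, dx = r - r0, c - c0
                # Rotate the pixel offset into the kernel's principal axes.
                u = cos_t * dx + sin_t * dy
                v = -sin_t * dx + cos_t * dy
                val = math.exp(-0.5 * ((u / s_long) ** 2 + (v / s_short) ** 2))
                blob[r][c] = val
                total += val
        # Normalize so each animal contributes exactly 1 to the map.
        for r in range(height):
            for c in range(width):
                density[r][c] += blob[r][c] / total
    return density

# Two hypothetical animals: one lying horizontally, one rotated by 45 degrees.
demo = agk_density_map(32, 32, [(10, 10, 4.0, 1.0, 0.0),
                                (22, 20, 4.0, 1.0, math.pi / 4)])
estimated_count = sum(sum(row) for row in demo)
```

In a counting pipeline of this kind, maps like `demo` would serve as regression targets for the network, and the predicted count is the sum over the predicted map.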
Full Text Available
Tackling Bias, Privacy, and Scarcity in Health Data Analytics

Wang, Shuwen (October 2023, Florida Atlantic University Dissertation)

Health data analytics has emerged as a critical domain with immense potential to revolutionize healthcare delivery, disease management, and medical research. However, it is confronted by formidable challenges, including sample bias, data privacy concerns, and the cost and scarcity of labeled data. These challenges collectively impede the development of accurate and robust machine learning models for various healthcare applications, from disease diagnosis to treatment recommendations. Sample bias and specificity refer to the inherent challenges in working with health datasets that may not be representative of the broader population or may exhibit disparities in their distributions. These biases can significantly impact the generalizability and effectiveness of machine learning models in healthcare, potentially leading to suboptimal outcomes for certain patient groups. Data privacy and locality are paramount concerns in the era of digital health records and wearable devices. The need to protect sensitive patient information while still extracting valuable insights from these data sources poses a delicate balancing act. Moreover, the geographic and jurisdictional differences in data regulations further complicate the use of health data in a global context. Label cost and scarcity pertain to the often labor-intensive and expensive process of obtaining ground-truth labels for supervised learning tasks in healthcare. The limited availability of labeled data can hinder the development and deployment of machine learning models, particularly in specialized medical domains. This dissertation mainly focuses on health data analytics and explores approaches to tackle the above challenges. More specifically, the following three problems will be studied from different perspectives: (1) Sample bias and specificity in health data. (2) Data privacy and locality in health data. (3) Label cost and scarcity in health data.
Full Text Available
FedDNA: Federated learning using dynamic node alignment

https://doi.org/10.1371/journal.pone.0288157

Wang, Shuwen; Zhu, Xingquan (July 2023, PLOS ONE)
Donta, Praveen Kumar (Ed.)
Federated Learning (FL), as a new computing framework, has received significant attention recently due to its advantages in preserving data privacy while training models with superb performance. During FL learning, distributed sites first learn their respective parameters. A central site then consolidates the learned parameters, using averaging or other approaches, and disseminates new weights across all sites to carry out the next round of learning. The distributed parameter learning and consolidation repeat in an iterative fashion until the algorithm converges or terminates. Many FL methods exist to aggregate weights from distributed sites, but most approaches use a static node alignment approach, where nodes of distributed networks are statically assigned, in advance, to match nodes and aggregate their weights. In reality, neural networks, especially dense networks, have nontransparent roles with respect to individual nodes. Combined with the random nature of the networks, static node matching often does not result in the best matching between nodes across sites. In this paper, we propose FedDNA, a dynamic node alignment federated learning algorithm. Our theme is to find the best matching nodes between different sites, and then aggregate the weights of matching nodes for federated learning. For each node in a neural network, we represent its weight values as a vector, and use a distance function to find the most similar nodes, i.e., nodes with the smallest distance from other sites. Because finding the best matching across all sites is computationally expensive, we further design a minimum spanning tree based approach to ensure that a node from each site will have matched peers from other sites, such that the total pairwise distances across all sites are minimized. Experiments and comparisons demonstrate that FedDNA outperforms commonly used baselines, such as FedAvg, for federated learning.
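The node matching idea can be sketched for the simplest case of two sites. This is an illustration only: it uses a greedy one-to-one matching on Euclidean distance rather than the minimum spanning tree based approach described in the abstract, and the names `euclidean` and `align_and_average` are hypothetical. It assumes both sites hold the same number of nodes in the layer being aligned.

```python
import math

def euclidean(a, b):
    """Distance between two node weight vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def align_and_average(site_a, site_b):
    """Greedily match each node of site_a to its closest unused node of
    site_b, then average the weights of matched pairs.

    site_a, site_b: lists of weight vectors, one per node (same layer).
    Returns (match, averaged), where match[i] is the site_b index paired
    with node i of site_a.
    """
    # All cross-site pairs, closest first.
    pairs = sorted(
        (euclidean(wa, wb), i, j)
        for i, wa in enumerate(site_a)
        for j, wb in enumerate(site_b)
    )
    used_a, used_b, match = set(), set(), {}
    for _, i, j in pairs:
        if i not in used_a and j not in used_b:
            match[i] = j
            used_a.add(i)
            used_b.add(j)
    averaged = [
        [(x + y) / 2 for x, y in zip(site_a[i], site_b[match[i]])]
        for i in range(len(site_a))
    ]
    return match, averaged

# Site B learned similar nodes, but stored them in the opposite order.
site_a = [[1.0, 0.0], [0.0, 1.0]]
site_b = [[0.0, 1.1], [0.9, 0.0]]
match, averaged = align_and_average(site_a, site_b)
```

With static, index-based averaging, node 0 of site A would be averaged with node 0 of site B even though the two vectors point in different directions; matching by distance pairs it with node 1 instead.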
Full Text Available
SGCCL: Siamese Graph Contrastive Consensus Learning for Personalized Recommendation

https://doi.org/10.1145/3539597.3570422

Li, Boyu; Guo, Ting; Zhu, Xingquan; Li, Qian; Wang, Yang; Chen, Fang (February 2023, The 16th ACM International WSDM Conference)

Full Text Available
Temporal Adaptive Aggregation Network for Dynamic Graph Learning

https://doi.org/10.1109/BigData55660.2022.10020659

Wu, Man; Zhu, Xingquan (December 2022, 2022 IEEE International Conference on Big Data (Big Data))

Full Text Available
Predictive Masking for Semi-Supervised Graph Contrastive Learning

https://doi.org/10.1109/BigData55660.2022.10020970

Jin, Yufei; Zhu, Xingquan (December 2022, 2022 IEEE International Conference on Big Data (Big Data))

Full Text Available
Nationwide hospital admission data statistics and disease-specific 30-day readmission prediction

https://doi.org/10.1007/s13755-022-00195-7

Wang, Shuwen; Zhu, Xingquan (December 2022, Health Information Science and Systems)

Full Text Available
Knowledge Graph Embedding by Double Limit Scoring Loss

https://doi.org/10.1109/TKDE.2021.3060755

Zhou, Xiaofei; Niu, Lingfeng; Zhu, Qiannan; Zhu, Xingquan; Liu, Ping; Tan, Jianlong; Guo, Li (December 2022, IEEE Transactions on Knowledge and Data Engineering)

Full Text Available
Transfer Naïve Bayes Learning using Augmentation and Stacking for SMS Spam Detection

https://doi.org/10.1109/ICKG55886.2022.00042

Ulus, Cihan; Wang, Zhiqiang; Iqbal, Sheikh M.A.; Khan, K.Md.Salman; Zhu, Xingquan (November 2022, 2022 IEEE International Conference on Knowledge Graph (ICKG))

Full Text Available
Local Contrastive Feature Learning for Tabular Data

https://doi.org/10.1145/3511808.3557630

Gharibshah, Zhabiz; Zhu, Xingquan (October 2022, Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM ’22), October 17–21, 2022, Atlanta, GA, USA)

Contrastive self-supervised learning has been successfully used in many domains, such as images, texts, graphs, etc., to learn features without requiring label information. In this paper, we propose a new local contrastive feature learning (LoCL) framework, and our theme is to learn local patterns/features from tabular data. In order to create a niche for local learning, we use feature correlations to create a maximum-spanning tree, and break the tree into feature subsets, with strongly correlated features being assigned next to each other. Convolutional learning of the features is used to learn latent feature space, regulated by contrastive and reconstruction losses. Experiments on public tabular datasets show the effectiveness of the proposed method versus state-of-the-art baseline methods.
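The feature grouping step can be sketched as follows. This is a rough illustration under several assumptions, not the LoCL implementation: it uses absolute Pearson correlation as the edge weight, builds the maximum spanning tree with Kruskal's algorithm, and splits the tree by dropping its weakest edges. The abstract does not say how the tree is broken, so that rule, the function names `pearson` and `feature_subsets`, and the `n_subsets` parameter are hypothetical. Columns are assumed to be non-constant.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant columns."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)

def feature_subsets(columns, n_subsets):
    """Group feature indices so strongly correlated features stay together."""
    d = len(columns)
    # Candidate edges between every feature pair, strongest correlation first.
    edges = sorted(
        ((abs(pearson(columns[i], columns[j])), i, j)
         for i in range(d) for j in range(i + 1, d)),
        reverse=True,
    )
    parent = list(range(d))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Kruskal in descending weight order. Stopping once n_subsets components
    # remain equals building the full maximum spanning tree and then
    # removing its (n_subsets - 1) weakest edges.
    components = d
    for _, i, j in edges:
        if components == n_subsets:
            break
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[ri] = rj
            components -= 1
    groups = {}
    for i in range(d):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())

# Four hypothetical features: 0 and 1 move together, as do 2 and 3.
columns = [[1, 2, 3, 4], [2, 4, 6, 8.1], [4, 1, 3, 2], [8, 2, 6.2, 4]]
subsets = feature_subsets(columns, n_subsets=2)
```

Each resulting subset could then be fed to a local (for example, convolutional) encoder, which is the role the abstract gives to the feature subsets.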
Full Text Available
