Many researchers have shown that transformers perform as well as convolutional neural networks on many computer vision tasks. Meanwhile, the large computational cost of their attention modules hinders further studies and applications on edge devices. Some pruning methods have been developed to construct efficient vision transformers, but most of them consider image classification tasks only. Inspired by these results, we propose SiDT, a method for pruning vision transformer backbones for more complicated vision tasks such as object detection, based on a search over transformer dimensions. Experiments on the CIFAR-100 and COCO datasets show that backbones with 20% or 40% of their dimensions/parameters pruned can perform similarly to, or even better than, the unpruned models. Moreover, we provide a complexity analysis and comparisons with previous pruning methods.
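As a rough illustration of why searching over transformer dimensions shrinks the backbone, the following minimal Python sketch (not the SiDT code; the dimensions and the 40% MLP-width pruning are placeholder assumptions) counts the parameters of one transformer encoder block before and after pruning part of its width:

```python
# Hypothetical sketch: parameter count of one transformer block at a given
# width, to show that pruning a fraction of a dimension removes roughly the
# same fraction of that dimension's parameters.

def block_params(embed_dim: int, mlp_dim: int) -> int:
    """Rough parameter count of one transformer encoder block (no biases/norms)."""
    qkv = 3 * embed_dim * embed_dim   # query/key/value projections
    proj = embed_dim * embed_dim      # attention output projection
    mlp = 2 * embed_dim * mlp_dim     # two-layer MLP
    return qkv + proj + mlp

dense = block_params(embed_dim=384, mlp_dim=1536)
pruned = block_params(embed_dim=384, mlp_dim=int(0.6 * 1536))  # prune 40% of MLP width
print(f"dense: {dense:,}  pruned: {pruned:,}  ratio: {pruned / dense:.2f}")
```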
-
Deep Neural Networks (DNNs) need to be both efficient and robust for practical use. Quantization and structure simplification are promising ways to adapt DNNs to mobile devices, and adversarial training is one of the most successful methods for training robust DNNs. In this work, we aim to realize both advantages by applying a convergent relaxation quantization algorithm, Binary-Relax (BR), to an adversarially trained robust model, the ResNets Ensemble via Feynman-Kac Formalism (EnResNet). We discover that high-precision quantization, such as ternary (tnn) or 4-bit, produces sparse DNNs, but this sparsity is unstructured under adversarial training. To address the problems that adversarial training jeopardizes DNNs' accuracy on clean images and breaks the structure of sparsity, we design a trade-off loss function that helps DNNs preserve natural accuracy and improve channel sparsity. With the newly designed trade-off loss function, we achieve both goals with no loss of resistance under weak attacks and only a minor loss of resistance under strong adversarial attacks. Together with our model and algorithm selections and loss function design, we provide an integrated approach for producing robust DNNs with high efficiency and accuracy. Furthermore, we provide a missing benchmark on the robustness of quantized models.
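A minimal PyTorch sketch of the general idea (an assumption, not the paper's exact formulation; the weights alpha and lam and the function name are placeholders): mix the loss on clean and adversarial inputs, and add a group-lasso penalty on convolutional output channels to encourage structured (channel) sparsity.

```python
import torch
import torch.nn.functional as F

def tradeoff_loss(model, x_clean, x_adv, y, alpha=0.5, lam=1e-4):
    loss_nat = F.cross_entropy(model(x_clean), y)   # term preserving natural accuracy
    loss_adv = F.cross_entropy(model(x_adv), y)     # term preserving robustness
    group_penalty = 0.0
    for m in model.modules():
        if isinstance(m, torch.nn.Conv2d):
            # l2 norm per output channel, summed over channels: an l2,1 group penalty
            group_penalty = group_penalty + m.weight.flatten(1).norm(dim=1).sum()
    return alpha * loss_nat + (1 - alpha) * loss_adv + lam * group_penalty
```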
-
Multi-resolution paths and multi-scale feature representation are key elements of semantic segmentation networks. We develop two techniques for efficient networks based on the recent FasterSeg network architecture. The first is to use a state-of-the-art high-resolution network (e.g., HRNet) as a teacher to distill a lightweight student network. Because the teacher and student networks have dissimilar structures, distillation cannot be carried out effectively in the standard, direct way. To solve this problem, we introduce a tutor network with an added high-resolution path to help distill a student network that improves on the FasterSeg student while maintaining its parameter/FLOPs counts. The second is to replace the standard bilinear interpolation in the upscaling module of the FasterSeg student network with a depth-wise separable convolution and a PixelShuffle module, which leads to 1.9% (1.4%) mIoU improvements at low (high) input image sizes without increasing model size. A combination of these techniques will be pursued in future work.
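The second technique can be sketched directly in PyTorch. The block below is an illustrative assumption (channel counts, class count, and the module name are placeholders, not the exact FasterSeg student configuration): a 2x upscaling block that replaces bilinear interpolation with a depth-wise separable convolution followed by PixelShuffle.

```python
import torch
import torch.nn as nn

class DWSepPixelShuffleUp(nn.Module):
    def __init__(self, in_ch: int, out_ch: int, scale: int = 2):
        super().__init__()
        # depth-wise 3x3 convolution (one filter per input channel)
        self.depthwise = nn.Conv2d(in_ch, in_ch, 3, padding=1, groups=in_ch)
        # point-wise 1x1 convolution expands channels by scale**2 for PixelShuffle
        self.pointwise = nn.Conv2d(in_ch, out_ch * scale * scale, 1)
        self.shuffle = nn.PixelShuffle(scale)

    def forward(self, x):
        return self.shuffle(self.pointwise(self.depthwise(x)))

up = DWSepPixelShuffleUp(in_ch=64, out_ch=19)   # e.g. 19 Cityscapes classes
y = up(torch.randn(1, 64, 64, 128))             # output shape: (1, 19, 128, 256)
```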
-
Neural Architecture Search (NAS) and its variants have recently become competitive in many computer vision tasks. In this paper, we develop a Cooperative Architecture Search and Distillation (CASD) method for network compression. Compared with prior art, our method achieves better performance in ResNet-164 pruning on CIFAR-10 and CIFAR-100 image classification, and is promising to extend to other tasks.
-
In a class of piecewise-constant image segmentation models, we propose to incorporate a weighted difference of anisotropic and isotropic total variation (AITV) to regularize the partition boundaries in an image. In particular, we replace the total variation regularization in the Chan-Vese segmentation model and a fuzzy region competition model with the proposed AITV. To deal with the nonconvex nature of AITV, we apply the difference-of-convex algorithm (DCA), in which the subproblems can be minimized by the primal-dual hybrid gradient method with line search. The convergence of the DCA scheme is analyzed. In addition, a generalization to color image segmentation is discussed. In the numerical experiments, we compare the proposed models with classic convex approaches and two-stage segmentation methods (smoothing and then thresholding) on various images, showing that our models are effective in image segmentation and robust to impulsive noise.
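For orientation, the regularizer can be written as follows; this is a sketch with assumed notation (weight alpha, pixel values u_i), not copied from the paper. The anisotropic term and the weighted isotropic term are each convex, which is what lets DCA split their difference.

```latex
% weighted anisotropic-isotropic total variation (notation assumed)
\mathrm{AITV}(u)
  = \|\nabla u\|_{1} - \alpha \|\nabla u\|_{2,1}
  = \sum_{i} \left( |\partial_x u_i| + |\partial_y u_i| \right)
    - \alpha \sum_{i} \sqrt{(\partial_x u_i)^2 + (\partial_y u_i)^2},
  \qquad \alpha \in [0,1].
```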
-
Convolutional neural networks (CNNs) have been hugely successful recently, with superior accuracy and performance in various imaging applications such as classification, object detection, and segmentation. However, a highly accurate CNN model requires millions of parameters to be trained and utilized, and even a slight increase in performance requires significantly more parameters due to added layers and/or more filters per layer. Many of these weight parameters turn out to be redundant and extraneous, so the original, dense model can be replaced by a compressed version obtained by imposing inter- and intra-group sparsity onto the layer weights during training. In this paper, we propose a nonconvex family of sparse group lasso that blends nonconvex regularization (e.g., transformed ℓ1, ℓ1 − ℓ2, and ℓ0), which induces sparsity on the individual weights, with ℓ2,1 regularization on the output channels of a layer. We apply variable splitting to the proposed regularization to develop an algorithm consisting of two steps per iteration: gradient descent and thresholding. Numerical experiments on various CNN architectures demonstrate the effectiveness of the nonconvex family of sparse group lasso in network sparsification, with test accuracy on par with the current state of the art.
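A minimal NumPy sketch of one such iteration follows. It is a stand-in rather than the paper's algorithm: plain soft thresholding is used as a proxy for the nonconvex element-wise penalties, and the step size and penalty weights (lr, lam1, lam2) are placeholder values.

```python
import numpy as np

def soft_threshold(w, t):
    # element-wise shrinkage (proxy for the nonconvex sparsity penalty)
    return np.sign(w) * np.maximum(np.abs(w) - t, 0.0)

def group_threshold(w, t):
    # w has shape (out_channels, weights_per_channel); shrink each channel's l2 norm
    norms = np.linalg.norm(w, axis=1, keepdims=True)
    scale = np.maximum(1.0 - t / np.maximum(norms, 1e-12), 0.0)
    return w * scale

def sparse_group_step(w, grad, lr=0.1, lam1=1e-3, lam2=1e-3):
    w = w - lr * grad                 # step 1: gradient descent on the data loss
    w = soft_threshold(w, lr * lam1)  # step 2a: element-wise thresholding
    w = group_threshold(w, lr * lam2) # step 2b: channel-wise (group) thresholding
    return w
```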
-
The G-equation is a well-known model for studying front propagation in turbulent combustion. In this paper, we develop an efficient model reduction method for computing regular solutions of viscous G-equations in incompressible steady and time-periodic cellular flows. Our method is based on the Galerkin proper orthogonal decomposition (POD) method. To facilitate the algorithm design and convergence analysis, we decompose the solution of the viscous G-equation into a mean-free part and a mean part, where their evolution equations can be derived accordingly. We construct the POD basis from the solution snapshots of the mean-free part. With the POD basis, we can efficiently solve the evolution equation for the mean-free part of the solution to the viscous G-equation. After we get the mean-free part of the solution, the mean of the solution can be recovered. We also provide rigorous convergence analysis for our method. Numerical results for viscous G-equations and curvature G-equations are presented to demonstrate the accuracy and efficiency of the proposed method. In addition, we study the turbulent flame speeds of the viscous G-equations in incompressible cellular flows.
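The core POD step can be illustrated with a short NumPy sketch; this is not the paper's implementation, and the snapshot matrix, truncation rank r, and variable names are placeholder assumptions. It builds a reduced basis from mean-free snapshots and projects a new mean-free state onto it, which is the ingredient the Galerkin reduced model evolves.

```python
import numpy as np

def pod_basis(snapshots, r):
    """snapshots: (n_dof, n_snapshots) matrix of mean-free solution snapshots."""
    U, s, _ = np.linalg.svd(snapshots, full_matrices=False)
    return U[:, :r]                          # first r POD modes

rng = np.random.default_rng(0)
S = rng.standard_normal((500, 60))           # placeholder snapshot matrix
S = S - S.mean(axis=0, keepdims=True)        # mean-free part; the mean is handled separately
Phi = pod_basis(S, r=10)
g_new = S[:, -1]                             # a new mean-free state
g_reduced = Phi.T @ g_new                    # reduced (Galerkin) coordinates
g_approx = Phi @ g_reduced                   # reconstruction in the full space
```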