skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: GPU-acceleration of the distributed-memory database peptide search of mass spectrometry data
Abstract Database peptide search is the primary computational technique for identifying peptides from the mass spectrometry (MS) data. Graphical Processing Units (GPU) computing is now ubiquitous in the current-generation of high-performance computing (HPC) systems, yet its application in the database peptide search domain remains limited. Part of the reason is the use of sub-optimal algorithms in the existing GPU-accelerated methods resulting in significantly inefficient hardware utilization. In this paper, we design and implement a new-age CPU-GPU HPC framework, calledGiCOPS, for efficient and complete GPU-acceleration of the modern database peptide search algorithms on supercomputers. Our experimentation shows that the GiCOPS exhibits between 1.2 to 5$$\times$$ × speed improvement over its CPU-only predecessor, HiCOPS, and over 10$$\times$$ × improvement over several existing GPU-based database search algorithms for sufficiently large experiment sizes. We further assess and optimize the performance of our framework using the Roofline Model and report near-optimal results for several metrics including computations per second, occupancy rate, memory workload, branch efficiency and shared memory performance. Finally, the CPU-GPU methods and optimizations proposed in our work for complex integer- and memory-bounded algorithmic pipelines can also be extended to accelerate the existing and future peptide identification algorithms. GiCOPS is now integrated with our umbrella HPC framework HiCOPS and is available at:https://github.com/pcdslab/gicops.  more » « less
Award ID(s):
2312599 2126253
PAR ID:
10471974
Author(s) / Creator(s):
;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Reports
Volume:
13
Issue:
1
ISSN:
2045-2322
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The amount of data produced by genome sequencing experiments has been growing rapidly over the past several years, making compression important for efficient storage, transfer and analysis of the data. In recent years, nanopore sequencing technologies have seen increasing adoption since they are portable, real-time and provide long reads. However, there has been limited progress on compression of nanopore sequencing reads obtained in FASTQ files since most existing tools are either general-purpose or specialized for short read data. We present NanoSpring, a reference-free compressor for nanopore sequencing reads, relying on an approximate assembly approach. We evaluate NanoSpring on a variety of datasets including bacterial, metagenomic, plant, animal, and human whole genome data. For recently basecalled high quality nanopore datasets, NanoSpring, which focuses only on the base sequences in the FASTQ file, uses just 0.35–0.65 bits per base which is 3–6$$\times$$ × lower than general purpose compressors like gzip. NanoSpring is competitive in compression ratio and compression resource usage with the state-of-the-art tool CoLoRd while being significantly faster at decompression when using multiple threads (> 4$$\times$$ × faster decompression with 20 threads). NanoSpring is available on GitHub athttps://github.com/qm2/NanoSpring. 
    more » « less
  2. Abstract This paper investigates convex quadratic optimization problems involvingnindicator variables, each associated with a continuous variable, particularly focusing on scenarios where the matrixQdefining the quadratic term is positive definite and its sparsity pattern corresponds to the adjacency matrix of a tree graph. We introduce a graph-based dynamic programming algorithm that solves this problem in time and memory complexity of$$\mathcal {O}(n^2)$$ O ( n 2 ) . Central to our algorithm is a precise parametric characterization of the cost function across various nodes of the graph corresponding to distinct variables. Our computational experiments conducted on both synthetic and real-world datasets demonstrate the superior performance of our proposed algorithm compared to existing algorithms and state-of-the-art mixed-integer optimization solvers. An important application of our algorithm is in the real-time inference of Gaussian hidden Markov models from data affected by outlier noise. Using a real on-body accelerometer dataset, we solve instances of this problem with over 30,000 variables in under a minute, and its online variant within milliseconds on a standard computer. A Python implementation of our algorithm is available athttps://github.com/aareshfb/Tree-Parametric-Algorithm.git. 
    more » « less
  3. Abstract Deep Neural Networks (DNNs) are increasingly deployed in critical applications, where ensuring their safety and robustness is paramount. We present$$_\text {CAV25}$$ CAV 25 , a high-performance DNN verification tool that uses the DPLL(T) framework and supports a wide-range of network architectures and activation functions. Since its debut in VNN-COMP’23, in which it achieved the New Participant Award and ranked 4th overall,$$_\text {CAV25}$$ CAV 25 has advanced significantly, achieving second place in VNN-COMP’24. This paper presents and evaluates the latest development of$$_\text {CAV25}$$ CAV 25 , focusing on the versatility, ease of use, and competitive performance of the tool.$$_\text {CAV25}$$ CAV 25 is available at:https://github.com/dynaroars/neuralsat. 
    more » « less
  4. Abstract By means of a unifying measure-theoretic approach, we establish lower bounds on the Hausdorff dimension of the space-time set which can support anomalous dissipation for weak solutions of fluid equations, both in the presence or absence of a physical boundary. Boundary dissipation, which can occur at both the time and the spatial boundary, is analyzed by suitably modifying the Duchon & Robert interior distributional approach. One implication of our results is that any bounded Euler solution (compressible or incompressible) arising as a zero viscosity limit of Navier–Stokes solutions cannot have anomalous dissipation supported on a set of dimension smaller than that of the space. This result is sharp, as demonstrated by entropy-producing shock solutions of compressible Euler (Drivas and Eyink in Commun Math Phys 359(2):733–763, 2018.https://doi.org/10.1007/s00220-017-3078-4; Majda in Am Math Soc 43(281):93, 1983.https://doi.org/10.1090/memo/0281) and by recent constructions of dissipative incompressible Euler solutions (Brue and De Lellis in Commun Math Phys 400(3):1507–1533, 2023.https://doi.org/10.1007/s00220-022-04626-0 624; Brue et al. in Commun Pure App Anal, 2023), as well as passive scalars (Colombo et al. in Ann PDE 9(2):21–48, 2023.https://doi.org/10.1007/s40818-023-00162-9; Drivas et al. in Arch Ration Mech Anal 243(3):1151–1180, 2022.https://doi.org/10.1007/s00205-021-01736-2). For$$L^q_tL^r_x$$ L t q L x r suitable Leray–Hopf solutions of the$$d-$$ d - dimensional Navier–Stokes equation we prove a bound of the dissipation in terms of the Parabolic Hausdorff measure$$\mathcal {P}^{s}$$ P s , which gives$$s=d-2$$ s = d - 2 as soon as the solution lies in the Prodi–Serrin class. In the three-dimensional case, this matches with the Caffarelli–Kohn–Nirenberg partial regularity. 
    more » « less
  5. Abstract Given a suitable solutionV(t, x) to the Korteweg–de Vries equation on the real line, we prove global well-posedness for initial data$$u(0,x) \in V(0,x) + H^{-1}(\mathbb {R})$$ u ( 0 , x ) V ( 0 , x ) + H - 1 ( R ) . Our conditions onVdo include regularity but do not impose any assumptions on spatial asymptotics. We show that periodic profiles$$V(0,x)\in H^5(\mathbb {R}/\mathbb {Z})$$ V ( 0 , x ) H 5 ( R / Z ) satisfy our hypotheses. In particular, we can treat localized perturbations of the much-studied periodic traveling wave solutions (cnoidal waves) of KdV. In the companion paper Laurens (Nonlinearity. 35(1):343–387, 2022.https://doi.org/10.1088/1361-6544/ac37f5) we show that smooth step-like initial data also satisfy our hypotheses. We employ the method of commuting flows introduced in Killip and Vişan (Ann. Math. (2) 190(1):249–305, 2019.https://doi.org/10.4007/annals.2019.190.1.4) where$$V\equiv 0$$ V 0 . In that setting, it is known that$$H^{-1}(\mathbb {R})$$ H - 1 ( R ) is sharp in the class of$$H^s(\mathbb {R})$$ H s ( R ) spaces. 
    more » « less