Search for: All records

Creators/Authors contains: "Wu, Zhenbin"

Fast convolutional neural networks on FPGAs with hls4ml

https://doi.org/10.1088/2632-2153/ac0ea1

Aarrestad, Thea; Loncar, Vladimir; Ghielmetti, Nicolò; Pierini, Maurizio; Summers, Sioni; Ngadiuba, Jennifer; Petersson, Christoffer; Linander, Hampus; Iiyama, Yutaro; Di Guglielmo, Giuseppe; et al. (July 2021, Machine Learning: Science and Technology)
Full Text Available
Compressing deep neural networks on FPGAs to binary and ternary precision with hls4ml

https://doi.org/10.1088/2632-2153/aba042

Ngadiuba, Jennifer; Loncar, Vladimir; Pierini, Maurizio; Summers, Sioni; Di Guglielmo, Giuseppe; Duarte, Javier; Harris, Philip; Rankin, Dylan; Jindariani, Sergo; Liu, Mia; et al. (December 2020, Machine Learning: Science and Technology)
AIgean: An Open Framework for Machine Learning on Heterogeneous Clusters

https://doi.org/10.1109/FCCM48280.2020.00072

Tarafdar, Naif; Guglielmo, Giuseppe Di; Harris, Philip C; Krupa, Jeffrey D; Loncar, Vladimir; Rankin, Dylan S; Tran, Nhan; Wu, Zhenbin; Shen, Qianfeng; Chow, Paul (May 2020, FCCM conference proceedings)

Full Text Available
Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

Heinz, Aneesh; Razavimaleki, Vasall; Duarte, Javier; DeZoort, Gage; Ojalvo, Isobel; Thais, Savannah; Atkinson, Markus; Neubauer, Mark; Gray, Lindsey; Jindariani, Sergo; et al. (November 2020, arXiv.org)
We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks. The two complementary FPGA designs are based on OpenCL, a framework for writing programs that execute across heterogeneous platforms, and hls4ml, a high-level-synthesis-based compiler for neural network to firmware conversion. We evaluate and compare the resource usage, latency, and tracking performance of our implementations based on a benchmark dataset. We find a considerable speedup over CPU-based execution is possible, potentially enabling such algorithms to be used effectively in future computing workflows and the FPGA-based Level-1 trigger at the CERN Large Hadron Collider.
Full Text Available
Distance-Weighted Graph Neural Networks on FPGAs for Real-Time Particle Reconstruction in High Energy Physics

https://doi.org/10.3389/fdata.2020.598927

Iiyama, Yutaro; Cerminara, Gianluca; Gupta, Abhijay; Kieseler, Jan; Loncar, Vladimir; Pierini, Maurizio; Qasim, Shah Rukh; Rieger, Marcel; Summers, Sioni; Van Onsem, Gerrit; et al. (January 2021, Frontiers in Big Data)
Graph neural networks have been shown to achieve excellent performance for several crucial tasks in particle physics, such as charged particle tracking, jet tagging, and clustering. An important domain for the application of these networks is the FPGA-based first layer of real-time data filtering at the CERN Large Hadron Collider, which has strict latency and resource constraints. We discuss how to design distance-weighted graph networks that can be executed with a latency of less than one μs on an FPGA. To do so, we consider a representative task associated with particle reconstruction and identification in a next-generation calorimeter operating at a particle collider. We use a graph network architecture developed for such purposes, and apply additional simplifications to match the computing constraints of Level-1 trigger systems, including weight quantization. Using the hls4ml library, we convert the compressed models into firmware to be implemented on an FPGA. Performance of the synthesized models is presented both in terms of inference accuracy and resource usage.
Full Text Available
hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices

Fahim, Farah; Hawks, Benjamin; Herwig, Christian; Hirschauer, James; Jindariani, Serge; Nhan, Trần; Carloni, Luca; DiGuglielmo, Giuseppe; Harris, Phillip; Krupa, Jeffrey; et al. (April 2021, arXiv.org)
Accessible machine learning algorithms, software, and diagnostic tools for energy-efficient devices and systems are extremely valuable across a broad range of application domains. In scientific domains, real-time near-sensor processing can drastically improve experimental design and accelerate scientific discoveries. To support domain scientists, we have developed hls4ml, an open-source software-hardware codesign workflow to interpret and translate machine learning algorithms for implementation with both FPGA and ASIC technologies. We expand on previous hls4ml work by extending capabilities and techniques towards low-power implementations and increased usability: new Python APIs, quantization-aware pruning, end-to-end FPGA workflows, long pipeline kernels for low power, and new device backends, including an ASIC workflow. Taken together, these and continued efforts in hls4ml will arm a new generation of domain scientists with accessible, efficient, and powerful tools for machine-learning-accelerated discovery.
Full Text Available
FPGA-Accelerated Machine Learning Inference as a Service for Particle Physics Computing

https://doi.org/10.1007/s41781-019-0027-2

Duarte, Javier; Harris, Philip; Hauck, Scott; Holzman, Burt; Hsu, Shih-Chieh; Jindariani, Sergo; Khan, Suffian; Kreis, Benjamin; Lee, Brian; Liu, Mia; et al. (December 2019, Computing and Software for Big Science)

Full Text Available
Probing compressed bottom squarks with boosted jets and shape analysis

https://doi.org/10.1103/PhysRevD.92.095009

Dutta, Bhaskar; Gurrola, Alfredo; Hatakeyama, Kenichi; Johns, Will; Kamon, Teruki; Sheldon, Paul; Sinha, Kuver; Wu, Sean; Wu, Zhenbin (November 2015, Physical Review D)
A new calibration method for charm jet identification validated with proton-proton collision events at √s = 13 TeV

https://doi.org/10.1088/1748-0221/17/03/P03014

Tumasyan, Armen; Adam, Wolfgang; Andrejkovic, Janik Walter; Bergauer, Thomas; Chatterjee, Suman; Dragicevic, Marko; Escalante Del Valle, Alberto; Fruehwirth, Rudolf; Jeitler, Manfred; Krammer, Natascha; et al. (March 2022, Journal of Instrumentation)

Many measurements at the LHC require efficient identification of heavy-flavour jets, i.e. jets originating from bottom (b) or charm (c) quarks. An overview of the algorithms used to identify c jets is described and a novel method to calibrate them is presented. This new method adjusts the entire distributions of the outputs obtained when the algorithms are applied to jets of different flavours. It is based on an iterative approach exploiting three distinct control regions that are enriched with either b jets, c jets, or light-flavour and gluon jets. Results are presented in the form of correction factors evaluated using proton-proton collision data with an integrated luminosity of 41.5 fb⁻¹ at √s = 13 TeV, collected by the CMS experiment in 2017. The closure of the method is tested by applying the measured correction factors on simulated data sets and checking the agreement between the adjusted simulation and collision data. Furthermore, a validation is performed by testing the method on pseudodata, which emulate various mismodelling conditions. The calibrated results enable the use of the full distributions of heavy-flavour identification algorithm outputs, e.g. as inputs to machine-learning models. Thus, they are expected to increase the sensitivity of future physics analyses.
Full Text Available