Microbench: automated metadata management for systems biology benchmarking and reproducibility in Python

Lubbock, Alexander L. R. (ORCID: 0000-0002-6950-8908); Lopez, Carlos F. (ORCID: 0000-0003-3668-7468); Kelso, Janet (ed.)

doi:10.1093/bioinformatics/btac580

Abstract

Motivation

Computational systems biology analyses typically rely on multiple software packages and their dependencies, which are often run across heterogeneous compute environments. This can introduce differences in performance and reproducibility. Capturing metadata (e.g. package versions, GPU model) currently requires repetitive code, and the captured metadata is difficult to store centrally for analysis. Even where virtual environments and containers are used, updates over time mean that versioning metadata should still be captured within analysis pipelines to guarantee reproducibility.

Results

Microbench is a simple and extensible Python package that automates metadata capture to a file or Redis database. Captured metadata can include execution time, software package versions, environment variables, hardware information, Python version and more through plugins. We present three case studies demonstrating Microbench usage to benchmark code execution and examine environment metadata for reproducibility purposes.
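
To make the idea of automated metadata capture concrete, the following is a minimal sketch of the general technique using only the Python standard library. It is not Microbench's API: the decorator name capture_metadata, its parameters and the record field names are all hypothetical and chosen for illustration. The decorator times each call to the wrapped function and appends one JSON record per call, containing the Python version, hostname, selected environment variables and installed package versions, to a file-like object.

```python
import functools
import io
import json
import os
import platform
import time
from importlib import metadata


def capture_metadata(outfile, packages=(), env_vars=()):
    """Hypothetical decorator (not Microbench's API).

    Appends one JSON line of metadata to `outfile` for every call
    of the decorated function.
    """
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            record = {
                "function_name": func.__name__,
                "python_version": platform.python_version(),
                "hostname": platform.node(),
                # Unset variables are recorded as null
                "env": {name: os.environ.get(name) for name in env_vars},
                "package_versions": {},
            }
            for pkg in packages:
                try:
                    record["package_versions"][pkg] = metadata.version(pkg)
                except metadata.PackageNotFoundError:
                    # Package is not installed in this environment
                    record["package_versions"][pkg] = None
            start = time.perf_counter()
            try:
                return func(*args, **kwargs)
            finally:
                # Write the record even if the function raises
                record["run_seconds"] = time.perf_counter() - start
                outfile.write(json.dumps(record) + "\n")
        return wrapper
    return decorator


# An in-memory buffer stands in for an output file here
results = io.StringIO()


@capture_metadata(results, packages=("pip",), env_vars=("HOME",))
def simulate(n):
    return sum(i * i for i in range(n))


total = simulate(1000)
records = [json.loads(line) for line in results.getvalue().splitlines()]
```

Writing one JSON object per line keeps records from different runs and machines appendable to a single file, which can later be loaded into a table for comparison. Replacing the in-memory buffer with an open file handle would persist the records to disk.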

Availability and implementation

Install from the Python Package Index using pip install microbench. Source code is available from https://github.com/alubbock/microbench.

Supplementary information

Supplementary data are available at Bioinformatics online.

NSF-PAR ID: 10375945

Author(s) / Creator(s): Lubbock, Alexander L. R.; Lopez, Carlos F.; Kelso, Janet (ed.)

Publisher / Repository: Oxford University Press

Date Published: 2022-08-24

Journal Name: Bioinformatics

Volume: 38

Issue: 20

ISSN: 1367-4803

Page Range / eLocation ID: p. 4823-4825

Sponsoring Org: National Science Foundation

Journal Article:
https://doi.org/10.1093/bioinformatics/btac580
