dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes

Zheng, Jinfang; Hu, Boyang; Zhang, Xinpeng; Ge, Qiwei; Yan, Yuchen; Akresi, Jerry; Piyush, Ved; Huang, Le; Yin, Yanbin (ORCID:000000017667881X)

doi:10.1093/nar/gkac1068

Citation Details

dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes

Abstract Carbohydrate Active EnZymes (CAZymes) are significantly important for microbial communities to thrive in carbohydrate rich environments such as animal guts, agricultural soils, forest floors, and ocean sediments. Since 2017, microbiome sequencing and assembly have produced numerous metagenome assembled genomes (MAGs). We have updated our dbCAN-seq database (https://bcb.unl.edu/dbCAN_seq) to include the following new data and features: (i) ∼498 000 CAZymes and ∼169 000 CAZyme gene clusters (CGCs) from 9421 MAGs of four ecological (human gut, human oral, cow rumen, and marine) environments; (ii) Glycan substrates for 41 447 (24.54%) CGCs inferred by two novel approaches (dbCAN-PUL homology search and eCAMI subfamily majority voting) (the two approaches agreed on 4183 CGCs for substrate assignments); (iii) A redesigned CGC page to include the graphical display of CGC gene compositions, the alignment of query CGC and subject PUL (polysaccharide utilization loci) of dbCAN-PUL, and the eCAMI subfamily table to support the predicted substrates; (iv) A statistics page to organize all the data for easy CGC access according to substrates and taxonomic phyla; and (v) A batch download page. In summary, this updated dbCAN-seq database highlights glycan substrates predicted for CGCs from microbiomes. Future work will implement the substrate prediction function in our dbCAN2 web server. more »

Award ID(s):: 1933521

PAR ID:: 10380857

Author(s) / Creator(s):: Zheng, Jinfang; Hu, Boyang; Zhang, Xinpeng; Ge, Qiwei; Yan, Yuchen; Akresi, Jerry; Piyush, Ved; Huang, Le; Yin, Yanbin

Publisher / Repository:: Oxford University Press

Date Published:: 2022-11-18

Journal Name:: Nucleic Acids Research

Volume:: 51

Issue:: D1

ISSN:: 0305-1048

Page Range / eLocation ID:: p. D557-D563

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1093/nar/gkac1068

More Like this