k -mer approaches for biodiversity genomics

Jenike, Katharine M; Campos-Domínguez, Lucía; Boddé, Marilou; Cerca, José; Hodson, Christina N; Schatz, Michael C; Jaron, Kamil S

doi:10.1101/gr.279452.124

Citation Details

This content will become publicly available on January 31, 2026

k -mer approaches for biodiversity genomics

The wide array of currently available genomes displays a wonderful diversity in size, composition, and structure and is quickly expanding thanks to several global biodiversity genomics initiatives. However, sequencing of genomes, even with the latest technologies, can still be challenging for both technical (e.g., small physical size, contaminated samples, or access to appropriate sequencing platforms) and biological reasons (e.g., germline-restricted DNA, variable ploidy levels, sex chromosomes, or very large genomes). In recent years,k-mer-based techniques have become popular to overcome some of these challenges. They are based on the simple process of dividing the analyzed sequences (e.g., raw reads or genomes) into a set of subsequences of lengthk, calledk-mers, and then analyzing the frequency or sequences of thosek-mers. Analyses based onk-mers allow for a rapid and intuitive assessment of complex sequencing data sets. Here, we provide a comprehensive review to the theoretical properties and practical applications ofk-mers in biodiversity genomics with a special focus on genome modeling. more »

Award ID(s):: 2216612

PAR ID:: 10591504

Author(s) / Creator(s):: Jenike, Katharine M; Campos-Domínguez, Lucía; Boddé, Marilou; Cerca, José; Hodson, Christina N; Schatz, Michael C; Jaron, Kamil S

Publisher / Repository:: Cold Spring Harbor Laboratory Press

Date Published:: 2025-01-31

Journal Name:: Genome Research

ISSN:: 1088-9051

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on January 31, 2026
Journal Article:
https://doi.org/10.1101/gr.279452.124

More Like this