Model Lakes. In EDBT 2025.

Pal, Koyena; Bau, David; Miller, Renée J

doi:10.48786/edbt.2025.81

Citation Details

Model Lakes. In EDBT 2025.

Given a set of deep learning models, it can be hard to find models appropriate to a task, understand the models, and characterize how models are different one from another. Currently, practi- tioners rely on manually-written documentation to understand and choose models. However, not all models have complete and reliable documentation. As the number of models increases, the challenges of finding, differentiating, and understanding mod- els become increasingly crucial. Inspired from research on data lakes, we introduce the concept of model lakes. We formalize key model lake tasks, including model attribution, versioning, search, and benchmarking, and discuss fundamental research challenges in the management of large models. We also explore what data management techniques can be brought to bear on the study of large model management. more »

Award ID(s):: 2325632 2107248

PAR ID:: 10614597

Author(s) / Creator(s):: Pal, Koyena; Bau, David; Miller, Renée J

Editor(s):: EDBT

Publisher / Repository:: OpenProceedings.org

Date Published:: 2025-01-01

Subject(s) / Keyword(s):: Data Management Database Technology

Format(s):: Medium: X

Institution:: EDBT International Conference on Extending Data Base Techology

Sponsoring Org:: National Science Foundation

Dataset:
https://doi.org/10.48786/edbt.2025.81

More Like this