PEARC'25
(Ed.)
The Rosen Center for Advanced Computing at Purdue University has recently released two Generative AI inference tools, AnvilGPT and Purdue GenAI Studio, to the research and campus communities. These services support over 1000 users who use 10+ open-source GenAI models to aid their work. Building on HPC’s long history of using open-source tools, these services are based on customized open-source frameworks and hosted entirely on-prem. This pa- per argues that building custom GenAI services from open-source frameworks is a scalable and cost-effective solution for providing access to Generative AI models. This paper shares the methodology and resources required to develop and host these services and seeks to be a resource for other research computing centers that wish to leverage their HPC investment to create similar services.
more »
« less
An official website of the United States government

