Vulnerability to Stability: Scalable Large Language Model in Queue-Based Web Service

Barek, Md Abdul; Rahid, Md Bajlur; Rahman, Md Mostafizur; Riad, ABM Kamrul; Francia, Guillermo; Shahriar, Hossain

Citation Details

This content will become publicly available on July 8, 2026

Vulnerability to Stability: Scalable Large Language Model in Queue-Based Web Service

Large Language Models (LLMs) have demonstrated exceptional capabilities in the field of Artificial Intelligence (AI) and are now widely used in various applications globally. However, one of their major challenges is handling high-concurrency workloads, especially under extreme conditions. When too many requests are sent simultaneously, LLMs often become unresponsive which leads to performance degradation and reduced reliability in real-world applications. To address this issue, this paper proposes a queue-based system that separates request handling from direct execution. By implementing a distributed queue, requests are processed in a structured and controlled manner, preventing system overload and ensuring stable performance. This approach also allows for dynamic scalability, meaning additional resources can be allocated as needed to maintain efficiency. Our experimental results show that this method significantly improves resilience under heavy workloads which prevents resource exhaustion and enables linear scalability. The findings highlight the effectiveness of a queue-based web service in ensuring LLMs remain responsive even under extreme workloads. more »

Award ID(s):: 2433800 2421324 1946442

PAR ID:: 10621516

Author(s) / Creator(s):: Barek, Md Abdul; Rahid, Md Bajlur; Rahman, Md Mostafizur; Riad, ABM Kamrul; Francia, Guillermo; Shahriar, Hossain

Publisher / Repository:: IEEE

Date Published:: 2025-07-08

Page Range / eLocation ID:: 995-1000

Subject(s) / Keyword(s):: Artificial Intelligence(AI) Large Language Model(LLM) Scalability Web Service Cybersecurity

Format(s):: Medium: X

Location:: Toronto, Canada

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on July 8, 2026
Conference Paper:
The DOI is not currently available.

More Like this