skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.
Attention:The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 7:00 AM ET to 7:30 AM ET on Friday, April 24 due to maintenance. We apologize for the inconvenience.


Search for: All records

Award ID contains: 2046444

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Multi-user environments present significant challenges in coordinating diverse preferences and resolving conflicts around shared resources. Current systems use a single-agent approach that struggles to balance individual needs with collective objectives. We introduce MALLM, a novel framework that deploys personalized LLM-based agents for each user on edge devices. MALLM integrates multi-sensor data fusion with a structured multi-agent decision-making mechanism, processing all data locally for enhanced privacy. Our edge-computing architecture enables real-time deliberation through evidence-based argumentation and consensus formation algorithms. The system continuously refines user profiles through sensor data while managing computational resources e!ciently. We evaluate MALLM through two case studies-health monitoring and personalized comfort management- demonstrating improved conflict resolution and resource e!ciency compared to conventional approaches. Our results show that MALLM e''ectively balances competing user priorities while preserving privacy in complex shared environments. 
    more » « less
  2. GPU sharing between workloads is an e!ective approach to increase GPU utilization and reduce idle power waste. To minimize resource contention under GPU sharing, current architectures allow users to allocate core GPU compute resources exclusively to workloads. However, identifying the most e''cient GPU compute resource allocation for colocated workloads is challenging, as it requires balancing potential performance degradation and power savings. This paper presents a framework for finding the most energy-e''cient compute allocation for colocated workload pairs under NVIDIA MPS using lightweight prediction models. Experimental results, using a range of training, inference, and general CUDA workloads, demonstrate that our solution outperforms the equal sharing strategy by 35%, on average, and is within 1.5% of the o#ine optimal strategy. 
    more » « less