STI: Turbocharge NLP Inference at the Edge via Elastic Pipelining
- PAR ID: 10423924
- Date Published:
- Journal Name: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems
- Page Range / eLocation ID: 791 to 803
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation