Benchmarking comparison between serverless vs. vm- architectures for AI workloads

dc.contributor.authorFernando, HTV
dc.contributor.authorWijayanayake, J
dc.date.accessioned2026-05-14T06:19:52Z
dc.date.issued2025
dc.description.abstractThe rapid growth of Artificial Intelligence (AI) has led to an increased demand for cloud computing systems capable of handling diverse AI workloads. Serverless computing and VM-based cloud architectures are two dominant models for deploying AI applications, but choosing the appropriate model depends on performance factors such as latency, scalability, and cost-efficiency. This Systematic Literature Review (SLR) compares these two architectures, focusing on their effectiveness for AI tasks like real-time inference and AI model training. The review evaluates the performance metrics associated with each model, including cold-start latency, resource utilization, and scalability. Studies from 37 selected papers were analyzed using the PRISMA methodology, and key insights into the strengths and weaknesses of both models were extracted. Findings reveal that serverless systems excel in elastic scaling and cost-efficiency for bursty AI workloads but struggle with latency issues during cold starts. In contrast, VM-based systems offer predictable performance for long-duration tasks but often suffer from resource underutilization and higher costs. The review highlights the need for task-specific benchmarks and suggests the use of hybrid cloud models that combine the advantages of both approaches. Overall, this review contributes to the trade-offs between Serverless and VM-based cloud models for AI workloads and provides practical recommendations for future research and cloud deployment strategies.
dc.identifier.conferenceInternational Conference on Business Research
dc.identifier.departmentDepartment of Industrial Management
dc.identifier.doihttps://doi.org/10.31705/ICBR.2025.16
dc.identifier.emailtheekshana.jny@gmail.com
dc.identifier.facultyBusiness
dc.identifier.issn2630-7561
dc.identifier.pgnospp. 219-238
dc.identifier.placeMoratuwa, Sri Lanka
dc.identifier.proceeding8th International Conference on Business Research (ICBR 2025)
dc.identifier.urihttps://dl.lib.uom.lk/handle/123/25243
dc.identifier.year2025
dc.language.isoen
dc.publisherBusiness Research Unit (BRU)
dc.subjectBENCHMARKING
dc.subjectCLOUD COMPUTING
dc.subjectCOMPARISON
dc.subjectSERVERLESS
dc.subjectVIRTUAL MACHINE
dc.titleBenchmarking comparison between serverless vs. vm- architectures for AI workloads
dc.typeConference-Full-text

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ICBR2025-32.pdf
Size:
503.82 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections