Scalable AI Model Deployment with AWS SageMaker and EKS |
||
|
|
|
© 2024 by IJCTT Journal | ||
Volume-72 Issue-11 |
||
Year of Publication : 2024 | ||
Authors : Joyanta Banerjee, Soumya Barman, Pratik Jain | ||
DOI : 10.14445/22312803/IJCTT-V72I11P114 |
How to Cite?
Joyanta Banerjee, Soumya Barman, Pratik Jain, "Scalable AI Model Deployment with AWS SageMaker and EKS," International Journal of Computer Trends and Technology, vol. 72, no. 11, pp. 135-142, 2024. Crossref, https://doi.org/10.14445/22312803/IJCTT-V72I11P114
Abstract
As businesses increasingly leverage Artificial Intelligence (AI) to drive innovation, the need for scalable, efficient, and secure AI model deployment is critical. AWS SageMaker and Amazon Elastic Kubernetes Service (EKS) offer a robust solution for deploying Machine Learning (ML) models in a scalable and resilient environment. This article explores integrating AWS SageMaker with EKS to create a highly available, containerized infrastructure for AI model deployment. SageMaker simplifies the process of building, training, and tuning ML models, while EKS provides a powerful platform for running these models in production, ensuring scalability through Kubernetes' container orchestration capabilities. Together, they enable enterprises to deploy AI models that can scale seamlessly to meet varying demands, handle high workloads, and maintain performance, all while taking advantage of the cloud-native ecosystem.
Keywords
AI/ML model, AWS, Containers, EKS, Sagemaker, Scalable deployment.
Reference
[1] What Is Amazon SageMaker? - Amazon SageMaker, Amazon.com, 2024. [Online]. Available: https://docs.aws.amazon.com/sagemaker/latest/dg/whatis.html
[2] using-kubernetes Run Machine Learning Models, AWS Amazon. [Online]. Available: https://aws.amazon.com/blogs/opensource/using-kubernetes-run-machine-learning-models-eks/
[3] Noah Gift, and Alfredo Deza, Practical MLOps, O’Reilly Online Learning, 2021.
[Google Scholar] [Publisher Link]
[4] Sanjeev Ganjihal et al., “Deploy Generative AI Models on Amazon EKS, Amazon Web Services, 2023. [Online]. Available: aws.amazon.com/blogs/containers/deploy-generative-ai-models-on-amazon-eks/
[5] Bilgin Ibryam, and Roland Huß, Kubernetes Patterns, 2nd Ed., Red Hat Developer, 2023. [Online]. Available: https://developers.redhat.com/e-books/kubernetes-patterns
[6] What Is Amazon EKS? - Amazon EKS, Amazon.com, 2024. [Online]. Available: https://docs.aws.amazon.com/eks/latest/userguide/what is-eks.html
[7] What Is Amazon Elastic Container Registry? - Amazon ECR, Amazon.com, 2024. [Online]. Available: https://docs.aws.amazon.com/AmazonECR/latest/userguide/what-is-ecr.html
[8] James Park et al., Reduce Model Deployment Costs by 50% on Average Using the Latest Features of Amazon SageMaker, Amazon Web Services, 2023. [Online]. Available: https://aws.amazon.com/blogs/machine-learning/reduce-model-deployment-costs-by-50-on-average using-sagemakers-latest-features/
[9] Ofir Nachmani, Compare EKS vs. Self-Managed Kubernetes on AWS, Search AWS, TechTarget, 2022. [Online]. Available: https://www.techtarget.com/searchaws/tip/2-options-to-deploy-Kubernetes-on-AWS-EKS-vs-self-managed