Optimize training of foundation models on Amazon SageMaker
Training machine learning models at scale often requires a significant amount of resources, time, and investment. In this session, we explain how to leverage Amazon SageMaker to train and tune machine learning (ML) models without the need to manage infrastructure. We walk through the various strategies to train large language models in a cost-effective and performant manner using Amazon SageMaker. Learn how to optimize training to minimize cost and achieve high performance with distributed training approaches, smart sifting, Amazon SageMaker HyperPod, AWS Trainium, and other optimization methods. The session concludes with tips and best practices for optimizing training to minimize cost and achieve high performance.
Speakers:
Gaurav Singh, Senior Solutions Architect, AWS India
Smiti Guru, Senior Solutions Architect, AWS India