Train ML models quickly and cost-effectively with Amazon SageMaker (Level 200)

Training machine learning models at scale often requires significant investments. In this session, we show how Amazon SageMaker enables you to reduce time and costs to train and tune machine learning (ML) models without the need to manage infrastructure. Learn how to use models using built-in tools to manage and track training experiments, automatically choose optimal hyperparameters, debug training jobs, and monitor the utilization of system resources such as GPUs, CPUs, and network bandwidth. We show how SageMaker Training tools enables faster distributed training, including libraries for data parallelism and model parallelism, and the Amazon SageMaker distributed training libraries automatically split models and training datasets across GPU instances to help you complete distributed training faster. Download slides »
Speaker: Gaurav Singh, Solutions Architect, AWS India
Duration: 30mins

Previous Video
Build accurate models combining diverse data types with AutoGluon on Amazon SageMaker (Level 300)
Build accurate models combining diverse data types with AutoGluon on Amazon SageMaker (Level 300)

Real-world machine learning use-cases often involve data in many forms. In this session, we cover the overv...

Next Video
Beyond model development, training and deployment - Deep dive on Amazon SageMaker Model Monitoring  (Level 200)
Beyond model development, training and deployment - Deep dive on Amazon SageMaker Model Monitoring (Level 200)

Unlike traditional software development, ML model development is an iterative process that requires the con...