Choosing the right compute for ML training and inference

Organizations across various industries are increasingly adopting machine learning for a wide range of use cases, including natural language processing (NLP), computer vision, voice assistants, fraud detection, and recommendation engines. Large language models (LLMs) that have hundreds of billions of parameters are unlocking new generative AI use cases, for example, image and text generation. But the growth of ML applications has resulted in higher usage, management, cost of compute, storage, and networking resources. This session explains why identifying and choosing the right compute infrastructure is important to reduce your power consumption, costs, as well as managing complexities from training and deployment of ML models to production. We explain how AWS offers the ideal combination of high performance, cost-effective, and energy-efficient purpose-built ML tools and accelerators, optimized for ML applications. Learn how to choose the right infrastructure for your AI/ML workload requirements. The session also explores the highly performant, scalable, and cost-effective ML infrastructure from AWS, ranging from the latest GPUs to purpose-built accelerators including AWS Trainium, AWS Inferentia and Amazon EC2 P5 which are designed for training and running models.

Speaker: Smiti Guru, Senior Solutions Architect, AWS India

Choosing the right compute for ML training and inference

Generative AI tools Mapping AWS solutions to your needs

Practical guide to building serverless APIs on AWS

A day as a developer with generative AI tools on AWS

Generative AI The secrets of agents

Build chatbots with generative AI and third-party connectors for improved productivity

Customize your generative AI applications to deliver relevant, accurate, and customized responses

Accelerate end-to end DevOps with generative AI

Get started with generative AI on AWS From ideation to production

Build generative AI applications on AWS

Select the right large language model for your application use case

Develop and deploy production-ready generative AI applications

Managing and optimizing costs for AIML workloads

Using generative AI responsibly and securely on AWS

Migrate, modernize and build on AWS: Manage less. Build faster. Innovate more

Accelerate rapid innovation with modern applications

Migrate and modernize Best practices and anti-patterns

How AWS can help with your modernization journey

Compute best practices and design patterns for resiliency

Cloud infrastructure for modern applications

Accelerate your on-premise and VMware migration journey with AWS