Scalable data preparation & ML using Apache Spark on AWS (Level 200)

Analyzing, transforming, and preparing large amounts of data is a foundational step of any data science and machine learning (ML) workflow. This session shows how to build end-to-end data preparation and ML workflows. We explain how to connect from Amazon SageMaker Studio to Apache Spark, for fast data preparation, in your data processing environments on Amazon EMR and AWS Glue interactive sessions. Learn how to access data governed by AWS Lake Formation so you can interactively query, explore, and visualize data, and run and debug Spark jobs as you prepare large-scale data for use in ML.
Speaker: Suman Debnath, Principal Developer Advocate, Data Engineering, AWS
Duration: 30 mins

Innovate with data and machine learning

Accelerate rapid innovation with data and AI/ML

Generative AI on AWS

Generative AI platform on AWS

Smart traffic management

Troubleshooting with augmented observability and generative AI

Brick maestro with AI/ML and HPC on AWS

Generative AI-powered conversational intelligence - audio, chats supporting diverse languages

Transform digital experiences with generative AI: Intelligent video/audio Q&A

Build generative AI applications with no code/low code solutions on AWS

Build a personalized registration application using generative AI and AWS serverless

Codenator: Enhancing user productivity through AI-powered code generation and secure execution

Choosing the right AI/ML and generative AI tools for your use case

Architecture patterns for building generative AI applications

Cost-optimizing AI/ML workloads on AWS

Select the right large language model for your generative AI use case

LLMOps: Lifecycle of a LLM

Build an automated large language model evaluation pipeline on AWS

Using generative AI responsibly and securely on AWS

Transform your organization with intelligent document processing (IDP) on AWS