This session provides deep insights into how AWS analytics services offer the flexibility to run big data and analytics workloads through offerings such as EMR, Glue and Redshift. It also covers a number of exciting services introduced at re:Invent 2021, such as EMR on EKS for running Spark workloads on Kubernetes, the full-featured notebook IDE EMR Studio, managed Apache Airflow, and enhancements to Glue and Redshift.
This session will provide an overview of MLOps and its features and benefits in transforming your business. It will also give an overview of orchestration frameworks and tools, provide a demo, and cover the various data integration options available with AWS services.
The session will cover how cutting-edge AWS capabilities such as Amazon ECS Anywhere provide a simple yet powerful way to manage containerized applications on premises and anywhere else outside AWS. It will also explore how the Amazon ECS deployment circuit breaker can automatically discover and roll back unhealthy service deployments, which saves the resources that would be consumed by failed tasks and keeps indefinite deployment delays at bay. Finally, the session will explain how a new functionality, dubbed Amazon ECS Exec, allows all Amazon ECS users to “exec” into a container running inside a task deployed on either Amazon EC2 or AWS Fargate.
AWS experts will provide a comprehensive overview of how to attain optimal price performance using AWS Graviton2 processors. They will explain how these processors deliver up to 7x more performance, with 4x more compute cores, 5x faster memory, and 2x larger caches than first-generation Graviton processors. The experts will talk about the superior performance of services such as Amazon ElastiCache, Amazon RDS and Amazon EC2 when powered by AWS Graviton2 processors. Through a series of hands-on use cases and demos, they will also cover benchmarking applications, migrating RDS instances to AWS Graviton2 and deploying multi-architecture EKS clusters with Graviton2.
This hands-on session on Amazon Redshift will provide a comprehensive overview of Redshift’s capabilities. Watch DNB Solutions Architect Akshaya Rawat and Solutions Architect Tejal Rathod deep-dive into new features of Amazon Redshift like RA3, AQUA, and data sharing. Let them guide you through an architecture overview, and share some of the best practices to run Amazon Redshift-hosted data warehouses efficiently in your business.
Data warehousing, Amazon Redshift
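To give a flavour of the data sharing feature mentioned above, here is a minimal sketch (an illustration, not the speakers' demo code) of sharing live data between a producer and a consumer cluster through the Amazon Redshift Data API; the cluster identifiers, namespace GUIDs, database and user names are hypothetical placeholders.

```python
# A minimal sketch of Redshift data sharing via the Redshift Data API.
# Cluster identifiers, namespace GUIDs, database and user are hypothetical.
import boto3

client = boto3.client("redshift-data")

# On the producer cluster: create a datashare, add a schema, and grant
# usage to the consumer cluster's namespace.
for sql in [
    "CREATE DATASHARE salesshare",
    "ALTER DATASHARE salesshare ADD SCHEMA public",
    "GRANT USAGE ON DATASHARE salesshare TO NAMESPACE 'consumer-namespace-guid'",
]:
    client.execute_statement(
        ClusterIdentifier="producer-cluster",  # hypothetical RA3 cluster
        Database="dev",
        DbUser="awsuser",
        Sql=sql,
    )

# On the consumer cluster: create a database from the shared data.
client.execute_statement(
    ClusterIdentifier="consumer-cluster",
    Database="dev",
    DbUser="awsuser",
    Sql="CREATE DATABASE sales_db FROM DATASHARE salesshare "
        "OF NAMESPACE 'producer-namespace-guid'",
)
```

Note that data sharing is available on RA3 node types, which the session also covers.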
In this session, attendees will learn a unified way of pre-processing data and orchestrating ML workflows in SageMaker Studio. ML builders who want to leverage big data processing with EMR can now work entirely within SageMaker Studio. Experts will demonstrate how to integrate easily with big data tools and pre-process data using EMR. Furthermore, attendees will learn how to build complete workflows for training and deploying ML models, with metadata management and CI/CD, using SageMaker Pipelines. The session will end with a Q&A.
Amazon SageMaker Studio, Amazon Elastic MapReduce (EMR), Amazon Simple Storage Service (S3)
2022 will keep the security sector on its toes, as the threat landscape for the year continues to unveil itself, revealing new vulnerabilities. This makes security a complex and nuanced topic that must be adjusted to suit the needs and priorities of different organisations. In this session, Jasmine Maheshwari, Senior Solutions Architect, AWS, will discuss holistic approaches to security and simplify the meaning of security for all teams in an organisation. The session will also share guidelines on how to create a phased approach to improving an organisation's security posture.
This session, conducted by Venugopal Pai, Solutions Architect, AWS, explores different AWS services that deliver high-performance, highly available applications to an organisation’s end users - particularly Amazon CloudFront and AWS Global Accelerator.
Amazon CloudFront enables organisations to securely deliver content with low latency and high transfer speeds. Built for high performance, security and developer convenience, CloudFront is the perfect vehicle to demonstrate how attendees can speed up the delivery of content while fending off DDoS attacks.
Venugopal will also show attendees how to improve the availability and performance of applications for local or global users with AWS Global Accelerator. This networking service improves the performance of an organisation’s traffic by up to 60 percent using AWS’ global network infrastructure.
For any product or service, availability is a key factor in its success. Attaining the highest availability with the lowest latency can be a hard task, as bugs, hardware failures, network issues, unusual traffic spikes and other problems can prevent an organisation from achieving the four 9s of availability - 99.99 percent.
In this session, Jasmine Maheshwari, Senior Solutions Architect, AWS, will explain why this level of availability is so crucial for fintechs, and for any organisation with mission-critical applications, to achieve. The session will also guide you through important architectural aspects - availability, monitoring, change management, resiliency and disaster recovery (DR) - that help you achieve this coveted level of performance.
AWS Global Accelerator and Amazon CloudFront
From making more accurate predictions and gaining deeper insights from your data, to improving customer experiences and reducing operational overheads, AWS’ Machine Learning services, infrastructure and implementation resources are there to support you at every stage of the journey.
This session, curated for data scientists and developers, guides you on how to prepare, build, train and deploy high-quality Machine Learning models swiftly, by utilising a broad set of capabilities that are purpose-built for ML.
Accelerate innovation in your organisation through purpose-built tools for every step of ML development, including labeling, data preparation, feature engineering, statistical bias detection, auto-ML, training, tuning, hosting, explainability, monitoring, and workflows. Learn about the most comprehensive ML service, Amazon SageMaker - the "middle layer" in what AWS commonly refers to as the 3-tier AI/ML stack. This 200-level session highlights the main features of Amazon SageMaker, covering end-to-end ML deployment. It starts with SageMaker Studio and covers Ground Truth, Data Wrangler, Feature Store, Experiments, Debugger, Clarify (for explainability), and more.
The second session in this event is all about teaching modern-day ML users to leverage existing Big Data tools, which enable data engineering teams to explore and visualise large datasets. To do so, a unified analytics solution (e.g. Spark/Hive/metastore integrated with ML tools such as Python notebooks) is required to explore and prepare big data datasets and to build, train and deploy Machine Learning models in a single pane of glass.
Another big challenge is managing the many experiments that run across teams with CI/CD practices. Learn how to orchestrate these model-building workflows for production. This includes pre-processing, training, evaluation and registering the models, with their respective versions, lineage and metadata. It also includes model deployment with appropriate approvals and tests to validate accuracy before the models reach production.
This session, by Vishnu Kota, Solutions Architect, Amazon Internet Services Private Limited and Sumir Kumar, AWS WWCS Geo Solution Architect, Amazon Internet Services Private Limited, introduces attendees to SageMaker Studio, then demonstrates how you can use SageMaker Studio notebooks to easily and securely connect to Amazon EMR clusters to prepare vast amounts of data for analysis and reporting. Finally, the speakers will introduce and demonstrate SageMaker Pipelines to automate the ML CI/CD workflow steps of pre-processing, training, deployment and managing the model metadata.
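To make the pipeline portion concrete, the sketch below (an illustration, not the speakers' demo code) chains a pre-processing step into a training step with the SageMaker Python SDK; the role ARN, container image URIs and script name are hypothetical placeholders.

```python
# A minimal SageMaker Pipelines sketch: a processing step feeding a training
# step. Role ARN, image URIs and script names are hypothetical placeholders.
from sagemaker.estimator import Estimator
from sagemaker.inputs import TrainingInput
from sagemaker.processing import ProcessingOutput, ScriptProcessor
from sagemaker.workflow.pipeline import Pipeline
from sagemaker.workflow.steps import ProcessingStep, TrainingStep

role = "arn:aws:iam::111122223333:role/SageMakerExecutionRole"  # hypothetical

# Pre-process raw data with a custom script.
processor = ScriptProcessor(
    image_uri="<preprocessing-image-uri>",
    command=["python3"],
    role=role,
    instance_type="ml.m5.xlarge",
    instance_count=1,
)
step_process = ProcessingStep(
    name="PreprocessData",
    processor=processor,
    code="preprocess.py",  # hypothetical script
    outputs=[ProcessingOutput(output_name="train",
                              source="/opt/ml/processing/train")],
)

# Train on the processed output of the previous step.
estimator = Estimator(
    image_uri="<training-image-uri>",
    role=role,
    instance_type="ml.m5.xlarge",
    instance_count=1,
)
step_train = TrainingStep(
    name="TrainModel",
    estimator=estimator,
    inputs={"train": TrainingInput(
        s3_data=step_process.properties
                .ProcessingOutputConfig.Outputs["train"].S3Output.S3Uri)},
)

pipeline = Pipeline(name="demo-pipeline", steps=[step_process, step_train])
pipeline.upsert(role_arn=role)   # create or update the pipeline definition
pipeline.start()                 # kick off an execution
```

Wiring the training input to `step_process.properties` is what makes the dependency explicit, so the pipeline runs the steps in order and records lineage between them.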
Deliver apps quickly and efficiently to your customers by introducing automation into the stages of app development through Continuous Integration, Continuous Delivery and Continuous Deployment. Continuous integration and continuous delivery (CI/CD) techniques enable teams to increase agility and quickly release high-quality products. In this session, Vishal Gupta, Solutions Architect, AWS and Vivek Ghildiyal, Solutions Architect, AWS, talk about how CI/CD helps in building a successful product by enabling teams to scale through automated, safe, repeatable deployments. This covers code as well as infrastructure maintenance and deployment. Along the way, the speakers will also discuss how companies can integrate security controls into the CI/CD pipeline.
In session 2, Chandrashekar Munibudha, Principal Solutions Architect, AISPL, will dive deep into the practice of Chaos Engineering - the discipline of experimenting on a distributed system by inducing artificial failures. It is a process that builds confidence in your system’s ability to withstand turbulent conditions in production. Chandrashekar will present an overview of chaos engineering and AWS Fault Injection Simulator, a fully managed chaos engineering service that helps you improve application resiliency by making it easy and safe to perform controlled chaos engineering experiments on AWS. The session will include a demo of how to use AWS Fault Injection Simulator to make applications more resilient to failure.
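As a taste of what such a demo involves, the following minimal sketch starts a chaos experiment from an existing AWS Fault Injection Simulator experiment template (for example, one that stops random EC2 instances) and checks its state; the template ID is a hypothetical placeholder.

```python
# A minimal sketch of triggering a chaos experiment with AWS Fault Injection
# Simulator. The experiment template is assumed to exist already; its ID
# here is a hypothetical placeholder.
import boto3

fis = boto3.client("fis")

# Start an experiment from a pre-built template, then poll its state.
experiment = fis.start_experiment(experimentTemplateId="EXT1AbCdEfGhIjKl")
experiment_id = experiment["experiment"]["id"]

state = fis.get_experiment(id=experiment_id)["experiment"]["state"]
print(state["status"], state.get("reason", ""))
```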
Achieve greater scalability, more flexibility, and quicker time to release through serverless workflows. Chandrashekar Munibudha, Principal Solutions Architect, AISPL, will guide you on how to coordinate business workflows among distributed services using a simple, yet powerful, fully-managed service called AWS Step Functions. The session will also discuss how to build resilient, modern applications, and reduce costs using Step Functions.
AWS Step Functions is a serverless function orchestrator that makes it easy to sequence AWS Lambda functions and multiple AWS services into business-critical applications. Through its visual interface, you can create and run a series of check-pointed and event-driven workflows that maintain the application state. The output of one step acts as an input to the next. Each step in your application executes in order, as defined by your business logic.
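To illustrate that step-to-step data flow, here is a minimal sketch (an example, not taken from the session) that defines a two-step state machine in Amazon States Language and creates it with boto3; the Lambda ARNs and IAM role are hypothetical placeholders.

```python
# A minimal sketch of a Step Functions state machine that sequences two
# Lambda functions; each state's output becomes the next state's input.
# The Lambda ARNs and IAM role below are hypothetical placeholders.
import json
import boto3

definition = {
    "StartAt": "Validate",
    "States": {
        "Validate": {
            "Type": "Task",
            "Resource": "arn:aws:lambda:us-east-1:111122223333:function:validate",
            "Next": "Process",   # Validate's output is passed to Process
        },
        "Process": {
            "Type": "Task",
            "Resource": "arn:aws:lambda:us-east-1:111122223333:function:process",
            "End": True,
        },
    },
}

sfn = boto3.client("stepfunctions")
sfn.create_state_machine(
    name="demo-order-workflow",
    definition=json.dumps(definition),
    roleArn="arn:aws:iam::111122223333:role/StepFunctionsExecutionRole",
)
```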
The fourth event in the LevelUP series will focus on the domain of containers. The event will feature two engaging sessions conducted by experienced solutions architects from AWS.
In this session, Kayalvizhi Kandasamy, a Senior Solutions Architect at AWS who works with digital native companies to support their innovations, will take you through the benefits of Amazon EMR on Amazon EKS, which essentially enables you to use Amazon EMR to run Apache Spark workloads on Amazon EKS. The speaker will deep-dive into its technical aspects with clarity. You will discover how you can simplify running Big Data frameworks on Kubernetes without the hassle of managing open source code, and deliver better performance while consolidating the infrastructure.
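For a concrete sense of the model, the sketch below submits a Spark job to an EKS-backed virtual cluster through the EMR on EKS (`emr-containers`) API; the virtual cluster ID, execution role and S3 entry point are hypothetical placeholders, not the session's demo.

```python
# A minimal sketch of submitting a Spark job to an EKS cluster through
# EMR on EKS (the 'emr-containers' API). The virtual cluster ID, role ARN
# and S3 path are hypothetical placeholders.
import boto3

emr = boto3.client("emr-containers")

emr.start_job_run(
    name="spark-pi",
    virtualClusterId="abcd1234efgh5678",           # hypothetical virtual cluster
    executionRoleArn="arn:aws:iam::111122223333:role/EMRContainersJobRole",
    releaseLabel="emr-6.5.0-latest",               # EMR runtime for the job
    jobDriver={
        "sparkSubmitJobDriver": {
            "entryPoint": "s3://my-bucket/scripts/pi.py",  # hypothetical script
            "sparkSubmitParameters": "--conf spark.executor.instances=2",
        }
    },
)
```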
In this session you will explore the networking, storage, security, scaling, observability and logging aspects of Amazon Elastic Kubernetes Service (Amazon EKS), a managed container service to run and scale Kubernetes applications in the cloud. The speaker, Jayesh Vartak, a Solutions Architect at AWS who focuses on containers, application modernisation, infrastructure, big data and analytics, will delve into the EKS architecture and its essential elements. You will also learn about scaling a sample application using the Horizontal Pod Autoscaler (HPA), using CloudWatch Container Insights for metrics and logging, and EKS’ seamless integration with IAM (Identity and Access Management) for Role-Based Access Control (RBAC).
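As a preview of the HPA portion, here is a minimal sketch (not the session's demo) that creates a CPU-based Horizontal Pod Autoscaler for a sample Deployment using the official Kubernetes Python client; the deployment name and thresholds are hypothetical.

```python
# A minimal sketch of creating a Horizontal Pod Autoscaler for a sample
# Deployment on EKS, using the official Kubernetes Python client. The
# deployment name and scaling thresholds are hypothetical.
from kubernetes import client, config

config.load_kube_config()  # reads the kubeconfig from `aws eks update-kubeconfig`

hpa = client.V1HorizontalPodAutoscaler(
    metadata=client.V1ObjectMeta(name="sample-app-hpa"),
    spec=client.V1HorizontalPodAutoscalerSpec(
        scale_target_ref=client.V1CrossVersionObjectReference(
            api_version="apps/v1", kind="Deployment", name="sample-app"),
        min_replicas=2,
        max_replicas=10,
        target_cpu_utilization_percentage=60,  # scale out above 60% CPU
    ),
)

client.AutoscalingV1Api().create_namespaced_horizontal_pod_autoscaler(
    namespace="default", body=hpa)
```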
The fifth event in the LevelUP series will be in the domain of Analytics. The event will feature three distinct sessions, conducted by Solutions Architects from AWS.
Learn how to extract meaningful insights about your customers from volumes of raw Big Data in the first session of the event. Speakers Priya Jathar and Tejal Rathod, Solutions Architects, AWS, will share how DNB customers are turning Big Data into meaningful business insights. They will also cover an analytics pipeline overview, and focus on the key aspects of processing data, deriving analysis, and visualising insights. The session will wrap up with demos on data preparation and visualisation.
In this session, the speakers will be utilising different solutions from AWS, spanning the analytics pipeline on AWS, Big Data processing, Big Data analysis and Big Data visualisation, and AWS analytics services like AWS Glue DataBrew, AWS Glue, Amazon Redshift, Amazon Athena, and Amazon QuickSight.
Every organisation has a different threshold for handling Big Data. As the tools and solutions for working with big datasets evolve, so does the data. Your organisation may require an overarching system to manage large volumes of data that can be analysed for business purposes - this is Big Data architecture. You will also need to establish its architectural components before embarking on a Big Data project. Implementing best practices and key principles in your architecture strategy will help create a well-rounded approach that ensures your data addresses a wide range of business needs.
This session will provide an ‘Analytics pipeline 101’ overview of commonly seen technical architecture patterns in the Big Data space using AWS analytics services. Watch Priya Jathar and Tejal Rathod, Solutions Architects, AWS, walk through analytics architecture patterns like batch/streaming/ad-hoc analytics, serverless analytics, ML integrations, and Data Mesh. The session also covers solutions implemented by customers in industries like gaming and retail, to name a few.
In this session, the speakers will be utilising different solutions from AWS, including AWS modern data architecture, AWS analytics architecture patterns and AWS analytics services, with analytics examples from the payments, gaming, retail and logistics industry verticals.
Data Lakes enable organisations to store all their structured and unstructured data at any scale, and run different types of analytics to make better, more insightful decisions. However, ingesting streaming data into a data lake has limitations, owing to traditional data lake platforms’ limited support for transactions and incremental data processing. In this session, Dipta Shekhar, Enterprise Solution Architect, AWS and Akshaya Rawat, Solutions Architect, AISPL, introduce the concept of transactional data lake platforms and explain how they solve the problem. The session presents an example transactional data lake platform on AWS using Apache Hudi.
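To illustrate what a transactional data lake write looks like, here is a minimal PySpark sketch of a Hudi upsert (an illustration under stated assumptions, not the session's demo); the table name, key fields and S3 path are hypothetical.

```python
# A minimal PySpark sketch of an upsert into a Hudi table on S3; the table
# name, keys and S3 path are hypothetical. On EMR, the Hudi libraries are
# pre-installed; elsewhere the Hudi Spark bundle jar must be on the classpath.
from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .appName("hudi-upsert-demo")
         # Hudi requires Kryo serialization
         .config("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
         .getOrCreate())

# A batch of new/changed records, e.g. arriving from a stream or CDC feed.
incremental_df = spark.createDataFrame(
    [("o-1001", "2022-01-15T10:00:00", 250.0)],
    ["order_id", "updated_at", "amount"],
)

hudi_options = {
    "hoodie.table.name": "orders",
    "hoodie.datasource.write.recordkey.field": "order_id",     # primary key
    "hoodie.datasource.write.precombine.field": "updated_at",  # latest wins
    "hoodie.datasource.write.operation": "upsert",             # transactional upsert
}

(incremental_df.write
    .format("hudi")
    .options(**hudi_options)
    .mode("append")
    .save("s3://my-datalake/orders/"))  # hypothetical lake location
```

The upsert operation and the precombine field are what give the lake its transactional, incremental-processing behaviour: re-ingested records update in place rather than piling up as duplicates.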
The sixth and final event in the LevelUP series will dive even deeper into all things analytics. The event will feature two sessions, conducted by a seasoned Solutions Architect from AWS.
When it comes to developing, visualising, and debugging data engineering and data science applications, EMR Studio’s integrated development environment makes it easy for any data scientist or data engineer. In this session, Sumir Kumar, Solutions Architect, AISPL, will cover the use of Jupyter Notebooks, debugging with tools like the Spark UI and YARN Timeline Service, and collaborating with peers using GitHub and Bitbucket - all within the EMR Studio IDE.
The session will further guide you on how to schedule your notebooks as part of a data pipeline. Additionally, you’ll learn how to manage the big data platform by integrating EMR Studio with your corporate identity provider. You’ll also be able to define different roles for different data engineering team members, with appropriate access to the data lakes and metadata.
The session will feature three demos:
• First demo
Develop, visualise, and collaborate on big data applications with the EMR Studio environment and notebooks.
• Second demo
Run Spark/Hive jobs on EMR clusters and debug them using the Spark UI, Tez UI, and YARN Timeline Service.
• Third demo
Parameterise the notebooks and schedule them as part of a data pipeline without any additional tools (see the sketch after this list).
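As referenced in the third demo, the following minimal sketch shows one way to run a parameterised notebook through the EMR API, which a scheduler such as Amazon MWAA or Amazon EventBridge could invoke; the editor ID, cluster ID and parameters are hypothetical placeholders.

```python
# A minimal sketch of running a parameterised EMR notebook via the EMR API.
# Editor, cluster and role identifiers below are hypothetical placeholders.
import json
import boto3

emr = boto3.client("emr")

emr.start_notebook_execution(
    EditorId="e-ABCDEFGHIJKL",                # hypothetical notebook/editor ID
    RelativePath="demo/daily_report.ipynb",   # notebook path inside the editor
    NotebookExecutionName="daily-report",
    NotebookParams=json.dumps({"run_date": "2022-01-15"}),  # injected parameters
    ExecutionEngine={"Id": "j-ABCDEFGHIJKL", "Type": "EMR"},  # target EMR cluster
    ServiceRole="EMR_Notebooks_DefaultRole",
)
```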
Empower your enterprise by automating the process of converting data into insights, which in turn can boost your business, facilitate your goals, and help you understand and retain customers. Session two of this event teaches you how to do that by diving into Managed Analytics.
AWS customers worldwide use open-source distributions including Spark, Elasticsearch, Apache Kafka, and more for analytics. Learn how to deliver insights more quickly and cost-effectively by moving your big data processing, log analytics and search, and real-time streaming and analytics to the fully managed AWS analytics services in your lake house architecture.
Furthermore, the session will help you understand what the AWS lake house architecture is, the benefits of moving to managed big data analytics services, how to move to managed operational analytics, and how to move to managed real-time analytics. Finally, the speaker - Sumir Kumar, Solutions Architect, AISPL - will conduct a live demo during the session.
Amazon EMR Studio, AWS analytics services, AWS lake house architecture