Sunday, May 9, 2021

System design: Amazon EMR | My 30 minute study

 May 9, 2021

Easily run and scale Apache Spark, Hive, Presto, and other big data frameworks   

Amazon EMR is the industry-leading cloud big data platform for processing vast amounts of data using open source tools such as Apache SparkApache HiveApache HBaseApache FlinkApache Hudi, and Presto. Amazon EMR makes it easy to set up, operate, and scale your big data environments by automating time-consuming tasks like provisioning capacity and tuning clusters. With EMR you can run petabyte-scale analysis at less than half of the cost of traditional on-premises solutions and over 3x faster than standard Apache Spark. You can run workloads on Amazon EC2 instances, on Amazon Elastic Kubernetes Service (EKS) clusters, or on-premises using EMR on AWS Outposts.

Discover how Apache Hudi simplifies pipelines for change data capture (CDC) and privacy regulations

Loaded: 0%
Progress: 0%
Remaining Time-0:00
An introduction to Amazon EMR 

















No comments:

Post a Comment