Improve operational efficiencies of Apache Iceberg tables built on Amazon S3 data lakes Source Cluster: AWS Big Data Source Node: 2107375 Time Stamp: May 24, 2023
How Zoom implemented streaming log ingestion and efficient GDPR deletes using Apache Hudi on Amazon EMR Source Cluster: AWS Big Data Source Node: 2096909 Time Stamp: May 16, 2023
Improve reliability and reduce costs of your Apache Spark workloads with vertical autoscaling on Amazon EMR on EKS Source Cluster: AWS Big Data Source Node: 2084096 Time Stamp: May 4, 2023
Build, deploy, and run Spark jobs on Amazon EMR with the open-source EMR CLI tool Source Cluster: AWS Big Data Source Node: 2081864 Time Stamp: May 3, 2023
Simplify and speed up Apache Spark applications on Amazon Redshift data with Amazon Redshift integration for Apache Spark Source Cluster: AWS Big Data Source Node: 2066577 Time Stamp: Apr 20, 2023
Improved ML model deployment using Amazon SageMaker Inference Recommender Source Cluster: AWS Machine Learning Source Node: 2066056 Time Stamp: Apr 20, 2023
Accelerate HiveQL with Oozie to Spark SQL migration on Amazon EMR Source Cluster: AWS Big Data Source Node: 2065422 Time Stamp: Apr 19, 2023
Connect Amazon EMR and RStudio on Amazon SageMaker Source Cluster: AWS Machine Learning Source Node: 2069307 Time Stamp: Apr 17, 2023
How CyberSolutions built a scalable data pipeline using Amazon EMR Serverless and the AWS Data Lab Source Cluster: AWS Big Data Source Node: 2062059 Time Stamp: Apr 17, 2023
Amazon EMR on EKS widens the performance gap: Run Apache Spark workloads 5.37 times faster and at 4.3 times lower cost Source Cluster: AWS Big Data Source Node: 2056599 Time Stamp: Apr 12, 2023
Push Amazon EMR step logs from Amazon EC2 instances to Amazon CloudWatch logs Source Cluster: AWS Big Data Source Node: 2051421 Time Stamp: Apr 7, 2023
Build event-driven data pipelines using AWS Controllers for Kubernetes and Amazon EMR on EKS Source Cluster: AWS Big Data Source Node: 2040450 Time Stamp: Mar 30, 2023
Accelerating revenue growth with real-time analytics: Poshmark’s journey Source Cluster: AWS Big Data Source Node: 2021482 Time Stamp: Mar 20, 2023
Accelerate time to insight with Amazon SageMaker Data Wrangler and the power of Apache Hive Source Cluster: AWS Machine Learning Source Node: 2002994 Time Stamp: Mar 10, 2023
Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena Source Cluster: AWS Big Data Source Node: 2003350 Time Stamp: Mar 10, 2023
Top 6 Amazon S3 Interview Questions Source Cluster: Analytics Vidhya Source Node: 1995232 Time Stamp: Mar 5, 2023
Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless Source Cluster: AWS Big Data Source Node: 1990644 Time Stamp: Mar 3, 2023
Use Apache Iceberg in a data lake to support incremental data processing Source Cluster: AWS Big Data Source Node: 1988910 Time Stamp: Mar 2, 2023
Reduce Amazon EMR cluster costs by up to 19% with new enhancements in Amazon EMR Managed Scaling Source Cluster: AWS Big Data Source Node: 1985656 Time Stamp: Feb 28, 2023
How SafeGraph built a reliable, efficient, and user-friendly Apache Spark platform with Amazon EMR on Amazon EKS Source Cluster: AWS Big Data Source Node: 1972501 Time Stamp: Feb 21, 2023