All Content tagged with Amazon EMR

Amazon EMR is a cloud big data platform for running large-scale distributed data processing jobs, interactive SQL queries, and machine learning (ML) applications using open-source analytics frameworks such as Apache Spark, Apache Hive, and Presto.

444 results
We're currently running EMR clusters with release version 6.10.0 where instances are patched using SSM "AWS-RunPatchBaseline" during bootstrap. We're experiencing several critical issues: cluster fail...
0 answers, 0 votes, 21 views
AWS
asked 2 days ago
Approximately how long should EMR batch processing and storing the data in Redshift take for 1 TB of data with simple transformations? My data has the following characteristics: * File size varies from...
1 answers, 0 votes, 45 views
asked 2 days ago
I have a use case with: * 60 MB/sec data volume * Near-real-time AI/data science use cases must be supported as downstream applications * It's not an ultra-low-latency use case; even 60 seconds of...
1 answers, 0 votes, 38 views
asked 2 days ago
After upgrading EMR from 6.5 to 7.5, I am getting the following error: OpensslCipher: Failed to load OpenSSL Cipher. java.lang.UnsatisfiedLinkError: EVP_CIPHER_CTX_block_size Based on the HADOOP-18994 Failed...
0 answers, 0 votes, 29 views
asked 18 days ago
I would like to confirm whether it is possible to configure an Amazon EMR cluster with mixed instance types, combining both Graviton-based and non-Graviton instances within the same cluster. I'm going...
1 answers, 0 votes, 31 views
asked 19 days ago
I'm trying to run an EMR notebook to create a delta table in S3. EMR Cluster Version: emr-7.7.0 Installed Applications: Hadoop 3.4.0, Hive 3.1.3, JupyterEnterpriseGateway 2.6.0, Livy 0.8.0, Spark 3.5...
0 answers, 0 votes, 9 views
asked 25 days ago
Hi everyone, I am researching S3 backup, and one question is: what is the impact on the system or users? I think with backup solutions (S3 Versioning, replication, AWS Backup, custom solution like ...
2 answers, 0 votes, 49 views
asked a month ago
Here's a link to my sample calculation: https://calculator.aws/#/estimate?id=e1754f12531b5a51f332143cb5e5a53e4a626f34 I read in another answer that short Serverless workloads are cheaper in general t...
1 answers, 0 votes, 98 views
asked a month ago
Hi Mate, I have steps running on EMR that were working until 13 January 2025. When I ran the job today, it started failing with an error like: **AttributeError: module 'awscrt.chec...
3 answers, 0 votes, 501 views
asked 2 months ago
Hello Team, we followed https://github.com/aws-samples/aws-emr-utilities/blob/main/utilities/emr-ec2-custom-python3/README.md#2-container-images-on-yarn and are getting issues when we try to deploy with d...
2 answers, 0 votes, 56 views
asked 2 months ago
Hello, after switching to a custom AMI on an EMR on EC2 cluster, spark-submit is failing: ```confs: [default] 0 artifacts copied, 60 already retrieved (0kB/30ms) 25/01/23 13:11:37 WAR...
1 answers, 0 votes, 52 views
asked 2 months ago
Hello, I have an EMR cluster running an old release version, and our security tool has flagged some security issues. I want to update the software on the EMR cluster. What is the best way to do this? Thanks.
1 answers, 0 votes, 86 views
asked 2 months ago