Amazon EMR vs Cloudera Distribution for Hadoop comparison

Find out what your peers are saying about Amazon EMR vs. Cloudera Distribution for Hadoop and other solutions. Updated: January 2022.
566,406 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
Amazon EMR:
  • "One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
  • "The initial setup is pretty straightforward."
  • "This is the best tool for hosts and it's really flexible and scalable."

Cloudera Distribution for Hadoop:
  • "We also really like the Cloudera community. You can have any question and will have your answer within a few hours."
  • "The most valuable feature is Impala, the querying engine, which is very fast."
  • "I don't see any performance issues."
  • "The main advantage is the storage is less expensive."
  • "The file system is a valuable feature."
  • "The solution is reliable and stable, it fits our requirements."
  • "CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
  • "The most valuable feature is Kubernetes."

Cons
Amazon EMR:
  • "The dashboard management could be better. Right now, it's lacking a bit."
  • "The most complicated thing is configuring to the cluster and ensure it's running correctly."
  • "Amazon EMR is continuously improving, but maybe something like CI/CD out-of-the-box or integration with Prometheus Grafana."

Cloudera Distribution for Hadoop:
  • "The procedure for operations could be simplified."
  • "The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS."
  • "The price of this solution could be lowered."
  • "Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
  • "It could be faster and more user-friendly."
  • "There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon."
  • "Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
  • "The initial setup of Cloudera is difficult."

Pricing and Cost Advice
Amazon EMR:
  • "You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."

Cloudera Distribution for Hadoop:
  • "When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
  • "The price could be better for the product."
  • "I haven't bought a license for this solution. I'm only using the Apache license version."
  • "Cloudera requires a license to use."

    Questions from the Community
    Top Answer: The initial setup is pretty straightforward.
    Top Answer: The price of the solution may be a bit more than other competitors, such as Microsoft.
    Top Answer: The dashboard management could be better. Right now, it's lacking a bit. I'd like more of a remote connection between my computer and the solution. We have multi-factor authentication, and at one…
    Top Answer: The CDP I used was almost 2.5 years ago on-premise. I would rate it 8/10. I did not have much to compare against in those days and due to Cloud not accessible in my organisation. But, definitely CDP…
    Top Answer: The solution is reliable and stable, it fits our requirements.
    Top Answer: I haven't bought a license for this solution. I'm only using the Apache license version.
    Metric                    Amazon EMR               Cloudera Distribution for Hadoop
    Ranking                   4th out of 22 in Hadoop  2nd out of 22 in Hadoop
    Views                     1,742                    4,556
    Comparisons               1,485                    3,215
    Reviews                   3                        7
    Average Words per Review  501                      354
    Rating                    6.7                      7.4
    Also Known As
    Amazon Elastic MapReduce
    Overview
    Amazon Elastic MapReduce (Amazon EMR) is a web service that makes it easy to quickly and cost-effectively process vast amounts of data. Amazon EMR simplifies big data processing, providing a managed Hadoop framework that makes it easy, fast, and cost-effective for you to distribute and process vast amounts of your data across dynamically scalable Amazon EC2 instances.
    Cloudera Distribution for Hadoop is the world's most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, and interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.
    Sample Customers
    Amazon EMR: Yelp
    Cloudera Distribution for Hadoop: 37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
    Top Industries
    VISITORS READING REVIEWS
    Computer Software Company: 30%
    Media Company: 20%
    Comms Service Provider: 14%
    Financial Services Firm: 7%
    REVIEWERS
    Financial Services Firm: 29%
    Computer Software Company: 24%
    Marketing Services Firm: 12%
    Insurance Company: 12%
    VISITORS READING REVIEWS
    Computer Software Company: 27%
    Comms Service Provider: 22%
    Financial Services Firm: 10%
    Government: 6%
    Company Size
    REVIEWERS
    Small Business: 29%
    Midsize Enterprise: 43%
    Large Enterprise: 29%
    REVIEWERS
    Small Business: 25%
    Midsize Enterprise: 25%
    Large Enterprise: 50%

    Amazon EMR is ranked 4th in Hadoop with 3 reviews while Cloudera Distribution for Hadoop is ranked 2nd in Hadoop with 10 reviews. Amazon EMR is rated 6.6, while Cloudera Distribution for Hadoop is rated 7.6. The top reviewer of Amazon EMR writes "Stable, scalable, and has all the necessary distributions". On the other hand, the top reviewer of Cloudera Distribution for Hadoop writes "Performs well and the technical support is helpful, but the upgrade process needs to be consolidated". Amazon EMR is most compared with Hortonworks Data Platform, Apache Spark, HPE Ezmeral Data Fabric, Spark SQL and Qubole Data Services, whereas Cloudera Distribution for Hadoop is most compared with HPE Ezmeral Data Fabric, Apache Spark, SingleStore DB, InfluxDB and SAP HANA.

    We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.