We performed a comparison between Amazon EMR, Cloudera Distribution for Hadoop, and IBM InfoSphere BigInsights [EOL] based on real PeerSpot user reviews.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop."The project management is very streamlined."
"Amazon EMR is a good solution that can be used to manage big data."
"The solution helps us manage huge volumes of data."
"Amazon EMR's most valuable features are processing speed and data storage capacity."
"It has a variety of options and support systems."
"It allows users to access the data through a web interface."
"In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."
"When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
"Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis."
"It has the best proxy, security, and support features compared to open-source products."
"The main advantage is the storage is less expensive."
"The solution is reliable and stable, it fits our requirements."
"I don't see any performance issues."
"We had a data warehouse before all the data. We can process a lot more data structures."
"It is helpful to gather and process data."
"The features I find most valuable is that the solution is that it is easy to install and to work with. It starts with the installation and from there on the management is very simple and centralized."
"InfoSphere Streams was the one core product from the platform in which we were using. We were building a real-time response system and we built it on InfoSphere Streams."
"The problem for us is it starts very slow."
"There is no need to pay extra for third-party software."
"The most complicated thing is configuring to the cluster and ensure it's running correctly."
"Amazon EMR can improve by adding some features, such as megastore services and HiveServer2. Additionally, the user interface could be better, similar to what Apache service provides, cross-platform services."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"The initial setup was time-consuming."
"There is room for improvement in pricing."
"The product's features for storing data in static clusters could be better."
"The procedure for operations could be simplified."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"They should focus on upgrading their technical capabilities in the market."
"The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it."
"There are better solutions out there that have more features than this one."
"The dashboard could be improved."
"There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon."
"The governance aspect of the solution should be improved."
"The UI was not interactive: Responses used to be very slow and hang up at times."
More Cloudera Distribution for Hadoop Pricing and Cost Advice →
Earn 20 points