Amazon EC2 vs Apache Spark vs Azure Stream Analytics comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
2,542 views|1,652 comparisons
98% willing to recommend
Apache Logo
2,893 views|2,256 comparisons
89% willing to recommend
Microsoft Logo
9,766 views|8,235 comparisons
95% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Amazon EC2, Apache Spark, and Azure Stream Analytics based on real PeerSpot user reviews.

Find out what your peers are saying about Amazon Web Services (AWS), Apache, Zadara and others in Compute Service.
To learn more, read our detailed Compute Service Report (Updated: April 2024).
769,334 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The most valuable features of Amazon EC2 are content delivery and adaptability.""The product is easy to set up.""Amazon EC2 is highly scalable.""The most valuable features are the scalability options, low maintenance, and options to upgrade. AWS support is also pretty good. The generation upgrade is pretty simple and standardized.""The amount of bandwidth has been most valuable.""What I found most valuable in Amazon EC2 is that you only pay for what you use, versus an on-premise deployment that requires you to pay for the cost of the server. When it's on-premise, you'll need to meet more specifications and requirements, and the purchasing process even takes time. As Amazon EC2 is cloud-based, you'll only pay when you use the service.""We find it easy to scale.""Its ease of use is valuable."

More Amazon EC2 Pros →

"With Hadoop-related technologies, we can distribute the workload with multiple commodity hardware.""It's easy to prepare parallelism in Spark, run the solution with specific parameters, and get good performance.""It provides a scalable machine learning library.""It is highly scalable, allowing you to efficiently work with extensive datasets that might be problematic to handle using traditional tools that are memory-constrained.""We use it for ETL purposes as well as for implementing the full transformation pipelines.""I appreciate everything about the solution, not just one or two specific features. The solution is highly stable. I rate it a perfect ten. The solution is highly scalable. I rate it a perfect ten. The initial setup was straightforward. I recommend using the solution. Overall, I rate the solution a perfect ten.""It is useful for handling large amounts of data. It is very useful for scientific purposes.""I like that it can handle multiple tasks parallelly. I also like the automation feature. JavaScript also helps with the parallel streaming of the library."

More Apache Spark Pros →

"We use Azure Stream Analytics for simulation and internal activities.""The life cycle, report management and crash management features are great.""Real-time analytics is the most valuable feature of this solution. I can send the collected data to Power BI in real time.""Provides deep integration with other Azure resources.""The solution's most valuable feature is its ability to create a query using SQ.""The solution's technical support is good.""I appreciate this solution because it leverages open-source technologies. It allows us to utilize the latest streaming solutions and it's easy to develop.""The way it organizes data into tables and dashboards is very helpful."

More Azure Stream Analytics Pros →

Cons
"The availability and response time of the free technical support can be improved.""The IP changes whenever we restart which is frustrating.""Regarding availability, a noticeable improvement would be the possibility of more load balancing configurations and the deployment of more datacenters, mainly in Latin America.""The initial setup could be easier because many keys are required for access.""One of the challenges is the AMI upgrades.""Its price can be reduced.""I would like to see more variety in the operating system images used to create test environments in EC2. There should be more versions and releases. Sometimes, you want to test an update from an old release to a higher version, but you can’t do that with the new images available. You have to use your own.""They have to provide clarity on pricing. It's not transparent."

More Amazon EC2 Cons →

"We use big data manager but we cannot use it as conditional data so whenever we're trying to fetch the data, it takes a bit of time.""This solution currently cannot support or distribute neural network related models, or deep learning related algorithms. We would like this functionality to be developed.""They could improve the issues related to programming language for the platform.""Include more machine learning algorithms and the ability to handle streaming of data versus micro batch processing.""Apache Spark could improve the connectors that it supports. There are a lot of open-source databases in the market. For example, cloud databases, such as Redshift, Snowflake, and Synapse. Apache Spark should have connectors present to connect to these databases. There are a lot of workarounds required to connect to those databases, but it should have inbuilt connectors.""The product could improve the user interface and make it easier for new users.""The solution needs to optimize shuffling between workers.""Apache Spark provides very good performance The tuning phase is still tricky."

More Apache Spark Cons →

"We would like to have centralized platform altogether since we have different kind of options for data ingestion. Sometimes it gets difficult to manage different platforms.""Easier scalability and more detailed job monitoring features would be helpful.""It is not complex, but it requires some development skills. When the data is sent from Azure Stream Analytics to Power BI, I don't have the access to modify the data. I can't customize or edit the data or do some queries. All queries need to be done in the Azure Stream Analytics.""If something goes wrong, it's very hard to investigate what caused it and why.""Sometimes when we connect Power BI, there is a delay or it throws up some errors, so we're not sure.""The solution doesn't handle large data packets very efficiently, which could be improved upon.""The solution could be improved by providing better graphics and including support for UI and UX testing.""The solution's interface could be simpler to understand for non-technical people."

More Azure Stream Analytics Cons →

Pricing and Cost Advice
  • "Pricing appears to be cheap, however, it is extremely difficult in calculating what something will cost."
  • "It has helped to reduce costs with infrastructure."
  • "EC2 pricing is somewhat transparent, in that AWS provides pricing for all instance types. However, the number of pricing options can be confusing."
  • "For our usage, the cost is approximately $20,000 to $23,000 per month."
  • "There is a license required to use this solution and we pay on a monthly basis."
  • "The price is reasonable, but there is definitely an opportunity to lower it in instances which are of a higher configuration, because they have been typically used for the long term."
  • "Amazon EC2 has a pay-as-you-use cost model."
  • "The clients have found the billing of Amazon EC2 good, but the price could be less high. There is a monthly subscription to use the solution."
  • More Amazon EC2 Pricing and Cost Advice →

  • "Since we are using the Apache Spark version, not the data bricks version, it is an Apache license version, the support and resolution of the bug are actually late or delayed. The Apache license is free."
  • "Apache Spark is open-source. You have to pay only when you use any bundled product, such as Cloudera."
  • "We are using the free version of the solution."
  • "Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."
  • "Apache Spark is an expensive solution."
  • "Spark is an open-source solution, so there are no licensing costs."
  • "On the cloud model can be expensive as it requires substantial resources for implementation, covering on-premises hardware, memory, and licensing."
  • "It is an open-source solution, it is free of charge."
  • More Apache Spark Pricing and Cost Advice →

  • "The cost of this solution is less than competitors such as Amazon or Google Cloud."
  • "We pay approximately $500,000 a year. It's approximately $10,000 a year per license."
  • "I rate the price of Azure Stream Analytics a four out of five."
  • "The licensing for this product is payable on a 'pay as you go' basis. This means that the cost is only based on data volume, and the frequency that the solution is used."
  • "There are different tiers based on retention policies. There are four tiers. The pricing varies based on steaming units and tiers. The standard pricing is $10/hour."
  • "The current price is substantial."
  • "Azure Stream Analytics is a little bit expensive."
  • "The product's price is at par with the other solutions provided by the other cloud service providers in the market."
  • More Azure Stream Analytics Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Compute Service solutions are best for your needs.
    769,334 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:Amazon EC2 is really reliable and provides great flexibility.
    Top Answer:The solution has different pricing models, and its cost differs when you purchase it for one year or three years.
    Top Answer:The solution’s pricing and downtimes could be improved. I would like to have a better pricing model for Amazon EC2… more »
    Top Answer:We use Spark to process data from different data sources.
    Top Answer:In data analysis, you need to take real-time data from different data sources. You need to process this in a subsecond… more »
    Top Answer:Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics… more »
    Top Answer:The product's price is at par with the other solutions provided by the other cloud service providers in the market.
    Top Answer:Azure Stream Analytics was not meeting our company's expectations because it was tedious to change the job, write… more »
    Ranking
    3rd
    out of 16 in Compute Service
    Views
    2,542
    Comparisons
    1,652
    Reviews
    42
    Average Words per Review
    341
    Rating
    8.6
    5th
    out of 16 in Compute Service
    Views
    2,893
    Comparisons
    2,256
    Reviews
    26
    Average Words per Review
    444
    Rating
    8.7
    3rd
    out of 38 in Streaming Analytics
    Views
    9,766
    Comparisons
    8,235
    Reviews
    14
    Average Words per Review
    430
    Rating
    8.2
    Comparisons
    Also Known As
    Amazon Elastic Compute Cloud, EC2
    ASA
    Learn More
    Overview

    Amazon Elastic Compute Cloud (Amazon EC2) is a web service that provides secure, resizable compute capacity in the cloud. It is designed to make web-scale cloud computing easier for developers.

    Amazon EC2’s simple web service interface allows you to obtain and configure capacity with minimal friction. It provides you with complete control of your computing resources and lets you run on Amazon’s proven computing environment. Amazon EC2 reduces the time required to obtain and boot new server instances to minutes, allowing you to quickly scale capacity, both up and down, as your computing requirements change. Amazon EC2 changes the economics of computing by allowing you to pay only for capacity that you actually use. Amazon EC2 provides developers the tools to build failure resilient applications and isolate them from common failure scenarios.

    Spark provides programmers with an application programming interface centered on a data structure called the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines, that is maintained in a fault-tolerant way. It was developed in response to limitations in the MapReduce cluster computing paradigm, which forces a particular linear dataflowstructure on distributed programs: MapReduce programs read input data from disk, map a function across the data, reduce the results of the map, and store reduction results on disk. Spark's RDDs function as a working set for distributed programs that offers a (deliberately) restricted form of distributed shared memory

    Azure Stream Analytics is a robust real-time analytics service that has been designed for critical business workloads. Users are able to build an end-to-end serverless streaming pipeline in minutes. Utilizing SQL, users are able to go from zero to production with a few clicks, all easily extensible with unique code and automatic machine learning abilities for the most advanced scenarios.

    Azure Stream Analytics has the ability to analyze and accurately process exorbitant volumes of high-speed streaming data from numerous sources at the same time. Patterns and scenarios are quickly identified and information is gathered from various input sources, such as social media feeds, applications, clickstreams, sensors, and devices. These patterns can then be implemented to trigger actions and launch workflows, such as feeding data to a reporting tool, storing data for later use, or creating alerts. Azure Stream Analytics is also offered on Azure IoT Edge runtime, so the data can be processed on IoT devices.

    Top Benefits

    • User friendly: Azure Stream Analytics is very straightforward and easy to use. Out of the box and with a few clicks, users are able to connect to numerous sources and sinks, and easily develop an end-to-end pipeline. Stream Analytics can easily connect to Azure IoT Hub and Azure Event Hub for streaming ingestion, in addition to connecting with Azure Blob storage for historical data ingestion.

    • Flexible deployment: For low-latency analytics, Azure Stream Analytics can run on Azure Stack or IoT edge. For large-scale analytics, the solution can run in the cloud. Azure Stream Analytics uses the same query language and tools for both the cloud and the edge, facilitating an easier process for developers to design exceptional hybrid architectures for streaming processes.

    • Cost-effective: With Azure Stream Analytics, users only pay for the streaming units they consume; there are no upfront costs. Users can easily scale up or down as needed; there is no commitment or cluster provisioning.

    • Trustworthy: Azure Stream Analytics guarantees event processing to be 99.99% available with a minute level of granularity. Azure Stream Analytics has embedded recovery capabilities and checkpoints to keep things running smoothly at all times. Events are never lost with Azure Stream Analytics at-least once delivery of events and exactly one event processing.

    Reviews from Real Users

    “Azure Stream Analytics is something that you can use to test out streaming scenarios very quickly in the general sense and it is useful for IoT scenarios. If I was to do a project with IoT and I needed a streaming solution, Azure Stream Analytics would be a top choice. The most valuable features of Azure Stream Analytics are the ease of provisioning and the interface is not terribly complex.” - Olubisi A., Team Lead at a tech services company.

    “It's used primarily for data and mining - everything from the telemetry data side of things. It's great for streaming and makes everything easy to handle. The streaming from the IoT hub and the messaging are aspects I like a lot.” - Sudhendra U., Technical Architect at Infosys

    Sample Customers
    Netflix, Expedia, TimeInc., Novaris, airbnb, Lamborghini
    NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
    Rockwell Automation, Milliman, Honeywell Building Solutions, Arcoflex Automation Solutions, Real Madrid C.F., Aerocrine, Ziosk, Tacoma Public Schools, P97 Networks
    Top Industries
    REVIEWERS
    Computer Software Company31%
    Financial Services Firm15%
    Comms Service Provider12%
    Government8%
    VISITORS READING REVIEWS
    Financial Services Firm20%
    Computer Software Company17%
    Manufacturing Company6%
    Educational Organization6%
    REVIEWERS
    Computer Software Company30%
    Financial Services Firm15%
    University9%
    Marketing Services Firm6%
    VISITORS READING REVIEWS
    Financial Services Firm25%
    Computer Software Company13%
    Manufacturing Company7%
    Comms Service Provider6%
    REVIEWERS
    Computer Software Company27%
    Manufacturing Company18%
    Insurance Company9%
    Media Company9%
    VISITORS READING REVIEWS
    Computer Software Company15%
    Financial Services Firm12%
    Manufacturing Company8%
    Comms Service Provider5%
    Company Size
    REVIEWERS
    Small Business46%
    Midsize Enterprise16%
    Large Enterprise39%
    VISITORS READING REVIEWS
    Small Business20%
    Midsize Enterprise12%
    Large Enterprise68%
    REVIEWERS
    Small Business40%
    Midsize Enterprise18%
    Large Enterprise42%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise12%
    Large Enterprise71%
    REVIEWERS
    Small Business24%
    Midsize Enterprise10%
    Large Enterprise67%
    VISITORS READING REVIEWS
    Small Business20%
    Midsize Enterprise11%
    Large Enterprise69%
    Buyer's Guide
    Compute Service
    April 2024
    Find out what your peers are saying about Amazon Web Services (AWS), Apache, Zadara and others in Compute Service. Updated: April 2024.
    769,334 professionals have used our research since 2012.