
AWS Lambda vs Apache Spark vs Azure Stream Analytics comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Apache Spark: sentiment score 6.6
Apache Spark enhances machine learning, cutting operational costs by up to 50%, with efficiency reliant on resources and expertise.
AWS Lambda: sentiment score 7.4
AWS Lambda offers high ROI with low costs, automatic scaling, and pay-as-you-go pricing, enhancing development focus.
Azure Stream Analytics: sentiment score 5.3
Azure Stream Analytics offers quick, cost-effective deployment, resulting in positive ROI and customer satisfaction for non-complex scenarios.
 

Customer Service

Apache Spark: sentiment score 5.9
Apache Spark support feedback varies, with mixed reviews on community forums, vendor support, and documentation adequacy.
AWS Lambda: sentiment score 7.1
AWS Lambda support receives mixed reviews, praised for enterprise plans but critiqued for response times and personalized cost concerns.
Azure Stream Analytics: sentiment score 6.2
Azure Stream Analytics support is effective and responsive, with service quality varying by subscription and occasional communication challenges.
There is a big communication gap due to lack of understanding of local scenarios and language barriers.
The support on critical issues depends on the level of subscription that you have with Microsoft itself.
They've managed to answer all my questions and provide help in a timely manner.
 

Scalability Issues

Apache Spark: sentiment score 7.5
Apache Spark excels in scalability, efficiently handling large data workloads with ease, though it requires skilled infrastructure management.
AWS Lambda: sentiment score 7.8
AWS Lambda efficiently scales with traffic, integrates well with AWS, but may raise cost concerns at high volumes.
Azure Stream Analytics: sentiment score 7.2
Azure Stream Analytics is highly scalable, cloud-based, easily integrated, adaptable, and efficiently manages varying workloads, despite some cost concerns.
I would rate AWS Lambda's scalability a nine on a scale from 1 to 10, where 1 is the lowest and 10 is the highest.
Maintenance requires a couple of people; however, it's not a full-time endeavor.
Azure Stream Analytics is scalable, and I would rate it seven out of ten.
 

Stability Issues

Apache Spark: sentiment score 7.5
Apache Spark is generally stable, trusted by companies; newer versions enhance reliability, though memory issues may arise without proper configuration.
AWS Lambda: sentiment score 8.1
AWS Lambda offers stable, reliable performance with high availability, despite occasional latency issues, suitable for mission-critical applications.
Azure Stream Analytics: sentiment score 6.3
Azure Stream Analytics is reliable but can face downtime, bugs, transformation challenges, and requires tuning for optimal stability.
MapReduce needs to perform numerous disk input and output operations, while Apache Spark can use memory to store and process data.
They require significant effort and fine-tuning to function effectively.
 

Room For Improvement

Apache Spark requires improvements in scalability, usability, documentation, memory efficiency, real-time processing, and broader language support for better performance.
AWS Lambda requires improved integration, language support, performance, scalability, user-friendliness, and competitive pricing for enhanced third-party interoperability.
Azure Stream Analytics needs better pricing, logging, customization, connectivity, integration, UI, flexibility, support, error handling, and simplified licensing.
There's setup time required to get it integrated with different services such as Power BI, so it's not a straight out-of-the-box configuration.
A cost comparison between products is also not straightforward.
Although customers can invite Microsoft Taiwan office staff for introductions, there are not many useful case references, suggesting room for improvement in market support.
 

Setup Cost

Apache Spark is cost-effective but may incur expenses from hardware, cloud resources, or commercial support, impacting deployment costs.
AWS Lambda's pay-per-use pricing is cost-effective for small workloads, offering flexibility and savings over traditional servers.
Azure Stream Analytics offers competitive pricing but can be costly for enterprises; users find billing reports confusing.
From my point of view, it should be cheaper now, considering the years since its release.
Regarding the cost of Azure Stream Analytics, I believe the price is reasonable for the tool.
We sell the data analytics value and operational value to customers, focusing on productivity and efficiency from the cloud.
 

Valuable Features

Apache Spark offers fast in-memory processing, scalable analytics, MLlib for machine learning, SQL support, and seamless integration with languages.
AWS Lambda offers serverless architecture, easy integration, auto-scaling, and cost-efficiency for developers prioritizing flexibility and quick deployments.
Azure Stream Analytics provides integrated, scalable real-time analytics with SQL queries and machine learning, enhancing data processing capabilities efficiently.
Not all solutions can make this data fast enough to be used, except for solutions such as Apache Spark Structured Streaming.
It's very accurate and uses existing technologies in terms of writing queries, utilizing standard query languages such as SQL, Spark, and others to provide information.
Clients can choose and subscribe to the service items they need, making it more flexible than IBM solutions, especially in data analytics or data governance.
Everything could be converged in a high-level overview in terms of cost, performance, and scaling up the teams.
 

Mindshare comparison

Hadoop Market Share Distribution
Product | Market Share (%)
Apache Spark | 19.3%
Cloudera Distribution for Hadoop | 22.1%
HPE Ezmeral Data Fabric | 14.2%
Other | 44.4%
Compute Service Market Share Distribution
Product | Market Share (%)
AWS Lambda | 18.2%
AWS Batch | 17.7%
AWS Fargate | 12.7%
Other | 51.4%
Streaming Analytics Market Share Distribution
Product | Market Share (%)
Azure Stream Analytics | 8.1%
Apache Flink | 14.6%
Databricks | 13.1%
Other | 64.2%
 

Featured Reviews

Omar Khaled - PeerSpot reviewer
Empowering data consolidation and fast decision-making with efficient big data processing
I can improve the organization's functions by taking less time to make decisions. To make the right decision, you need the right data, and a solution can provide this by hiring talent and employees who can consolidate data from different sources and organize it. Not all solutions can make this data fast enough to be used, except for solutions such as Apache Spark Structured Streaming. To make the right decision, you should have both accurate and fast data. Apache Spark itself is similar to the Python programming language. Python is a language with many libraries for mathematics and machine learning. Apache Spark is the solution, and within it, you have PySpark, which is the API for Apache Spark to write and run Python code. Within it, there are many APIs, including SQL APIs, allowing you to write SQL code within a Python function in Apache Spark. You can also use Apache Spark Structured Streaming and machine learning APIs.
Andrew-Wong - PeerSpot reviewer
Convenience in deployment process with room for code preview improvement
Having a better preview would be helpful. Sometimes, if my Lambda code is too big, it can be inconvenient as I'm unable to see my code when it exceeds a certain size. AWS has a limit, like a three-megabyte limit, beyond which I cannot view or edit the code easily.
SantiagoCordero - PeerSpot reviewer
Native connectors and integration simplify tasks but portfolio complexity needs addressing
There are too many products in the Azure landscape, which sometimes leads to overlap between them. Microsoft continuously releases new products or solutions, which can be frustrating when determining the appropriate features from one solution over another. A cost comparison between products is also not straightforward. They should simplify their portfolio. The Microsoft licensing system is confusing and not easy to understand, and this is something they should address. In the future, I may stop using Stream Analytics and move to other solutions. I discussed Palantir earlier, which is something I want to explore in depth because it allows me to accomplish more efficiently compared to solely using Azure. Additionally, the vendors should make the solution more user-friendly, incorporating low-code and no-code features. This is something I wish to explore further.
868,304 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Apache Spark
Financial Services Firm: 26%
Computer Software Company: 11%
Manufacturing Company: 7%
Comms Service Provider: 7%
AWS Lambda
Financial Services Firm: 18%
Computer Software Company: 12%
Manufacturing Company: 8%
Educational Organization: 6%
Azure Stream Analytics
Financial Services Firm: 15%
Computer Software Company: 13%
Manufacturing Company: 8%
Retailer: 6%
 

Company Size

By reviewers
Apache Spark
Company Size | Count
Small Business | 27
Midsize Enterprise | 15
Large Enterprise | 32
AWS Lambda
Company Size | Count
Small Business | 35
Midsize Enterprise | 15
Large Enterprise | 42
Azure Stream Analytics
Company Size | Count
Small Business | 8
Midsize Enterprise | 3
Large Enterprise | 17
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Apache Spark is open-source, so it doesn't incur any charges.
What needs improvement with Apache Spark?
Regarding Apache Spark, I have only used Apache Spark Structured Streaming, not the machine learning components. I am...
Which is better, AWS Lambda or Batch?
AWS Lambda is a serverless solution. It doesn’t require any infrastructure, which allows for cost savings. There is n...
What do you like most about AWS Lambda?
The tool scales automatically based on the number of incoming requests.
What is your experience regarding pricing and costs for AWS Lambda?
The pricing of AWS Lambda is reasonable. It's beneficial and cost-effective for users regardless of the number of ins...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
What is your experience regarding pricing and costs for Azure Stream Analytics?
The solution does not need any license; it comes with your subscription.
What needs improvement with Azure Stream Analytics?
It does not always give you the right reason or the correct reason. For example, if a service is stopped, it just tel...
 

Also Known As

Apache Spark: No data available
AWS Lambda: No data available
Azure Stream Analytics: ASA
 

Overview

 

Sample Customers

Apache Spark: NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
AWS Lambda: Netflix
Azure Stream Analytics: Rockwell Automation, Milliman, Honeywell Building Solutions, Arcoflex Automation Solutions, Real Madrid C.F., Aerocrine, Ziosk, Tacoma Public Schools, P97 Networks
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: September 2025.