Try our new research platform with insights from 80,000+ expert users

AWS Lambda vs Apache Spark vs Azure Stream Analytics comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Mindshare comparison

Hadoop
Compute Service
Streaming Analytics
 

Featured Reviews

Ilya Afanasyev - PeerSpot reviewer
Reliable, able to expand, and handle large amounts of data well
We use batch processing. It works well with our formats and file versions. There's a lot of functionality. In our pipeline each hour, we make a copy of data from MongoDB, of the changes from MongoDB to some specific file. Each time pipeline copied all of the data, it would do it each time without changes to all of the tables. Tables have a lot of data, and in the last MongoDB version, there is a possibility to read only changed data. This reduced the cost and configuration of the cluster, and we saved about $150,000. The solution is scalable. It's a stable product.
Wai L Lin O - PeerSpot reviewer
A serverless solution with easy integration features
We use AWS Lambda because it provides a solution for our needs without requiring us to manage our infrastructure. With the tool, we only pay for the resources we use. Additionally, it is straightforward to implement and integrates with other services like API Gateway. The tool's serverless nature has had the most significant impact on our workflow. I find it particularly attractive because it eliminates the need for managing servers. In my previous experience, managing upgrades and updates was quite challenging. The solution's integration process with other AWS services was relatively easy. We primarily use AWS services such as EventBridge for scheduling processes and log management.
SantiagoCordero - PeerSpot reviewer
Native connectors and integration simplify tasks but portfolio complexity needs addressing
There are too many products in the Azure landscape, which sometimes leads to overlap between them. Microsoft continuously releases new products or solutions, which can be frustrating when determining the appropriate features from one solution over another. A cost comparison between products is also not straightforward. They should simplify their portfolio. The Microsoft licensing system is confusing and not easy to understand, and this is something they should address. In the future, I may stop using Stream Analytics and move to other solutions. I discussed Palantir earlier, which is something I want to explore in depth because it allows me to accomplish more efficiently compared to solely using Azure. Additionally, the vendors should make the solution more user-friendly, incorporating low-code and no-code features. This is something I wish to explore further.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is the Fault Tolerance and easy binding with other processes like Machine Learning, graph analytics."
"The most crucial feature for us is the streaming capability. It serves as a fundamental aspect that allows us to exert control over our operations."
"It is highly scalable, allowing you to efficiently work with extensive datasets that might be problematic to handle using traditional tools that are memory-constrained."
"The tool's most valuable feature is its speed and efficiency. It's much faster than other tools and excels in parallel data processing. Unlike tools like Python or JavaScript, which may struggle with parallel processing, it allows us to handle large volumes of data with more power easily."
"The product's initial setup phase was easy."
"One of the key features is that Apache Spark is a distributed computing framework. You can help multiple slaves and distribute the workload between them."
"The features we find most valuable are the machine learning, data learning, and Spark Analytics."
"The main feature that we find valuable is that it is very fast."
"AWS Lambda is a stable solution."
"AWS Lambda is entirely stable."
"The most valuable feature is that it scans the cloud system and if they are any security anomalies it triggers an email."
"The cool thing about AWS Lambda is that AWS does all the management. For compression, it is all about making the data small and then making it regular size again. We have an encode function and a decode function. AWS Lambda schedules each of those for us. It has a load balancer and all the fancy stuff, depending on the demand. The most valuable part of AWS Lambda is that I only need to write the software. I need to write two functions, and my cloud developer turns them into two AWS Lambda instances. That's it."
"Thanks to this solution, we do not need to worry about hardware or resource utilization. It saves us time."
"It is easy to use."
"We are building a Twitter-like application in the boot camp. I have used Lamda for the integration of the post-confirmation page in the application. This will help you get your one-time password via mail. You can log in with the help of a post-confirmation page. We didn’t want to setup an instance specifically for confirmation. We used the Lambda function so that it goes back to sleep after pushing up."
"What I like best about AWS Lambda is that it's feature-rich, and I appreciate that. I also like that it's stable and supports many languages."
"The solution's most valuable feature is its ability to create a query using SQ."
"I appreciate this solution because it leverages open-source technologies. It allows us to utilize the latest streaming solutions and it's easy to develop."
"The life cycle, report management and crash management features are great."
"Any time I needed assistance, they were helpful."
"The integrations for this solution are easy to use and there is flexibility in integrating the tool with Azure Stream Analytics."
"It provides the capability to streamline multiple output components."
"It was easy for me to use from the beginning. I am accustomed to working with Microsoft."
"I like the IoT part. We have mostly used Azure Stream Analytics services for it"
 

Cons

"The logging for the observability platform could be better."
"Apache Spark's GUI and scalability could be improved."
"They could improve the issues related to programming language for the platform."
"Its UI can be better. Maintaining the history server is a little cumbersome, and it should be improved. I had issues while looking at the historical tags, which sometimes created problems. You have to separately create a history server and run it. Such things can be made easier. Instead of separately installing the history server, it can be made a part of the whole setup so that whenever you set it up, it becomes available."
"When using Spark, users may need to write their own parallelization logic, which requires additional effort and expertise."
"The product could improve the user interface and make it easier for new users."
"The solution’s integration with other platforms should be improved."
"The solution needs to optimize shuffling between workers."
"We need to invest time in learning the tool's language variant. We have encountered instances of downtime as well."
"AWS Lambda should improve its compatibility with the language used to write the code."
"There are other similar solutions, such as Google Cloud Platform or Microsoft Azure. They might be better for small tasks."
"The support team does not know how to implement and build the solution."
"I would like the layers to have a bigger volume. I would like to be able to add more. I don't want to be limited by the layer."
"I have seen some drawbacks with certain integrations."
"My engineers work with it on a daily basis. I just don't have enough depth of knowledge about what kinds of edge cases they may have tried and found lacking. There may be some issues with some language support at one point or another because we couldn't get the underlying libraries in there. A lot of what we do is either in JavaScript, Python, or some of the non-compiled languages. I'm not sure if we've ever tried building a C# solution, for instance, in Lambda or a Java solution in Lambda. It doesn't mean those aren't its capabilities. I would rather refer to my engineers for where the boundaries are."
"The 60 seconds limitation with the consumption of the service is really restrictive for a service and the solution can be improved by eliminating that."
"The solution’s customer support could be improved."
"There may be some issues when connecting with Microsoft Power BI because we are providing the input and output commands, and there's a chance of it being delayed while connecting."
"Azure Stream Analytics could improve by having clearer metrics as to the scale, more metrics around the data set size that is flowing through it, and performance tuning recommendations."
"The only challenge was that the streaming analytics area in Azure Stream Analytics could not meet our company's expectations, making it a component where improvements are required."
"The collection and analysis of historical data could be better."
"The solution doesn't handle large data packets very efficiently, which could be improved upon."
"There are too many products in the Azure landscape, which sometimes leads to overlap between them."
"The solution could be improved by providing better graphics and including support for UI and UX testing."
 

Pricing and Cost Advice

"The tool is an open-source product. If you're using the open-source Apache Spark, no fees are involved at any time. Charges only come into play when using it with other services like Databricks."
"On the cloud model can be expensive as it requires substantial resources for implementation, covering on-premises hardware, memory, and licensing."
"Apache Spark is an expensive solution."
"Apache Spark is an open-source tool."
"I did not pay anything when using the tool on cloud services, but I had to pay on the compute side. The tool is not expensive compared with the benefits it offers. I rate the price as an eight out of ten."
"The solution is affordable and there are no additional licensing costs."
"Apache Spark is an open-source solution, and there is no cost involved in deploying the solution on-premises."
"We are using the free version of the solution."
"The price of AWS Lambda is priced very low."
"AWS is slightly more expensive than Azure."
"It costs maybe less than $10 per month in my use case."
"AWS Lambda is inexpensive."
"It computes by the cycle, and it's very cheap."
"This is a product that is pay-per-use, as opposed to a licensing fee."
"We don't need to pay for licensing to use Lambda."
"The cost is based on runtime."
"I rate the price of Azure Stream Analytics a four out of five."
"The current price is substantial."
"Azure Stream Analytics is a little bit expensive."
"There are different tiers based on retention policies. There are four tiers. The pricing varies based on steaming units and tiers. The standard pricing is $10/hour."
"The licensing for this product is payable on a 'pay as you go' basis. This means that the cost is only based on data volume, and the frequency that the solution is used."
"When scaling up, the pricing for Azure Stream Analytics can get relatively high. Considering its capabilities compared to other solutions, I would rate it a seven out of ten for cost. However, we've found ways to optimize costs using tools like Databricks for specific tasks."
"We pay approximately $500,000 a year. It's approximately $10,000 a year per license."
"The product's price is at par with the other solutions provided by the other cloud service providers in the market."
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
849,686 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
27%
Computer Software Company
13%
Manufacturing Company
8%
Comms Service Provider
6%
Educational Organization
67%
Financial Services Firm
8%
Computer Software Company
5%
Manufacturing Company
3%
Computer Software Company
15%
Financial Services Firm
14%
Manufacturing Company
10%
Retailer
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Compared to other solutions like Doc DB, Spark is more costly due to the need for extensive infrastructure. It requir...
What needs improvement with Apache Spark?
The Spark solution could improve in scheduling tasks and managing dependencies. Spark alone cannot handle sequential ...
Which is better, AWS Lambda or Batch?
AWS Lambda is a serverless solution. It doesn’t require any infrastructure, which allows for cost savings. There is n...
What do you like most about AWS Lambda?
The tool scales automatically based on the number of incoming requests.
What is your experience regarding pricing and costs for AWS Lambda?
AWS Lambda is cheaper compared to running an instance continuously. You only pay for what you use, making it cost-eff...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
What is your experience regarding pricing and costs for Azure Stream Analytics?
I have no problem with pricing. We sell the data analytics value and operational value to customers, focusing on prod...
What needs improvement with Azure Stream Analytics?
There is a lack of technical support from Microsoft's local office, particularly in Taiwan. We often have to learn on...
 

Also Known As

No data available
No data available
ASA
 

Overview

 

Sample Customers

NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
Netflix
Rockwell Automation, Milliman, Honeywell Building Solutions, Arcoflex Automation Solutions, Real Madrid C.F., Aerocrine, Ziosk, Tacoma Public Schools, P97 Networks
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: March 2025.
849,686 professionals have used our research since 2012.