Try our new research platform with insights from 80,000+ expert users

AWS Lambda vs Apache Spark comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on May 21, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Apache Spark
Ranking in Compute Service
4th
Average Rating
8.4
Reviews Sentiment
7.4
Number of Reviews
66
Ranking in other categories
Hadoop (1st), Java Frameworks (2nd)
AWS Lambda
Ranking in Compute Service
1st
Average Rating
8.6
Reviews Sentiment
7.3
Number of Reviews
88
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of June 2025, in the Compute Service category, the mindshare of Apache Spark is 11.4%, up from 10.8% compared to the previous year. The mindshare of AWS Lambda is 21.2%, up from 20.5% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Compute Service
 

Featured Reviews

Dunstan Matekenya - PeerSpot reviewer
Open-source solution for data processing with portability
Apache Spark is known for its ease of use. Compared to other available data processing frameworks, it is user-friendly. While many choices now exist, Spark remains easy to use, particularly with Python. You can utilize familiar programming styles similar to Pandas in Python, including object-oriented programming. Another advantage is its portability. I can prototype and perform some initial tasks on my laptop using Spark without needing to be on Databricks or any cloud platform. I can transfer it to Databricks or other platforms, such as AWS. This flexibility allows me to improve processing even on my laptop. For instance, if I'm processing large amounts of data and find my laptop becoming slow, I can quickly switch to Spark. It handles small and large datasets efficiently, making it a versatile tool for various data processing needs.
Andrew-Wong - PeerSpot reviewer
Convenience in deployment process with room for code preview improvement
Having a better preview would be helpful. Sometimes, if my Lambda code is too big, it can be inconvenient as I'm unable to see my code when it exceeds a certain size. AWS has a limit, like a three-megabyte limit, beyond which I cannot view or edit the code easily.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"There's a lot of functionality."
"The most valuable feature of Apache Spark is its flexibility."
"It is useful for handling large amounts of data. It is very useful for scientific purposes."
"The solution is scalable."
"The solution is very stable."
"AI libraries are the most valuable. They provide extensibility and usability. Spark has a lot of connectors, which is a very important and useful feature for AI. You need to connect a lot of points for AI, and you have to get data from those systems. Connectors are very wide in Spark. With a Spark cluster, you can get fast results, especially for AI."
"The product is useful for analytics."
"With Hadoop-related technologies, we can distribute the workload with multiple commodity hardware."
"Some of the most valuable features are that it's easy to install and use. The performance is also good."
"The most valuable features of AWS Lambda are a serverless and event-driven architecture."
"The most valuable feature of AWS Lambda is that you can trigger and run jobs instantly, and after you complete the job, that function is either destroyed or stopped automatedly."
"The serverless computing feature eliminates the need to manage servers, provision, or scale."
"We are building a Twitter-like application in the boot camp. I have used Lamda for the integration of the post-confirmation page in the application. This will help you get your one-time password via mail. You can log in with the help of a post-confirmation page. We didn’t want to setup an instance specifically for confirmation. We used the Lambda function so that it goes back to sleep after pushing up."
"The programming language and the integration with other AWS services are the most valuable features."
"I have found all of the features valuable. It's an easy and cheap solution."
"Provides a good, easy path from when you're using an AWS cluster."
 

Cons

"I would like to see integration with data science platforms to optimize the processing capability for these tasks."
"The migration of data between different versions could be improved."
"Apache Spark is very difficult to use. It would require a data engineer. It is not available for every engineer today because they need to understand the different concepts of Spark, which is very, very difficult and it is not easy to learn."
"Apache Spark lacks geospatial data."
"The logging for the observability platform could be better."
"It would be beneficial to enhance Spark's capabilities by incorporating models that utilize features not traditionally present in its framework."
"We use big data manager but we cannot use it as conditional data so whenever we're trying to fetch the data, it takes a bit of time."
"Apache Spark should add some resource management improvements to the algorithms."
"We don't have the inbuilt modules in AWS Lambda. If more modules were built into or integrated with AWS Lambda, that would help developers to code."
"If it is a specific ETL process or a long-term one, then AWS Lambda is not a good option."
"The price in general could always be better."
"I want to see support for longer applications. I need the 15-minute time-out window to improve."
"I would like to see the five zero four AWS Lambda invocation fixed. This is basically a time-out error."
"Lambda would benefit from a debugging feature as well."
"I wish to see better execution time in the next release."
"The tool changes its UI every month which is very frustrating for me. I don’t know why AWS keeps changing the UI. They can’t stick to a specific one"
 

Pricing and Cost Advice

"I did not pay anything when using the tool on cloud services, but I had to pay on the compute side. The tool is not expensive compared with the benefits it offers. I rate the price as an eight out of ten."
"It is an open-source solution, it is free of charge."
"The tool is an open-source product. If you're using the open-source Apache Spark, no fees are involved at any time. Charges only come into play when using it with other services like Databricks."
"It is quite expensive. In fact, it accounts for almost 50% of the cost of our entire project."
"Spark is an open-source solution, so there are no licensing costs."
"Since we are using the Apache Spark version, not the data bricks version, it is an Apache license version, the support and resolution of the bug are actually late or delayed. The Apache license is free."
"We are using the free version of the solution."
"Apache Spark is an expensive solution."
"AWS Lambda's cloud version isn't expensive, and I'd rate its pricing as five out of five."
"The price of AWS Lambda is priced very low."
"AWS Lambda is not expensive for micro testing but is expensive if used for long deployment or long services."
"Lambda is an affordable solution. They offer free requests every month and charge per the compute time. If you are working in a big organization, usually AWS offer a savings plan where you get approximately 70% discount on pricing."
"The cost is based on runtime."
"AWS Lambda is a cheap solution."
"I would rate the tool’s pricing a nine out of ten. The solution’s pricing works on a pay-as-you-go basis."
"Price-wise, AWS Lambda is very cheap. It's not free, but it's not that expensive."
report
Use our free recommendation engine to learn which Compute Service solutions are best for your needs.
856,873 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
27%
Computer Software Company
13%
Manufacturing Company
7%
Comms Service Provider
6%
Educational Organization
57%
Financial Services Firm
10%
Computer Software Company
6%
Manufacturing Company
4%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Apache Spark is open-source, so it doesn't incur any charges.
What needs improvement with Apache Spark?
There is complexity when it comes to understanding the whole ecosystem, especially for beginners. I find it quite complex to understand how a Spark job is initiated, the roles of driver nodes, work...
Which is better, AWS Lambda or Batch?
AWS Lambda is a serverless solution. It doesn’t require any infrastructure, which allows for cost savings. There is no setup process to deal with, as the entire solution is in the cloud. If you use...
What do you like most about AWS Lambda?
The tool scales automatically based on the number of incoming requests.
What is your experience regarding pricing and costs for AWS Lambda?
The pricing of AWS Lambda is reasonable. It's beneficial and cost-effective for users regardless of the number of instances used.
 

Comparisons

 

Overview

 

Sample Customers

NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
Netflix
Find out what your peers are saying about AWS Lambda vs. Apache Spark and other solutions. Updated: June 2025.
856,873 professionals have used our research since 2012.