
Apache Spark vs SAP HANA comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Apache Spark
Average Rating: 8.4
Reviews Sentiment: 7.7
Number of Reviews: 66
Ranking in other categories: Hadoop (1st), Compute Service (4th), Java Frameworks (2nd)

SAP HANA
Average Rating: 8.4
Reviews Sentiment: 6.5
Number of Reviews: 85
Ranking in other categories: Data Virtualization (2nd), Embedded Database (4th), Relational Databases Tools (4th)
 

Featured Reviews

Ilya Afanasyev - PeerSpot reviewer
Reliable, able to expand, and handle large amounts of data well
We use batch processing. It works well with our formats and file versions. There's a lot of functionality. In our pipeline, we copy data changes from MongoDB to a specific file each hour. Previously, the pipeline copied all of the data from every table on each run, even when nothing had changed. The tables hold a lot of data, and the latest MongoDB version makes it possible to read only the changed data. This reduced the cost and configuration of the cluster, and we saved about $150,000. The solution is scalable. It's a stable product.
Jayarami Reddy Pujeri - PeerSpot reviewer
Comprehensive system with real-time analytics for versatile industry applications
Our primary use case is working with various clients in industries such as pharmaceuticals and other services. We support clients as implementers of SAP HANA, providing expertise in functionality, finance, logistics, and processes. The solution is very user-friendly and supports all kinds of…
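
The first featured review above describes moving from copying every table each hour to reading only the data that changed. As a rough illustration of why that cuts the work per run, here is a minimal sketch in plain Python. It does not use MongoDB or Spark APIs; the in-memory `SOURCE` list, the `updated_at` field, and both function names are hypothetical stand-ins chosen for this example, and the reviewer's actual mechanism may differ.

```python
from datetime import datetime, timezone

# Hypothetical in-memory stand-in for a source collection. The
# `updated_at` field is an assumption made for this sketch only.
SOURCE = [
    {"_id": 1, "value": "a", "updated_at": datetime(2024, 1, 1, 9, 0, tzinfo=timezone.utc)},
    {"_id": 2, "value": "b", "updated_at": datetime(2024, 1, 1, 10, 30, tzinfo=timezone.utc)},
    {"_id": 3, "value": "c", "updated_at": datetime(2024, 1, 1, 11, 15, tzinfo=timezone.utc)},
]


def full_copy(records):
    """Earlier behaviour: copy every record on every run."""
    return list(records)


def incremental_copy(records, last_run):
    """Later behaviour: copy only records changed after the previous run."""
    return [r for r in records if r["updated_at"] > last_run]


# Timestamp of the previous hourly run (illustrative value).
last_run = datetime(2024, 1, 1, 10, 0, tzinfo=timezone.utc)
changed = incremental_copy(SOURCE, last_run)

print(len(full_copy(SOURCE)), "records in a full copy")
print(len(changed), "records in an incremental copy")
```

The fewer records each run has to move, the less cluster capacity the hourly job needs, which is consistent with the cost reduction the reviewer reports.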

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The fault tolerant feature is provided."
"One of the key features is that Apache Spark is a distributed computing framework. You can have multiple slaves and distribute the workload between them."
"The solution has been very stable."
"Its scalability and speed are very valuable. You can scale it a lot. It is a great technology for big data. It is definitely better than a lot of earlier warehouse or pipeline solutions, such as Informatica. Spark SQL is very compliant with normal SQL that we have been using over the years. This makes it easy to code in Spark. It is just like using normal SQL. You can use the APIs of Spark or you can directly write SQL code and run it. This is something that I feel is useful in Spark."
"We use it for ETL purposes as well as for implementing the full transformation pipelines."
"The product is useful for analytics."
"AI libraries are the most valuable. They provide extensibility and usability. Spark has a lot of connectors, which is a very important and useful feature for AI. You need to connect a lot of points for AI, and you have to get data from those systems. Connectors are very wide in Spark. With a Spark cluster, you can get fast results, especially for AI."
"The product's initial setup phase was easy."
"The functionality of the solution is very good."
"What's most valuable in SAP HANA is that it covers the business process systems of my company. The solution also helps because it makes almost everything automated. Another valuable feature of SAP HANA is that it can be integrated with third parties and other enhancements needed by the company."
"The in-memory computing and the efficient response time are very good features."
"Provides us with predictive capabilities for asset maintenance, and real-time forecasts."
"We have found that the Fiori Apps are particularly good."
"SAP HANA is a stable solution."
"We are using the solution for the DW system. The primary function of the solution is for the database in memory."
"It is a stable solution...It is a scalable solution."
 

Cons

"The graphical user interface (UI) could be a bit more clear. It's very hard to figure out the execution logs and understand how long it takes to send everything. If an execution is lost, it's not so easy to understand why or where it went. I have to manually drill down on the data processes which takes a lot of time. Maybe there could be like a metrics monitor, or maybe the whole log analysis could be improved to make it easier to understand and navigate."
"Dynamic DataFrame options are not yet available."
"Apache Spark can improve the use case scenarios from the website. There is not any information on how you can use the solution across the relational databases toward multiple databases."
"At times during the deployment process, the tool goes down, making it look less robust. To take care of the issues in the deployment process, users need to do manual interventions occasionally."
"When you want to extract data from your HDFS and other sources then it is kind of tricky because you have to connect with those sources."
"Apart from the restrictions that come with its in-memory implementation, it has been improved significantly up to version 3.0, which is currently in use."
"There were some problems related to the product's compatibility with a few Python libraries."
"The management tools could use improvement. Some of the debugging tools need some work as well. They need to be more descriptive."
"The solution is very expensive for us."
"There could be better management for faster updates. Last year there were some changes in India to the e-invoicing feature."
"The SAP HANA interface has room for improvement because it takes more work to manage than the Microsoft SQL Server interface."
"SAP HANA isn't user-friendly, and it's very hard to train newcomers to use it."
"The performance and integration with other products are areas in need of improvement."
"The JDBC connectors are very slow."
"Per SAP, you can do both transactional and analytical processes in SAP HANA. Though that's true, the speed is slower when you combine the two functions, so this is what I'd like SAP to improve in SAP HANA. In the next release, I want to see better diagrams in SAP HANA and a more user-friendly interface."
"If the developers were to enhance or improve the application logic while processing the transactions, that would be great."
 

Pricing and Cost Advice

"We are using the free version of the solution."
"Apache Spark is an open-source solution, and there is no cost involved in deploying the solution on-premises."
"The tool is an open-source product. If you're using the open-source Apache Spark, no fees are involved at any time. Charges only come into play when using it with other services like Databricks."
"The solution is affordable and there are no additional licensing costs."
"They provide an open-source license for the on-premise version."
"Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."
"The cloud model can be expensive, as it requires substantial resources for implementation, covering on-premises hardware, memory, and licensing."
"I did not pay anything when using the tool on cloud services, but I had to pay on the compute side. The tool is not expensive compared with the benefits it offers. I rate the price as an eight out of ten."
"The pricing is a bit on the high side."
"The price of licensing is dependent on the size of the project; however, we have found that there is scope to negotiate the cost. If the solution is implemented on-premises, there may be some extra costs for hosting, etc."
"The cost of SAP HANA is high, and I would rate the price at eight out of ten."
"The tool has a high price. I rate the solution's pricing a one on a scale of one to ten, where one is expensive and ten is cheap."
"The pricing for SAP HANA is high. You pay a lot for the license, and you also have to pay for some add-ons."
"SAP HANA is more expensive than other solutions."
"The price of the solution could be reduced; it is expensive."
"Price-wise, the product falls on the higher side of the spectrum. There is no need to pay for maintenance and support additionally. Support is available for bug fixes in the product."
849,686 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 27%
Computer Software Company: 13%
Manufacturing Company: 8%
Comms Service Provider: 6%

Manufacturing Company: 15%
Computer Software Company: 11%
Financial Services Firm: 10%
Government: 7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Compared to other solutions like Doc DB, Spark is more costly due to the need for extensive infrastructure. It requires significant investment in infrastructure, which can be expensive. While cloud...
What needs improvement with Apache Spark?
The Spark solution could improve in scheduling tasks and managing dependencies. Spark alone cannot handle sequential tasks, requiring environments like Airflow scheduler or scripts. For instance, o...
What are the biggest benefits of using SAP HANA?
Based on my work with SAP HANA, the biggest benefit that it can bring to your business is total data management. This product is by SAP - a company that serves almost all needs a client may have co...
Is SAP HANA’s customer and technical support reliable?
We have been using SAP HANA for a fairly short period of time and have only taken advantage of their customer support. So far, we have not had issues that required specialized help from technical s...
Is SAP HANA difficult to set up and start using?
SAP HANA is fairly easy to set up, however, I do not think a complete beginner can do it. You certainly need some preparation - either you need to have experience with similar solutions, or with ot...
 

Also Known As

Apache Spark: No data available
SAP HANA: SAP High-Performance Analytic Appliance, HANA
 

Sample Customers

Apache Spark: NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
SAP HANA: Unilever, NHS 24, adidas Group, CHIO Aachen, Hamburg Port Authority (HPA), Bangkok Airways Public Company Limited
Find out what your peers are saying about Apache Spark vs. SAP HANA and other solutions. Updated: April 2025.