SAP HANA vs Spark SQL comparison

SAP HANA: 734 views | 464 comparisons | 91% willing to recommend
Spark SQL: 1,534 views | 1,005 comparisons | 85% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between SAP HANA and Spark SQL based on real PeerSpot user reviews.

Find out in this report how the two Hadoop solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
To learn more, read our detailed SAP HANA vs. Spark SQL Report (Updated: March 2024).
767,847 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
  • "The user interface is very good. You can do any kind of reporting analytics from the platform."
  • "What I like most are the dashboards and pervasive analytics."
  • "It is difficult for me to narrow down what the best features are in SAP HANA because they work together to provide the overall functionality of the solution. However, the Fiori application is very good."
  • "Anyone currently using SAP will be transitioning to HANA."
  • "The product handles high volumes very well and provides good integration."
  • "The UX experience is very good."
  • "The solution is extremely stable. That's the most important aspect of the solution, for our organization. There is no downtime, and the performance is very good."
  • "Some functions have good performance."

More SAP HANA Pros →

  • "The performance is one of the most important features. It has an API to process the data in a functional manner."
  • "The stability was fine. It behaved as expected."
  • "The speed of getting data."
  • "Certain data sets that are very large are very difficult to process with Pandas and Python libraries. Spark SQL has helped us a lot with that."
  • "Spark SQL's efficiency in managing distributed data and its simplicity in expressing complex operations make it an essential part of our data pipeline."
  • "The solution is easy to understand if you have basic knowledge of SQL commands."
  • "This solution is useful to leverage within a distributed ecosystem."
  • "The team members don't have to learn a new language and can implement complex tasks very easily using only SQL."

More Spark SQL Pros →

Cons
  • "I would like more technical documentation. I would like it to be easier to find online help or have a better launch-based service. SAP has a lot of functions, so we need more best practices and more detailed documentation on industry solutions. For example, it would be good to have documentation on why a certain process needs to be set up and which kinds of configurations need to be set up."
  • "It could be a bit more scalable."
  • "The solution's development platform should be more flexible and scalable to adapt to other solutions."
  • "I don't have direct access to SAP, and instead, I need to go through the SAP office in India."
  • "SAP HANA isn't user-friendly, and it's very hard to train newcomers to use it."
  • "Technical support should be more customer-friendly."
  • "What needs improvement in SAP HANA is its automation, in particular, it needs more enhancements in that area."
  • "Per SAP, you can do both transactional and analytical processes in SAP HANA. Though that's true, the speed is slower when you combine the two functions, so this is what I'd like SAP to improve in SAP HANA. In the next release, I want to see better diagrams in SAP HANA and a more user-friendly interface."

More SAP HANA Cons →

  • "In the next update, we'd like to see better performance for small points of data. It is possible but there are better tools that are faster and cheaper."
  • "SparkUI could have more advanced versions of the performance and the queries and all."
  • "This solution could be improved by adding monitoring and integration for the EMR."
  • "It takes a bit of time to get used to using this solution versus Pandas as it has a steep learning curve."
  • "There are many inconsistencies in syntax for the different querying tasks."
  • "There should be better integration with other solutions."
  • "In terms of improvement, the only thing that could be enhanced is the stability aspect of Spark SQL."
  • "Being a new user, I am not able to find out how to partition it correctly. I probably need more information or knowledge. In other database solutions, you can easily optimize all partitions. I haven't found a quicker way to do that in Spark SQL. It would be good if you don't need a partition here, and the system automatically partitions in the best way. They can also provide more educational resources for new users."

More Spark SQL Cons →

Pricing and Cost Advice
  • "Set up a consortium of consulting partners and hardware vendors to define your tech landscape TCO (total cost of ownership) and then approach the OEM for pricing (on-premise or on cloud or a hybrid model). Check if you can bring your own licenses for some of the existing application licenses on the new platform, to reduce TCO."
  • "People who are technical will accept the cost, but financially they will assess whether this solution will bring them revenue or not. People often ask, how will I profit when the cost is so high?"
  • "It is expensive, which isn't a problem for us because SAP HANA is processing the data so fast."
  • "SAP HANA is an expensive product."
  • "It is expensive."
  • "Setup and licensing require planning and proper budgeting, as it is not cheap."
  • "The price of the solution could be reduced, it is expensive."
  • "The price of this product is good."
  • More SAP HANA Pricing and Cost Advice →

  • "The solution is open-sourced and free."
  • "There is no license or subscription for this solution."
  • "The solution is bundled with Palantir Foundry at no extra charge."
  • "The on-premise solution is quite expensive in terms of hardware, setting up the cluster, memory, hardware and resources. It depends on the use case, but in our case with a shared cluster which is quite large, it is quite expensive."
  • "We use the open-source version, so we do not have direct support from Apache."
  • "We don't have to pay for licenses with this solution because we are working in a small market, and we rely on open-source because the budgets of projects are very small."
  • More Spark SQL Pricing and Cost Advice →

    Questions from the Community
    Top Answer: Based on my work with SAP HANA, the biggest benefit that it can bring to your business is total data management. This product is by SAP - a company that serves almost all needs a client may have…
    Top Answer: We have been using SAP HANA for a fairly short period of time and have only taken advantage of their customer support. So far, we have not had issues that required specialized help from technical…
    Top Answer: SAP HANA is fairly easy to set up, however, I do not think a complete beginner can do it. You certainly need some preparation - either you need to have experience with similar solutions, or with other…
    Top Answer: Spark SQL's efficiency in managing distributed data and its simplicity in expressing complex operations make it an essential part of our data pipeline.
    Top Answer: We don't have to pay for licenses with this solution because we are working in a small market, and we rely on open-source because the budgets of projects are very small.
    Top Answer: In terms of improvement, the only thing that could be enhanced is the stability aspect of Spark SQL. There could be additional features that I haven't explored but the current solution for working…
    Ranking
    SAP HANA: 1st out of 14 in Embedded Database
    • Views: 734
    • Comparisons: 464
    • Reviews: 39
    • Average Words per Review: 398
    • Rating: 8.5
    Spark SQL: 4th out of 22 in Hadoop
    • Views: 1,534
    • Comparisons: 1,005
    • Reviews: 7
    • Average Words per Review: 543
    • Rating: 8.3
    Comparisons (SAP HANA)
    • Oracle Database: compared 33% of the time
    • SQL Server: compared 28% of the time
    • MySQL: compared 8% of the time
    • IBM Db2 Database: compared 7% of the time
    • Apache Spark: compared 4% of the time
    Also Known As
    SAP High-Performance Analytic Appliance, HANA
    Overview

    SAP HANA, also known as SAP High-Performance Analytic Appliance, is a multi-model database that keeps data in main memory rather than reading it from disk. The product combines its robust database with services for creating applications. SAP HANA can be faster than disk-based database management systems (DBMS) because it stores data in column-based tables in main memory and brings online analytical processing (OLAP) and online transaction processing (OLTP) together.

    The column-oriented in-memory database design allows users to run high-speed transactions alongside advanced analytics, all in a single system. This provides companies with the ability to process very large amounts of data with low latency and query data in an instant. By combining multiple data management capabilities, the solution simplifies IT, helps businesses with innovations, and facilitates digital transformation.
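
To make the column-oriented idea concrete, here is a minimal sketch in plain Python. It is not SAP HANA code: the sample orders and the helper names (`to_columns`, `total_row_store`, `total_column_store`) are assumptions made up for illustration. It shows why an analytical aggregate over one column touches less data when values are stored per column instead of per row.

```python
from typing import Any, Dict, List

# Hypothetical sample data in a row-oriented layout: one record per order.
rows: List[Dict[str, Any]] = [
    {"order_id": 1, "region": "EU", "amount": 120.0},
    {"order_id": 2, "region": "US", "amount": 80.0},
    {"order_id": 3, "region": "EU", "amount": 50.0},
]


def to_columns(records: List[Dict[str, Any]]) -> Dict[str, List[Any]]:
    """Pivot row records into one list per column (column-oriented layout)."""
    columns: Dict[str, List[Any]] = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


def total_row_store(records: List[Dict[str, Any]], column: str) -> float:
    """Aggregate by visiting every full record, even though one field is needed."""
    return sum(record[column] for record in records)


def total_column_store(columns: Dict[str, List[Any]], column: str) -> float:
    """Aggregate by scanning a single contiguous column."""
    return sum(columns[column])


columns = to_columns(rows)
print(total_row_store(rows, "amount"))
print(total_column_store(columns, "amount"))
```

Both functions return the same total. The difference is only in how much data each one has to walk through, which is the property a column store exploits for analytics.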

    The solution is structured into five groups of capabilities, categorized as:

    • Database design
    • Database management
    • Application development
    • Advanced analytics
    • Data virtualization

    Three more SAP products work alongside SAP HANA. SAP S/4HANA Cloud is a ready-to-run cloud enterprise resource planning (ERP) system. SAP BW/4HANA is a packaged data warehouse, based on SAP HANA, which allows users to consolidate data across the enterprise to get a consistent view of their data. Finally, SAP HANA Cloud is a single database as a service (DBaaS) foundation for modern applications and analytics across all enterprise data. All three products can be combined with SAP HANA to give users an optimized experience with their data.

    SAP HANA Features

    Each architectural group of capabilities of SAP HANA has various features that users can benefit from. These include:

    • Parallel processing database: SAP HANA utilizes a single platform to run transactional and analytical workloads.

    • ACID compliance: This feature ensures compliance with requirements for Atomicity, Consistency, Isolation, and Durability (ACID) standards.

    • Multi-tenancy: This feature allows multiple tenant databases to run in one system while sharing the same memory and processors.

    • Multi-tier storage and persistent memory support: SAP HANA's native storage extension is a built-in capability to manage data between memory and persistent storage, including SAP HANA Cloud Data Lake.

    • Scaling: The scaling feature supports terabytes of data in a single server and distributes large tables across multiple servers in a cluster to scale further.

    • Data modeling: This feature consists of graphical modeling tools that enable collaboration between stakeholders and the creation of models to execute complex business logic and data transformation in real time.

    • Stored procedures: The product has a native language to build stored procedures and uses advanced capabilities to create complex logic.

    • Administration: This feature consists of administration tools for various platform lifecycle, performance, and management operations and automations.

    • Security: SAP HANA provides its users with real-time data anonymization features to extract value from data while protecting privacy.

    • Availability and recovery: The tool supports high availability and disaster recovery through an array of techniques, including backup, storage mirroring, and synchronous, asynchronous, and multitarget system replication.

    • Extended application services: Through its built-in application server, users can develop services such as REST and OData, as well as web applications that can run in multiple locations.

    • Client access: The product offers clients the ability to access it via other application platforms and languages, including Java, JavaScript, R, and Go.

    • Application lifecycle management: This set of features facilitates the building and packaging of applications, transporting them from development to test to production, and then deploying them.

    • Application development: This feature consists of a set of tools that offer application development on premises and in the cloud. The programming language ABAP includes additional optimized features to build extensions to SAP applications.

    • Search: The search feature uses SQL to locate text promptly across multiple columns and textual content.

    • Spatial processing: This product feature provides native support for spatial data types and spatial functions.

    • Graph: Through this feature, users of the product can store and process highly connected data using a property graph.

    • Streaming analytics: This feature combines various data sources that users can utilize to discover trends over a set period.

    • Data integration and replication: The solution offers comprehensive features to handle all data integration scenarios.

    • Data federation: This feature allows users to query remote data sources in real time.

    • Caching: The capacity to cache data provides users with the ability to optimize federated queries against remote sources of data.
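
The ACID compliance item in the list above can be illustrated with a small sketch. It uses Python's built-in `sqlite3` module as a stand-in and does not touch SAP HANA or its client libraries. The `accounts` table, the sample balances, and the `transfer` helper are all assumptions invented for this example. It demonstrates atomicity only: either both updates of a transfer are committed, or neither is.

```python
import sqlite3
from typing import Dict


def transfer(conn: sqlite3.Connection, source: str, target: str, amount: int) -> bool:
    """Move `amount` between accounts atomically; return True if committed."""
    try:
        # The connection context manager commits on success and rolls back
        # if an exception escapes the block.
        with conn:
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, source),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, target),
            )
            (balance,) = conn.execute(
                "SELECT balance FROM accounts WHERE name = ?", (source,)
            ).fetchone()
            if balance < 0:
                raise ValueError("insufficient funds")
        return True
    except ValueError:
        return False


def balances(conn: sqlite3.Connection) -> Dict[str, int]:
    """Return the current balance of every account."""
    return dict(conn.execute("SELECT name, balance FROM accounts ORDER BY name"))


# Hypothetical in-memory database with two accounts.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (name TEXT PRIMARY KEY, balance INTEGER NOT NULL)")
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 20)])
conn.commit()

ok = transfer(conn, "alice", "bob", 30)       # fits within alice's balance
failed = transfer(conn, "alice", "bob", 500)  # would overdraw, so it is rolled back
print(ok, failed, balances(conn))
```

After the failed transfer, neither account reflects a partial update, which is the behavior the atomicity requirement describes.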

    SAP HANA Benefits

    SAP HANA provides many benefits for its users. These include:

    • This solution offers a high level of data and application security, beginning from a secure setup and providing continuous support.

    • SAP HANA offers augmentation for applications and analytics with built-in machine learning (ML).

    • The solution responds to queries within seconds, even in large production applications.

    • SAP HANA simplifies work, as it provides a single gateway to all user data with advanced data virtualization.

    • The product is very flexible, as it allows users to deploy applications in a public or private cloud, in multiple clouds, on premises, or hybrid.

    • SAP HANA scales easily for data volume and concurrent users across a distributed environment.

    • This is a powerful solution in terms of querying large datasets with a massively parallel processing (MPP) database.

    • SAP HANA is a versatile product that supports hybrid transactional and analytical processing as well as many data types.

    • The product provides a smaller data footprint through advanced compression and no data duplication, and it reduces data silos.

    Reviews from Real Users

    According to a database consultant at a pharma/biotech company, SAP HANA is a very robust solution with good data access.

    Bruno V., owner at LAVORO AUTOM INF E COM LTDA, likes SAP HANA because the product offers advanced features, helps reduce hours, and makes it easy to find what you need.

    Spark SQL is a Spark module for structured data processing. Unlike the basic Spark RDD API, the interfaces provided by Spark SQL provide Spark with more information about the structure of both the data and the computation being performed. There are several ways to interact with Spark SQL including SQL and the Dataset API. When computing a result the same execution engine is used, independent of which API/language you are using to express the computation. This unification means that developers can easily switch back and forth between different APIs based on which provides the most natural way to express a given transformation.
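
The idea that different APIs share one execution engine can be sketched with a toy model in plain Python. This is not Spark code and does not use any Spark API: the `Plan` class, the tiny SQL subset handled by `plan_from_sql`, the `Query` builder, and the sample `people` rows are all hypothetical. The sketch only shows two front ends producing the same plan, which a single `execute` function then runs.

```python
import re
from dataclasses import dataclass, replace
from typing import Any, Dict, List, Optional, Tuple

Row = Dict[str, Any]


@dataclass(frozen=True)
class Plan:
    """Toy logical plan: project `columns`, keeping rows where filter_column > threshold."""
    columns: Tuple[str, ...] = ()
    filter_column: Optional[str] = None
    threshold: Optional[float] = None


def execute(plan: Plan, rows: List[Row]) -> List[Row]:
    """The single execution engine used by both front ends."""
    if plan.filter_column is not None:
        rows = [r for r in rows if r[plan.filter_column] > plan.threshold]
    return [{c: r[c] for c in plan.columns} for r in rows]


_QUERY = re.compile(
    r"SELECT\s+(.+?)\s+FROM\s+\w+(?:\s+WHERE\s+(\w+)\s*>\s*(\d+(?:\.\d+)?))?",
    re.IGNORECASE,
)


def plan_from_sql(query: str) -> Plan:
    """Front end 1: parse 'SELECT a, b FROM t [WHERE c > n]' into a Plan."""
    match = _QUERY.fullmatch(query.strip())
    if match is None:
        raise ValueError(f"unsupported query: {query!r}")
    columns = tuple(name.strip() for name in match.group(1).split(","))
    threshold = float(match.group(3)) if match.group(3) else None
    return Plan(columns, match.group(2), threshold)


class Query:
    """Front end 2: a method-chaining builder that produces the same Plan."""

    def __init__(self, plan: Plan = Plan()) -> None:
        self.plan = plan

    def select(self, *columns: str) -> "Query":
        return Query(replace(self.plan, columns=tuple(columns)))

    def where_greater(self, column: str, threshold: float) -> "Query":
        return Query(replace(self.plan, filter_column=column, threshold=float(threshold)))


people = [
    {"name": "Ada", "age": 36},
    {"name": "Lin", "age": 28},
    {"name": "Sam", "age": 41},
]

sql_plan = plan_from_sql("SELECT name FROM people WHERE age > 30")
api_plan = Query().select("name").where_greater("age", 30).plan
print(sql_plan == api_plan)
print(execute(sql_plan, people))
```

Because both front ends reduce to the same plan, switching between them changes only how the computation is expressed, not how it is executed.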
    Sample Customers
    SAP HANA: Unilever, NHS 24, adidas Group, CHIO Aachen, Hamburg Port Authority (HPA), Bangkok Airways Public Company Limited
    Spark SQL: UC Berkeley AMPLab, Amazon, Alibaba Taobao, Kenshoo, Hitachi Solutions
    Top Industries
    SAP HANA, reviewers:
    • Manufacturing Company: 17%
    • Computer Software Company: 15%
    • Energy/Utilities Company: 10%
    • Retailer: 8%
    SAP HANA, visitors reading reviews:
    • Computer Software Company: 14%
    • Manufacturing Company: 14%
    • Financial Services Firm: 8%
    • Comms Service Provider: 6%
    Spark SQL, visitors reading reviews:
    • Financial Services Firm: 21%
    • Computer Software Company: 14%
    • University: 8%
    • Manufacturing Company: 5%
    Company Size
    SAP HANA, reviewers:
    • Small Business: 26%
    • Midsize Enterprise: 15%
    • Large Enterprise: 59%
    SAP HANA, visitors reading reviews:
    • Small Business: 19%
    • Midsize Enterprise: 13%
    • Large Enterprise: 67%
    Spark SQL, reviewers:
    • Small Business: 36%
    • Midsize Enterprise: 43%
    • Large Enterprise: 21%
    Spark SQL, visitors reading reviews:
    • Small Business: 13%
    • Midsize Enterprise: 13%
    • Large Enterprise: 74%

    SAP HANA is ranked 1st in Embedded Database with 79 reviews while Spark SQL is ranked 4th in Hadoop with 14 reviews. SAP HANA is rated 8.4, while Spark SQL is rated 7.8. The top reviewer of SAP HANA writes "Excellent compatibility between modules and the control". On the other hand, the top reviewer of Spark SQL writes "Offers the flexibility to handle large-scale data processing". SAP HANA is most compared with Oracle Database, SQL Server, MySQL, IBM Db2 Database and Apache Spark, whereas Spark SQL is most compared with Apache Spark, IBM Db2 Big SQL, HPE Ezmeral Data Fabric and Netezza Analytics. See our SAP HANA vs. Spark SQL report.


    We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.