Try our new research platform with insights from 80,000+ expert users

IBM Analytics Engine vs Spark SQL comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

IBM Analytics Engine
Ranking in Hadoop
10th
Average Rating
8.0
Number of Reviews
1
Ranking in other categories
No ranking in other categories
Spark SQL
Ranking in Hadoop
5th
Average Rating
7.8
Reviews Sentiment
7.6
Number of Reviews
14
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of June 2025, in the Hadoop category, the mindshare of IBM Analytics Engine is 2.4%, up from 0.9% compared to the previous year. The mindshare of Spark SQL is 10.6%, down from 11.8% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Hadoop
 

Featured Reviews

Saket Pandey - PeerSpot reviewer
Good solution for small and medium-sized businesses and highly stable
I would advise instead of only going through other reviews; it would be great if you could schedule a talk with the IBM team that would be helping you implement this solution. They would deep dive into the process and protocols you are currently set up in, and then they will provide you an optimal solution and optimal price. So I believe talking with the support team was really amazing. They even helped us in some other parts as well. It is a good solution for small and medium-sized businesses. Overall, I would rate the solution an eight out of ten because of the support team. They were able to resolve issues, which helped us deploy higher-grade solutions correctly and quickly. We were able to ensure that our processes were working correctly, and we saved about 15-16% of a week's time by using this solution. In terms of return on investment, we saved about $7,000 a month.
SurjitChoudhury - PeerSpot reviewer
Offers the flexibility to handle large-scale data processing
My experience with the initial setup of Spark SQL was relatively smooth. Understanding the system wasn't overly difficult because the data was structured in databases, and we could use notebooks for coding in Python or Java. Configuring networks and running scripts to load data into the database were routine tasks that didn't pose significant challenges. The flexibility to use different languages for coding and the ability to process data using key-value pairs in Python made the setup adaptable. Once we received the source data, processing it in SparkSQL involved writing scripts to create dimension and fact tables, which became a standard part of our workflow. Setting up Spark SQL was reasonably quick, but sometimes we face performance issues, especially during data loading into the SQL Server data warehouse. Sequencing notebooks for efficient job runs is crucial, and managing complex tasks with multiple notebooks requires careful tracking. Exploring ways to optimize this process could be beneficial. However, once you are familiar with the database architecture and project tools, understanding and adapting to the system become more straightforward.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The best part was that we could make minor changes in the way we were bifurcating the data, even at a very small scale. The accuracy of conversion was also very high."
"The solution is easy to understand if you have basic knowledge of SQL commands."
"I find the Thrift connection valuable."
"The speed of getting data."
"The team members don't have to learn a new language and can implement complex tasks very easily using only SQL."
"The performance is one of the most important features. It has an API to process the data in a functional manner."
"This solution is useful to leverage within a distributed ecosystem."
"The stability was fine. It behaved as expected."
"Certain data sets that are very large are very difficult to process with Pandas and Python libraries. Spark SQL has helped us a lot with that."
 

Cons

"One area for improvement would be the initial setup stage, which took longer than expected."
"There are many inconsistencies in syntax for the different querying tasks."
"In the next update, we'd like to see better performance for small points of data. It is possible but there are better tools that are faster and cheaper."
"In terms of improvement, the only thing that could be enhanced is the stability aspect of Spark SQL."
"This solution could be improved by adding monitoring and integration for the EMR."
"It would be useful if Spark SQL integrated with some data visualization tools."
"There should be better integration with other solutions."
"The solution needs to include graphing capabilities. Including financial charts would help improve everything overall."
"Anything to improve the GUI would be helpful."
 

Pricing and Cost Advice

Information not available
"We use the open-source version, so we do not have direct support from Apache."
"We don't have to pay for licenses with this solution because we are working in a small market, and we rely on open-source because the budgets of projects are very small."
"The solution is open-sourced and free."
"The on-premise solution is quite expensive in terms of hardware, setting up the cluster, memory, hardware and resources. It depends on the use case, but in our case with a shared cluster which is quite large, it is quite expensive."
"The solution is bundled with Palantir Foundry at no extra charge."
"There is no license or subscription for this solution."
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
856,873 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
No data available
Financial Services Firm
18%
Computer Software Company
14%
Retailer
10%
Healthcare Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

Ask a question
Earn 20 points
What do you like most about Spark SQL?
Spark SQL's efficiency in managing distributed data and its simplicity in expressing complex operations make it an essential part of our data pipeline.
What is your experience regarding pricing and costs for Spark SQL?
We don't have to pay for licenses with this solution because we are working in a small market, and we rely on open-source because the budgets of projects are very small.
What needs improvement with Spark SQL?
In terms of improvement, the only thing that could be enhanced is the stability aspect of Spark SQL. There could be additional features that I haven't explored but the current solution for working ...
 

Overview

 

Sample Customers

Information Not Available
UC Berkeley AMPLab, Amazon, Alibaba Taobao, Kenshoo, Hitachi Solutions
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: June 2025.
856,873 professionals have used our research since 2012.