Try our new research platform with insights from 80,000+ expert users

Apache Spark vs IBM Netezza Performance Server comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Mar 1, 2026

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score
5.6
Apache Spark provides up to 50% cost savings, boosting efficiency and reducing expenses significantly in machine learning analytics.
Sentiment score
7.1
IBM Netezza Performance Server improves data query speed and efficiency, enhancing business performance and cost savings through compression.
 

Customer Service

Sentiment score
6.0
Apache Spark offers vibrant community support and resources, with commercial support available through vendors like Cloudera and Hadoop.
Sentiment score
6.4
IBM Netezza Performance Server praised for knowledgeable support; mixed feedback on resolution speed post-acquisition, yet communication remains consistent.
I would rate the technical support of Apache Spark an eight because when we had questions, we found solutions, and it was straightforward.
Consultant, Chief Engineer, Teamleiter at infoteam Software AG
I have received support via newsgroups or guidance on specific discussions, which is what I would expect in an open-source situation.
Data Architect at Devtech
Technical support is very costly for me, accounting for twenty-five to thirty percent of the product cost.
Project Manager at MAF Retail
 

Scalability Issues

Sentiment score
7.4
Apache Spark's scalability and versatility enable efficient large-scale data processing, making it a reliable choice for diverse teams.
Sentiment score
6.1
IBM Netezza struggles with scalability, requiring extra hardware for expansion, prompting users to consider cloud alternatives for growth.
It is provided as a pre-configured box, and scaling is not an option.
Project Manager at MAF Retail
 

Stability Issues

Sentiment score
7.4
Apache Spark is praised for its robust stability and reliability, with high user ratings despite minor configuration challenges.
Sentiment score
7.8
IBM Netezza Performance Server is stable, offering high uptime and reliable performance, with occasional issues due to maintenance or compatibility.
Apache Spark resolves many problems in the MapReduce solution and Hadoop, such as the inability to run effective Python or machine learning algorithms.
Data Engineer at a tech company with 10,001+ employees
Without a doubt, we have had some crashes because each situation is different, and while the prototype in my environment is stable, we do not know everything at other customer sites.
Data Architect at Devtech
 

Room For Improvement

Apache Spark needs improvements in real-time querying, user-friendliness, logging, large dataset handling, and expanded programming language support.
IBM Netezza struggles with scalability, user interface, query performance, big data support, and high costs, needing better tools and integration.
I find that there really lacks the technical depth to do any recommendations for future updates of Apache Spark.
Consultant, Chief Engineer, Teamleiter at infoteam Software AG
Various tools like Informatica, TIBCO, or Talend offer specific aspects, licensing can be costly;
Data Architect at Devtech
The cloud version is only available in AWS, and in the Middle East, it is not well-developed in the Azure environment.
Project Manager at MAF Retail
 

Setup Cost

Apache Spark is cost-effective but can incur high infrastructure costs, especially in cloud setups like Databricks, with setup time variability.
IBM Netezza offers high performance and low maintenance but is considered costly, especially for mid-sized organizations, with significant licensing fees.
 

Valuable Features

Apache Spark provides scalable, in-memory data processing with flexible support for distributed computing, streaming, and machine learning integration.
IBM Netezza Performance Server delivers fast analytics, ease of use, robust support, and efficient data warehousing with minimal maintenance.
The most important part is that everything can be connected, and the data exchange across overseas connections is fast and reliable.
Consultant, Chief Engineer, Teamleiter at infoteam Software AG
Apache Spark is the solution, and within it, you have PySpark, which is the API for Apache Spark to write and run Python code.
Data Engineer at a tech company with 10,001+ employees
The solution is beneficial in that it provides a base-level long-held understanding of the framework that is not variant day by day, which is very helpful in my prototyping activity as an architect trying to assess Apache Spark, Great Expectations, and Vault-based solutions versus those proposed by clients like TIBCO or Informatica.
Data Architect at Devtech
It operates as a high-speed data warehouse, which is essential for handling big data.
Project Manager at MAF Retail
 

Categories and Ranking

Apache Spark
Ranking in Hadoop
1st
Average Rating
8.4
Reviews Sentiment
6.9
Number of Reviews
69
Ranking in other categories
Compute Service (5th), Java Frameworks (2nd)
IBM Netezza Performance Server
Ranking in Hadoop
6th
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
45
Ranking in other categories
Data Warehouse (13th)
 

Mindshare comparison

As of March 2026, in the Hadoop category, the mindshare of Apache Spark is 13.3%, down from 18.6% compared to the previous year. The mindshare of IBM Netezza Performance Server is 6.2%, up from 1.7% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Hadoop Mindshare Distribution
ProductMindshare (%)
Apache Spark13.3%
IBM Netezza Performance Server6.2%
Other80.5%
Hadoop
 

Featured Reviews

Devindra Weerasooriya - PeerSpot reviewer
Data Architect at Devtech
Provides a consistent framework for building data integration and access solutions with reliable performance
The in-memory computation feature is certainly helpful for my processing tasks. It is helpful because while using structures that could be held in memory rather than stored during the period of computation, I go for the in-memory option, though there are limitations related to holding it in memory that need to be addressed, but I have a preference for in-memory computation. The solution is beneficial in that it provides a base-level long-held understanding of the framework that is not variant day by day, which is very helpful in my prototyping activity as an architect trying to assess Apache Spark, Great Expectations, and Vault-based solutions versus those proposed by clients like TIBCO or Informatica.
Shiv Subramaniam Koduvayur - PeerSpot reviewer
Project Manager at MAF Retail
Parallel data processing streamlines operations while cost and cloud integration challenge adoption
The cost of the solution is on the more expensive side, which is a concern for me. Additionally, its promotion and interaction with cloud applications are limited. The cloud version is only available in AWS, and in the Middle East, it is not well-developed in the Azure environment. For the cost to be reduced, it should match competitors. Many features need to be incorporated on the cloud.
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
884,873 professionals have used our research since 2012.
 

Comparison Review

it_user232068 - PeerSpot reviewer
Senior Data Architect at a pharma/biotech company with 1,001-5,000 employees
Aug 5, 2015
Netezza vs. Teradata
Original published at https://www.linkedin.com/pulse/should-i-choose-net Two leading Massively Parallel Processing (MPP) architectures for Data Warehousing (DW) are IBM PureData System for Analytics (formerly Netezza) and Teradata. I thought talking about the similarities and differences…
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Manufacturing Company
8%
Computer Software Company
7%
Comms Service Provider
6%
Financial Services Firm
20%
Manufacturing Company
9%
Comms Service Provider
7%
Performing Arts
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business28
Midsize Enterprise16
Large Enterprise32
By reviewers
Company SizeCount
Small Business9
Midsize Enterprise5
Large Enterprise33
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Apache Spark is open-source, so it doesn't incur any charges.
What needs improvement with Apache Spark?
I find that there really lacks the technical depth to do any recommendations for future updates of Apache Spark. I used it for two years for our prototype work and testing things, but because I had...
What needs improvement with IBM Netezza Performance Server?
The cost of the solution is on the more expensive side, which is a concern for me. Additionally, its promotion and interaction with cloud applications are limited. The cloud version is only availab...
What advice do you have for others considering IBM Netezza Performance Server?
The solution has generally received positive feedback from me and is recommended for continued use by end users. However, the product cost is high compared to others in the market, and this cost ha...
 

Also Known As

No data available
Netezza Performance Server, Netezza, Netezza Analytics
 

Overview

 

Sample Customers

NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
Seattle Childrens Hospital, Carphone Warehouse, Vanderbilt University School of Medicine, Battelle, Start Today Co. Ltd., Kelley Blue Book, Trident Marketing, Elisa Corporation, Catalina Marketing, iBasis, Barnes & Noble, Qualcomm, MediaMath, Acxiom, iBasis, Foxwoods
Find out what your peers are saying about Apache Spark vs. IBM Netezza Performance Server and other solutions. Updated: March 2026.
884,873 professionals have used our research since 2012.