Try our new research platform with insights from 80,000+ expert users
Apache Spark Logo

Apache Spark Reviews

Vendor: Apache
4.2 out of 5
Badge Ranked 1

What is Apache Spark?

Featured Apache Spark reviews

Apache Spark mindshare

As of August 2025, the mindshare of Apache Spark in the Hadoop category stands at 19.2%, down from 20.1% compared to the previous year, according to calculations based on PeerSpot user engagement data.
Hadoop Market Share Distribution
ProductMarket Share (%)
Apache Spark19.2%
Cloudera Distribution for Hadoop23.3%
HPE Ezmeral Data Fabric14.8%
Other42.7%
Hadoop

PeerResearch reports based on Apache Spark reviews

TypeTitleDate
CategoryHadoopAug 27, 2025Download
ProductReviews, tips, and advice from real usersAug 27, 2025Download
ComparisonApache Spark vs Cloudera Distribution for HadoopAug 27, 2025Download
ComparisonApache Spark vs Amazon EMRAug 27, 2025Download
ComparisonApache Spark vs HPE Ezmeral Data FabricAug 27, 2025Download
Suggested products
TitleRatingMindshareRecommending
Spring Boot4.2N/A95%38 interviewsAdd to research
Jakarta EE3.7N/A66%3 interviewsAdd to research
 
 
Key learnings from peers

Valuable Features

Room for Improvement

ROI

Pricing

Popular Use Cases

Service and Support

Deployment

Scalability

Stability

Review data by company size

By reviewers
Company SizeCount
Small Business24
Midsize Enterprise13
Large Enterprise25
By reviewers
By visitors reading reviews
Company SizeCount
Small Business131
Midsize Enterprise57
Large Enterprise470
By visitors reading reviews

Top industries

By visitors reading reviews
Financial Services Firm
26%
Computer Software Company
11%
Manufacturing Company
7%
Comms Service Provider
7%
University
5%
Government
5%
Retailer
5%
Insurance Company
4%
Educational Organization
4%
Healthcare Company
4%
Real Estate/Law Firm
2%
Construction Company
2%
Media Company
2%
Non Profit
2%
Recreational Facilities/Services Company
2%
Legal Firm
1%
Hospitality Company
1%
Outsourcing Company
1%
Performing Arts
1%
Pharma/Biotech Company
1%
Energy/Utilities Company
1%
Transportation Company
1%
Consumer Goods Company
1%
 
Apache Spark Reviews Summary
Author infoRatingReview Summary
Data Engineer at a tech company with 10,001+ employees5.0I use Apache Spark for real-time data processing and transformation across multiple sources like CRM and Siebel. It's reliable, fast, and improves our decision-making, though I see future needs for better integration with emerging cloud solutions.
Senior Developer at Infosys3.5No summary available
Head of Data at a energy/utilities company with 51-200 employees4.0Apache Spark significantly reduced operational costs by 50% and although it supports parallel processing, it needs improvements in scalability and user-friendliness. Working with datasets isn't as straightforward as with Pandas, though it's flexible and functional.
Senior Software Architect at USEReady4.0No summary available
Head of Data Science center of excellence at Ameriabank CJSC4.0I use Apache Spark primarily for in-memory processing of big data, which is valuable for tasks like running ML algorithms. Although its Pandas UDF support is advantageous, the Java overhead and performance issues suggest alternatives may be preferable.
Sr Manager at a transportation company with 10,001+ employees4.5I use Apache Spark for real-time data processing and ETL tasks. It offers unparalleled features but faces limitations due to its in-memory implementation. Despite improvements in version 3.0, reducing costs and addressing memory issues would enhance it further.
Data Scientist at a financial services firm with 10,001+ employees4.5I primarily use Apache Spark for data processing tasks involving large datasets, appreciating its ease of use and portability. While it's efficient for both small and large datasets, the lack of support for geospatial data is a limitation.
Data engineer at Cocos pt4.5We use Apache Spark primarily for Spark SQL and occasionally Spark Streaming, processing data from sources like SAP and Azure Data Warehouse. Its in-memory processing significantly outperforms Hadoop, offering faster data handling and enhanced query optimization.