No more typing reviews! Try our Samantha, our new voice AI agent.
Spark SQL Logo

Spark SQL Reviews

Vendor: Apache
3.9 out of 5

What is Spark SQL?

Get the report
Helped 900,277 peers since 2012

Featured Spark SQL reviews

Spark SQL mindshare

As of June 2026, the mindshare of Spark SQL in the Hadoop category stands at 5.1%, down from 10.5% compared to the previous year, according to calculations based on PeerSpot user engagement data.
Hadoop Mindshare Distribution
ProductMindshare (%)
Spark SQL5.1%
Cloudera Distribution for Hadoop14.7%
Apache Spark13.9%
Other66.30000000000001%
Hadoop

PeerResearch reports based on Spark SQL reviews

TypeTitleDate
CategoryHadoopJun 22, 2026Download
ProductReviews, tips, and advice from real usersJun 22, 2026Download
ComparisonSpark SQL vs Apache SparkJun 22, 2026Download
ComparisonSpark SQL vs Cloudera Distribution for HadoopJun 22, 2026Download
ComparisonSpark SQL vs Amazon EMRJun 22, 2026Download
Suggested products
TitleRatingMindshareRecommending
Apache Spark4.213.9%90%69 interviewsAdd to research
Cloudera Distribution for Hadoop4.014.7%92%51 interviewsAdd to research
 
 
Key learnings from peers
Last updated Mar 27, 2026

Valuable Features

Room for Improvement

Pricing

Review data by company size

By reviewers
Company SizeCount
Small Business5
Midsize Enterprise6
Large Enterprise4
By reviewers
By visitors reading reviews
Company SizeCount
Small Business29
Midsize Enterprise8
Large Enterprise41
By visitors reading reviews

Top industries

By visitors reading reviews
Financial Services Firm
21%
University
12%
Healthcare Company
8%
Manufacturing Company
8%
Comms Service Provider
6%
Retailer
6%
Computer Software Company
6%
Construction Company
5%
Insurance Company
4%
Government
4%
Marketing Services Firm
4%
Outsourcing Company
4%
Performing Arts
4%
Real Estate/Law Firm
3%
Hospitality Company
1%
Import And Exporter
1%
Logistics Company
1%
Educational Organization
1%
Media Company
1%

Compare Spark SQL with alternative products

Learn more about Spark SQL

Spark SQL customers

Related questions

 
Spark SQL Reviews Summary
Author infoRatingReview Summary
Team Lead, Data Engineering at Nesine.com4.0I use Spark SQL for batch processing and transformations, appreciating its speed and Hive interoperability. However, its high resource consumption led me to migrate streaming jobs to Apache Flink, despite its ease of development.
Data engineer at Cocos pt3.5I use Spark SQL for data processing from various sources, integrating efficiently with our CI/CD workflow via Azure DevOps. It offers flexible and scalable data handling, although stability could be improved. Transitioning from Apache Hive enhanced our performance significantly.
Principal Consultant/Manager at Tenzing4.0We use PySpark for big data processing with Spark SQL on Microsoft Azure, appreciating its SQL connectivity and ease of use while suggesting improvements in documentation and SparkUI for better performance insights. Spark SQL facilitates complex task implementation using SQL.
Data Engineer at Behsazan Mellat4.5We use Spark SQL for business analytics in our HDFS environment to handle large data volumes efficiently. Its capability to run parallel queries is a key advantage over Python. Integration with data visualization tools like Tableau would enhance its functionality.
Data Engineer at BBD4.0We use Spark SQL for data engineering, transformation, and querying with around 30–40 users. Its powerful query language benefits us, but it has a steep learning curve. Previously, we used Panda and Dask, which were less scalable than Spark SQL.
Senior Analyst/ Customer Business and Insights Specialist at a tech services company with 501-1,000 employees4.0Our company uses Spark SQL for creating pipelines and data sets, finding it easy to use with basic SQL knowledge, especially for analytics within specific use cases. However, it could improve by offering more in-solution guidance on aggregate functions.
Lecturer at Amirkabir University of Technology4.0I use Spark SQL for data preparation and querying, valuing its flexible methods and good documentation. While I recommend it for being stable and scalable, I wish for more consistent syntax across different tasks.
CTO at Dokument IT d.o.o.5.0I used Spark SQL for analytics and statistical reports from content management platforms. The Thrift connection is valuable, but I've faced on-premise Delta Lake compatibility issues. The documentation lacks detail, especially for Thrift server setup, and interactive queries need improvement.
Engineering Manager/Solution architect at a computer software company with 201-500 employees4.0I find this solution stable, scalable, and useful within a distributed ecosystem, with straightforward setup and no licensing costs. I recommend it at 8/10, though it needs better EMR monitoring and integration.
Associate Manager at a consultancy with 501-1,000 employees5.0I use Spark SQL for data validation and queries. Its ease of use and validation are valuable, despite needing better integration. It's stable, scalable, free, and I rate it ten out of ten.
Kemal Duman - PeerSpot reviewer
Kemal Duman
Team Lead, Data Engineering at Nesine.com
Jan 21, 2026
Data pipelines have run faster and support flexible batch and streaming transformations
SurjitChoudhury - PeerSpot reviewer
SurjitChoudhury
Data engineer at Cocos pt
Nov 23, 2023
Offers the flexibility to handle large-scale data processing
Sahil Taneja - PeerSpot reviewer
Sahil Taneja
Principal Consultant/Manager at Tenzing
May 5, 2023
Easy to use and do not require a learning curve
Aria Amini - PeerSpot reviewer
Aria Amini
Data Engineer at Behsazan Mellat
Jul 26, 2023
A great solution for handling large volumes of data by parallel queries
Lucas Dreyer - PeerSpot reviewer
Lucas Dreyer
Data Engineer at BBD
Jan 4, 2023
Processing solution used for data engineering and transformation with the ability to process large datasets
KM
Keshav Mandal
Senior Analyst/ Customer Business and Insights Specialist at a tech services company with 501-1,000 employees
Nov 22, 2022
Analytics are easy because data is contained within each use case
Mahdi Sharifmousavi - PeerSpot reviewer
Mahdi Sharifmousavi
Lecturer at Amirkabir University of Technology
Aug 10, 2022
Incorporates regular SQL syntax within tasks and very useful for querying and depicting data
SB
Slaven Batnozic
CTO at Dokument IT d.o.o.
Aug 18, 2023
If implemented well, the solution is highly compatible and great for data analysis
reviewer1724670 - PeerSpot reviewer
reviewer1724670
Engineering Manager/Solution architect at a computer software company with 201-500 employees
Dec 2, 2021
Useful tool within a distributed ecosystem
reviewer1488372 - PeerSpot reviewer
reviewer1488372
Associate Manager at a consultancy with 501-1,000 employees
May 29, 2021
Easy to use, reliable, and useful data validation