
Spark SQL Primary Use Case

Engineering Manager/Solution architect at a computer software company with 201-500 employees

The primary use case of this solution is to function within a distributed ecosystem. Spark is part of EMR, a Hadoop distribution, and is one of the tools in that ecosystem. You are not working with Hadoop in a vacuum: you leverage Spark, Hive, and HBase together, because it is a distributed ecosystem. On its own, it has no value.

This solution can be deployed both on the cloud and on Cloudera distributions. 

Corporate Sales at a financial services firm with 10,001+ employees

We use it to gather all of our transaction data. We have Hadoop and Spark in our system, and we use some simple process flows to transport the data.

Associate Manager at a consultancy with 501-1,000 employees

I am using this solution for data validation and writing queries.
