We performed a comparison between Apache Spark and Zadara based on real PeerSpot user reviews.
Find out in this report how the two Compute Service solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."The most valuable feature of Apache Spark is its flexibility."
"The product is useful for analytics."
"The deployment of the product is easy."
"The features we find most valuable are the machine learning, data learning, and Spark Analytics."
"It is highly scalable, allowing you to efficiently work with extensive datasets that might be problematic to handle using traditional tools that are memory-constrained."
"Now, when we're tackling sentiment analysis using NLP technologies, we deal with unstructured data—customer chats, feedback on promotions or demos, and even media like images, audio, and video files. For processing such data, we rely on PySpark. Beneath the surface, Spark functions as a compute engine with in-memory processing capabilities, enhancing performance through features like broadcasting and caching. It's become a crucial tool, widely adopted by 90% of companies for a decade or more."
"This solution provides a clear and convenient syntax for our analytical tasks."
"Its scalability and speed are very valuable. You can scale it a lot. It is a great technology for big data. It is definitely better than a lot of earlier warehouse or pipeline solutions, such as Informatica. Spark SQL is very compliant with normal SQL that we have been using over the years. This makes it easy to code in Spark. It is just like using normal SQL. You can use the APIs of Spark or you can directly write SQL code and run it. This is something that I feel is useful in Spark."
"Zadara Storage Cloud having 24/7 management saves me support and engineering costs because the storage and computing are managed by a third-party. We are able to focus more attention on the customer, which is truly our core business. Even at 1:00 AM or 2:00 AM at night, someone will answer, which is important."
"A nice feature is the immutable object storage, which can be used in conjunction with Veeam."
"The most valuable feature is the flexibility in terms of deployment options."
"One of the most valuable features is its integration with other cloud solutions. We have a presence within Amazon EC2 and we leverage compute instances in there. Being able to integrate with compute, both locally within Zadara, as well as with other cloud vendors such as Amazon, is very helpful, while also being able to maintain extremely low latency between those connections."
"The processing is much faster with this product."
"Being able to scale on demand, and being able to get out of our security operation center, and not having to purchase hardware upfront, has drastically reduced the overhead that was required to maintain our information. We have also gained additional capabilities in terms of speed of replicating that information."
"The most valuable features of Zadara are its visibility and simplicity to use."
"The most valuable feature of Zadara is its ease of use and safety. Overall the solution is a complete package, it has all the features needed."
"When you want to extract data from your HDFS and other sources then it is kind of tricky because you have to connect with those sources."
"Apache Spark can improve the use case scenarios from the website. There is not any information on how you can use the solution across the relational databases toward multiple databases."
"It would be beneficial to enhance Spark's capabilities by incorporating models that utilize features not traditionally present in its framework."
"Apart from the restrictions that come with its in-memory implementation. It has been improved significantly up to version 3.0, which is currently in use."
"We've had problems using a Python process to try to access something in a large volume of data. It crashes if somebody gives me the wrong code because it cannot handle a large volume of data."
"It requires overcoming a significant learning curve due to its robust and feature-rich nature."
"The solution’s integration with other platforms should be improved."
"In data analysis, you need to take real-time data from different data sources. You need to process this in a subsecond, do the transformation in a subsecond, and all that."
"The range of support of VMware could be better. It can support Windows, however, it cannot support other operating systems like IBM AIX. This needs to improve."
"The initial setup of the solution is complex."
"There are still some storage features that they lack. For example, other vendors implemented the auto-tiering feature a long time ago, while Zadara Storage Cloud is just coming out with this feature today. So, they are a little bit late compared to the market."
"Having iSCSI over the internet using a VPN, the IPSec tunnel is really the only thing that I find missing from this product."
"Cost-wise, because it's a pay-per-use model, it may ultimately end up costing us more in the long run than something we developed ourselves."
"The management interface is more geared towards end-users rather than a service partner like ourselves, and there are improvements that can be made around that."
"In the next release, there can be some improvements to the web console by adding more features because the console is simple. Additionally, the calculator could improve."
"Some of the features are a little bit slow to come to market."
Apache Spark is ranked 5th in Compute Service with 60 reviews while Zadara is ranked 9th in Compute Service with 9 reviews. Apache Spark is rated 8.4, while Zadara is rated 8.8. The top reviewer of Apache Spark writes "Reliable, able to expand, and handle large amounts of data well". On the other hand, the top reviewer of Zadara writes "We're able to scale up or down almost instantly, and changes are handled efficiently by their managed services team ". Apache Spark is most compared with Spring Boot, AWS Batch, Spark SQL, SAP HANA and Cloudera Distribution for Hadoop, whereas Zadara is most compared with MinIO, Amazon S3, Nutanix Cloud Infrastructure (NCI), Wasabi and Red Hat Ceph Storage. See our Apache Spark vs. Zadara report.
See our list of best Compute Service vendors.
We monitor all Compute Service reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.