2019-07-10T12:01:00Z

What advice do you have for others considering Apache Spark?


If you were talking to someone whose organization is considering Apache Spark, what would you say?

How would you rate it and why? Any other tips or advice?

ITCS user
Guest
14 Answers

Top 5, Leaderboard, Real User

I have the solution installed on my computer and on our servers. You can use it on-premises or as a SaaS. I'd rate the solution at a nine out of ten. I've been very pleased with its capabilities. I would recommend the solution for people who need to deploy projects with streaming. If you have many different sources or different types of data and you need to put everything in the same place, like a data lake, Spark currently has the right tools. It's an important solution for data science, for data detectors. You can put all of the information in one place with Spark.

2021-08-18T14:51:07Z
Real User

I would recommend Apache Spark to new users, but it depends on the use case. Sometimes, it's not the best solution. On a scale from one to ten, I would give Apache Spark a ten.

2021-03-27T15:39:24Z
Top 5, Leaderboard, Real User

I would definitely recommend Spark. It is a great product. I like Spark a lot, and most of the features have been quite good. Its initial learning curve is a bit high, but as you learn it, it becomes very easy. I would rate Apache Spark an eight out of ten.

2021-02-01T12:04:16Z
Top 10, Leaderboard, Real User

I would advise planning well before implementing this solution. In enterprise corporations like ours, there are a lot of policies. You should first find out your needs, and after that, you or your team should set it up based on your needs. If your needs change during development because of the business requirements, it will be very difficult. If you are clear about your needs, it is easier to set it up. If you know how Spark is used in your project, you have to define firewall rules and cluster needs. When you set up Spark, it should be ready for people's usage, especially for remote job execution. I would rate Apache Spark a nine out of ten.

2020-10-28T02:27:29Z
Top 5, Leaderboard, Real User

We're customers and also partners with Apache. While we are on version 2.6, we are considering upgrading to version 3.0. I'd rate the solution nine out of ten. It works very well for us and suits our purposes almost perfectly.

2020-07-23T07:58:35Z
Top 20, Real User

I would say that for some use cases we don't have to go to Apache Spark; they can be implemented using an ordinary Python, Go, or Java application. For other use cases, if leveraging Apache Spark gives better performance and reduces processing time, we can go for Apache Spark. I would rate Apache Spark 9 out of 10 for use cases that require it. I would advise using existing cloud services for implementing Apache Spark.

2020-06-10T05:37:19Z
Top 20, Consultant

I would rate this solution an eight out of ten.

2020-02-02T10:42:14Z
Consultant

The advice that I would give to someone considering this solution is that the quality of data has key streaming capabilities like velocity. This means how quickly you are going to refer to the data. These things matter when designing the solution. We need to take these things out. I would rate Apache Spark an eight out of ten. To make it a ten, they should improve the speed. The data storage capacity means we can inject somewhere in the user database in more efficient ways.

2020-01-29T11:22:00Z
Top 5, Real User

I would rate it a nine out of ten.

2020-01-29T11:22:00Z
Consultant

On a scale of 1 to 10, I'd put it at an eight. To make it a perfect 10, I'd like to see an improved configuration bot. Sometimes it is a nightmare on Linux trying to figure out what happened with the configuration and back end. So I think installation and configuration could be improved with some other tools. We are technical people, so we could figure it out, but if aspects like that were improved, then other people who are less technical would use it, and it would be more adaptable to the end user.

2019-12-23T07:05:00Z
Real User

We use both on-premises and public and private cloud deployment models. We're partners with Databricks. I'm a consultant. Our company works for large enterprises such as banks and energy companies. 17 of our workers use Apache Spark. With the cloud, there are many companies that integrate Spark. Most projects in big data around the world use Spark, indirectly or directly. I'd rate the solution eight out of ten.

2019-12-09T10:58:00Z
Consultant

The work that we are doing with this solution is quite common and is very easy to do. My advice for anybody who is implementing this solution is to look at their needs and then look at the community. Normally, there are a lot of people who have already done what you need. So, even without experience, it is quite simple to do a lot of things. I would rate this solution a nine out of ten.

2019-10-13T05:48:00Z
Real User

I would rate this solution eight out of 10.

2019-07-14T10:21:00Z
Real User

I would recommend the solution. I would rate it an eight or nine out of 10. For some areas, I would give it ten but I cannot use some parts. If you are going to use it for a consumer then I would be able to recommend it and you should go ahead. It doesn't work for me as I have different clients and different engagements.

2019-07-10T12:01:00Z
Learn what your peers think about Apache Spark. Get advice and tips from experienced pros sharing their opinions. Updated: January 2022.