


Find out what your peers are saying about Databricks, Dataiku, Knime and others in Data Science Platforms.
Tasks that earlier took hours in Excel or SQL are now completed in minutes.
From a time-saving perspective, we saved 60 to 75 percent of the human workforce needed and eliminated other disparate ETL tools, ultimately saving us over 600,000 dollars.
Alteryx helps familiarize managers with artificial intelligence-driven possibilities.
This reduction in both time and money resulted in real-time impact and significant cost savings.
For a lot of different tasks, including machine learning, it is a nice solution.
When it comes to big data processing, I prefer Databricks over other solutions.
The market is competitive, and Dataiku must adopt a consumption-based model instead of the current monthly model.
I consider the return on investment with Dataiku valuable because for us, it is one single platform where all our data scientists come together and work on any model building, so it is collaboration, plus having everything in one place, organized, having proper project management, and then built-in capabilities which help to facilitate model building.
It is a good return on investment since it helps save a lot of time, and it's easy for my teammates to work cross-functionally on the same project.
Customer support from Alteryx has been amazing.
they do have a website for resolving doubts and accessing helpful resources, including various tools and filters.
I contacted customer support once or twice, and they were quick to respond.
Whenever we reach out, they respond promptly.
As of now, we are raising issues and they are providing solutions without any problems.
I would give Databricks customer support a rating of ten.
Dataiku partners with local industry experts who understand the business better and provide support.
The support team does not provide adequate assistance.
They should not take the complaints so lightly.
Alteryx is scalable for most enterprise analytics and data preparation workloads.
Suggestions for improvements in Alteryx include areas for increasing efficiency, particularly in processing telemetry data, which involves dealing with large volumes of unstructured data.
Alteryx is scalable, and I would give it eight out of ten.
The sky's the limit with Databricks.
The patches have sometimes caused issues leading to our jobs being paused for about six hours.
Databricks is an easily scalable platform.
Dataiku is quite scalable, as long as I can pay for more licenses, there is no technical limitation.
Dataiku's scalability is pretty good; I can scale the projects very easily, and clear guidance is given as well.
I didn't need to reach out to Alteryx for support because available documents usually provide enough information to resolve issues.
I have not encountered any lagging, crashing, or instability in the system during these three months of usage.
Alteryx is a reliable tool, but it is also very heavy, requiring good laptop configurations, a minimum of 8GB RAM, and a recent processor such as i10.
They release patches that sometimes break our code.
Although it is too early to definitively state the platform's stability, we have not encountered any issues so far.
Databricks is definitely a very stable product and reliable.
It would help if there was a backup proposition in place to avoid hampering our work due to updates.
For around ten percent of the day, it is usually down, and we are unable to do work on it.
In terms of stabilization, if my data has no outlier creation in the raw data, then it is quite stable.
The tool could include more native connectors, such as for global ERPs, instead of requiring additional fees for these connections.
The support structure changed; initially, we received great support, however, it later became less reliable due to licensing issues and a tiered support system.
The additional features that Alteryx needs to work on to make it more competitive include better collaboration and easier integration through API.
Adjusting features like worker nodes and node utilization during cluster creation could mitigate these failures.
We prefer using a small to mid-sized cluster for many jobs to keep costs low, but this sometimes doesn't support our operations properly.
We use MLflow for managing MLOps, however, further improvement would be beneficial, especially for large language models and related tools.
Someone who needs to do coding can do it, and someone who does not know coding can also build solutions.
The license is very expensive.
I would love for Dataiku to allow more flexibility with code-based components and provide the possibility to extend it by developing and integrating custom components easily with existing ones.
The price is very high, with licensing typically starting around five thousand dollars plus user per year.
We found excellent use cases for automation through Alteryx, which provided the means to reduce operational costs and streamline the build of ETL pipelines without extensive coding.
Alteryx is more cost-effective compared to Informatica licenses, offering savings.
It is not a cheap solution.
I believe that in terms of credits for Databricks, we're spending between £15,000 and £20,000 a month.
My experience with pricing, implementation costs, and licensing is that it is very efficient and very fast.
There are no extra expenses beyond the existing licensing cost.
I find the pricing of Dataiku quite affordable for our customers, as they are usually large companies.
The pricing for Dataiku is very high, which is its biggest downside.
Alteryx not only represents data but also supports decision-making by suggesting the next steps.
Analysts who do not have any coding experience can still work on the transformation and preparation of data, which is quite useful.
Alteryx includes built-in tools such as drive time analysis and linear regression, which are much harder to achieve in standard BI tools such as Power BI or Tableau.
Databricks' capability to process data in parallel enhances data processing speed.
The platform allows us to leverage cloud advantages effectively, enhancing our AI and ML projects.
The Unity Catalog is for data governance, and the Delta Lake is to build the lakehouse.
This feature is useful because it simplifies tasks and eliminates the need for a data scientist.
Dataiku primarily enhances the speed at which our customers can develop or train their machine learning models because it is a drag-and-drop platform.
It offers most of the capabilities required for data science, MLOps, and LLMOps.
| Product | Mindshare (%) |
|---|---|
| Databricks | 7.6% |
| Dataiku | 5.2% |
| Alteryx | 3.7% |
| Other | 83.5% |


| Company Size | Count |
|---|---|
| Small Business | 33 |
| Midsize Enterprise | 16 |
| Large Enterprise | 56 |
| Company Size | Count |
|---|---|
| Small Business | 27 |
| Midsize Enterprise | 12 |
| Large Enterprise | 57 |
| Company Size | Count |
|---|---|
| Small Business | 6 |
| Midsize Enterprise | 2 |
| Large Enterprise | 13 |
Alteryx provides user-friendly, no-code tools for data blending, preparation, and analysis. Its drag-and-drop interface and in-database capabilities simplify integration with data sources while maintaining data integrity.
Alteryx offers a comprehensive suite for automation of data workflows, reducing manual tasks and enhancing processing efficiency. Known for robust predictive and spatial analytics, it effectively handles large datasets. The platform's flexibility allows for custom script deployments, supported by a strong community. However, Alteryx faces challenges with high pricing, lack of cloud support, and limited data visualization tools. Users express a need for more in-built data science functionalities, improved API integration, and a smoother learning curve. Connectivity and documentation gaps, along with complex workflows, are noted concerns, suggesting areas for enhancement. Alteryx is widely used for tasks like ETL processes, data preparation, predictive modeling, and report generation, supporting functions like financial projections and spatial analysis.
What features define Alteryx?Alteryx is implemented across industries for diverse needs such as anomaly detection in finance, customer segmentation in marketing, and tax automation in auditing. Teams leverage its capabilities for data blending and predictive modeling to enhance operational efficiency and address specific business needs effectively.
Databricks offers a scalable, versatile platform that integrates seamlessly with Spark and multiple languages, supporting data engineering, machine learning, and analytics in a unified environment.
Databricks stands out for its scalability, ease of use, and powerful integration with Spark, multiple languages, and leading cloud services like Azure and AWS. It provides tools such as the Notebook for collaboration, Delta Lake for efficient data management, and Unity Catalog for data governance. While enhancing data engineering and machine learning workflows, it faces challenges in visualization and third-party integration, with pricing and user interface navigation being common concerns. Despite needing improvements in connectivity and documentation, it remains popular for tasks like real-time processing and data pipeline management.
What features make Databricks unique?
What benefits can users expect from Databricks?
In the tech industry, Databricks empowers teams to perform comprehensive data analytics, enabling them to conduct extensive ETL operations, run predictive modeling, and prepare data for SparkML. In retail, it supports real-time data processing and batch streaming, aiding in better decision-making. Enterprises across sectors leverage its capabilities for creating secure APIs and managing data lakes effectively.
Dataiku Data Science Studio is acclaimed for its versatile capabilities in advanced analytics, data preparation, machine learning, and visualization. It streamlines complex data tasks with an intuitive visual interface, supports multiple languages like Python, R, SQL, and scales efficiently for large dataset handling, boosting organizational efficiency and collaboration.