Dataiku's features include a process scheduler, cloud-based processes, comprehensive functionalities, easy prediction deployment, and visual data preparation tools. Users appreciate its integration with languages like Python and R as well as BigQuery access. The interface is user-friendly, offering both code and visual options. Feature selection, model creation, workflow organization, collaboration, traceability, and automation are notable. It supports Spark for distributed computing and excels in data science, MLOps, and integration with public cloud platforms.
- "Our clients can easily drag and drop components and use them on the spot."
- "Dataiku is highly regarded as it is a leader in the Gartner ranking."
- "I believe the return on investment looks positive."
Dataiku faces challenges in server stability, query engine reliability, and user interface complexity. Improvements are needed in data exclusion, message broadcasting, and deep learning integration. Users call for better handling of complex data types and enhanced support for non-IT users. Pricing is considered high, and GitHub integration is problematic. Developers seek more flexibility in code-based components and support for parallel machine learning. Enhancements in error messaging, GPU integration, and commercial transparency are also desired.
- "There is room for improvement in terms of allowing for more code-based features."
- "The technical support from Dataiku is not good. The support team does not provide adequate assistance, and there are concerns about billing requests."
- "One area for improvement is the need for more capabilities similar to those provided by NVIDIA for parallel machine learning training. We still encounter some integration issues."