"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"It is really easy to set up and the interface is easy to use."
"In StreamSets, everything is in one place."
"Ab Initio reaches the highest performance and is very flexible in processing huge amounts of data."
"I like everything about this product, but the biggest thing is the ease of use."
"I like the way that you can use the context variables, and how you can work those context variables to give you values and settings for every development environment, such as PROD, TEST, and DEV."
"The most valuable feature is integration."
"The most valuable feature is the data loading and scripting language"
"The features that I like the most are the simplicity of the interface, and the ability to quickly develop with a predefined component."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"An awesome improvement would be big data solutions, for example, implementing some kind of business intelligence or neural networks for artificial intelligence."
"They lack in memory capacity."
"I've had some issues with bugs causing crashes, especially when making changes to the system or with the monthly upgrades to Studio they've introduced."
"Performance and speed could be improved."
"I would like to sync a project and do an upload from that current version, and then from GitLab, be able to download the latest one."
"I think they should drive toward AI and machine learning. They could include a machine-learning algorithm for the deduplication."
More Talend Data Management Platform Pricing and Cost Advice →
Ab Initio Co>Operating System is ranked 24th in Data Integration Tools with 1 review while Talend Data Management Platform is ranked 13th in Data Integration Tools with 5 reviews. Ab Initio Co>Operating System is rated 10.0, while Talend Data Management Platform is rated 8.4. The top reviewer of Ab Initio Co>Operating System writes "High performance and flexible solution for companies with large amounts of data". On the other hand, the top reviewer of Talend Data Management Platform writes "User-friendly, stable, and handles different context variables well". Ab Initio Co>Operating System is most compared with SSIS, Collibra Catalog, AWS Glue, Oracle Data Integrator (ODI) and Talend Open Studio, whereas Talend Data Management Platform is most compared with IBM InfoSphere DataStage, Talend Data Fabric, Talend Open Studio, SAP Data Services and Palantir Foundry.
See our list of best Data Integration Tools vendors and best Cloud Data Integration vendors.
We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.