Try our new research platform with insights from 80,000+ expert users

SAP Data Hub vs Talend Open Studio comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

SAP Data Hub
Average Rating
7.6
Reviews Sentiment
6.8
Number of Reviews
3
Ranking in other categories
Data Governance (29th), Metadata Management (10th)
Talend Open Studio
Average Rating
8.0
Reviews Sentiment
6.8
Number of Reviews
50
Ranking in other categories
Data Integration (5th)
 

Featured Reviews

VM
The solution is seamless, but the database sometimes leads to confusion
We used to have multiple different kinds of databases, which internally, had different compliance levels. Retention management is very different now. If the policy is live and the claim has been completed, I couldn't archive the claim. I needed to keep a reference integrity of that claim and understand which policy paid out the claim. With this solution, the policy came in six months ago and qualified for archiving. The claim had been paid and in every environment, the claim had been closed, including the reporting system, the claims system, etc. With the payment set gateway, I can just go and archive. But, we had a hard time during this process. I rate the overall solution a seven out of ten.
Costin Marzea - PeerSpot reviewer
Allows you to develop your own components and can be used as an OEM
Sometimes, scalability is part of planning. It depends on what you mean by scalability. People talk a lot about it, but scalability is not always about system functionality. Sometimes, it may be planning the job you're doing. If you want to split it into several jobs or servers, you don't actually have to have it built in as a functionality. You can create a job using a loop, which runs and controls several jobs in a loop that may be controlled. Scaling should not always be part of the infrastructure based on whether the engine can scale or not. I think it's your plan or project that should scale and split, and you can define these parameters. These parameters include how many servers you want to run or how many executions you want to do on different parts of the data. It's not always an issue of the engine running. Sometimes, your database should be configured to support partitioning. The product may scale very well without partitioning, but if the basic response is very slow, you didn't solve the problem. You should solve the problems at a higher level, not just at the execution level. They should be solved at the database level and communication level, and you should have firewalls. We are trying to add to the open source the ability to generate code for containers and Kubernetes that exist in the subscription version. Once you do this, Kubernetes will take care of the scaling, so there is no problem.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is the S/4HANA 1909 On-Premise"
"Its connection to on-premise products is the most valuable. We mostly use the on-premise connection, which is seamless. This is what we prefer in this solution over other solutions. We are using it the most for the orchestration where the data is coming from different categories. Its other features are very much similar to what they are giving us in open source. Their push-down approach is the most advantageous, where they push most of the processing on to the same data source. This means that they have a serverless kind of thing, and they don't process the data inside a product such as Data Hub. They process the data from where the data is coming out. If it is coming from HANA, to capture the data or process it for analytics, orchestration, or management, they go to the HANA database and give it out. They don't process it on Data Hub. This push-down approach increases the processing speed a little bit because the data is processed where it is sitting. That's the best part and an advantage. I have used another product where they used to capture the data first and then they used to process it and give it. In Data Hub, it is in reverse. They process it first and give it, and then they put their own manipulations. They lead in terms of business functions. No other solution has business functions already implemented to perform business analysis. They have a lot of prebuilt business functions for machine learning and orchestration, which we can use directly to get an analysis out from the existing data. Most of the data is sitting as enterprise data there. That's a major advantage that they have."
"SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database."
"It is easy to use and covers most of the functions needed. We can use the code without any extra effort. The open source is very good. They have the same commercials with additional connectors. The graphical design environment is also very easy."
"It has got so many connectors. It is intuitive and easy to use."
"Talend Open Studio's installation process is easy. One just needs to install Java before installing the product"
"The initial setup of the product was very easy."
"The most valuable feature for me when it comes to this solution is that it's easy to use."
"The Talend Studio connected to the Talend MDM (Master Data Management) is the most valuable feature. Talend Studio is used to create a job stream that connects to multiple data sources, matches, compares or creates a golden record for overall identification. It also has a good catalogue of objects that can be dragged and dropped for building models."
"The product's initial setup phase was easy."
"The data integration aspect of the solution is excellent."
 

Cons

"The company has everything offshore."
"In 2018, connecting it to outside sources, such as IoT products or IoT-enabled big data Hadoop, was a little complex. It was not smooth at the beginning. It was unstable. It took a lot of time for the initial data load. Sometimes, the connection broke, and we had to restart the process, which was a major issue, but they might have improved it now. It is very smooth with SAP HANA on-premise system, SAP Cloud Platform, and SAP Analytics Cloud. It could be because these are their own products, and they know how to integrate them. With Hadoop, they might have used open-source technologies, and that's why it was breaking at that time. They are providing less embedded integration because they want us to use their other products. For example, they don't want to go and remove SAP Analytics Cloud and put everything in Data Hub. They want us to use SAP Analytics Cloud somewhere else and not inside the Data Hub. On the integration part, it lacks real-time analytics, and it is slow. They should embed the SAP Analytics Cloud inside Data Hub or support some kind of analysis. They do provide some analysis, but it is not extensive. They are moreover open source. So, we need a lot of developers or data scientists to go in and implement Python algorithms. It would be better if they can provide their own existing algorithms and give some connections and drop-down menus to go and just configure those. It will make things really quick by increasing the embedded integrations. It will also improve the process efficiency and processing power. Its performance needs improvement. It is a little slow. It is not the best in the market, and there are other products that are much better than this. In terms of technology and performance, it is a little slow as compared to Microsoft and other data orchestration products. I haven't used other products, but I have read about those products, their settings, and the milliseconds that they do. In Azure Purview, they say that they can copy, manage, or transform the data within milliseconds. They say that they can transform 100 gigabytes of data within three to five seconds, which is something SAP cannot do. It generally takes a lot of time to process that much amount of data. However, I have never tested out Azure."
"Nowadays there are some inconsistencies in data bases, however, they upgrade and release the versions to market."
"I think my biggest problem with the tool is that the errors are very hard to debug."
"It is not as visually appealing as some of the other tools."
"The product could be more intuitive."
"There used to be many Youtube channels that offered Talend training, but now there don't seem to be any. The solution should offer more online training resources."
"The server-side should be completely revamped."
"he overall marketing and documentation of this product need to improve a lot."
"The stability of the solution could improve when running jobs. There can be errors when running projects but in the end, it works well and the errors do not impact the result."
"In version 6.2 we did encounter issues with the job servers and specifically with ESB. Version 6.3 is better but large jobs can cause the MDM server to fall over, requiring a reboot."
 

Pricing and Cost Advice

"The Cloud is very expensive, but SAP HANA previous service is okay."
"It is an open-source tool which means it is a free solution."
"We are using the free version of the tool, because the enterprise version is a little expensive."
"Open Studio has a basic license and additional costs for services, including customer support and technical assistance."
"Talend Open Studio is priced too high."
"Pricing is always a challenge. It is quite an expensive model, but because the platform is so simple to use, we haven't had to purchase any additional licenses."
"The solution will be more expensive if you have a low data volume and a large number of developers."
"It is an open-source product."
"I am using the open-source version of the solution, so there are no extra costs for any feature."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
863,429 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Manufacturing Company
14%
Financial Services Firm
13%
Government
11%
Computer Software Company
11%
Financial Services Firm
16%
Computer Software Company
12%
Manufacturing Company
8%
Government
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about SAP Data Hub?
SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database.
What needs improvement with SAP Data Hub?
We moved from Oracle. If you're aware of your monitoring system, the RPU market, and the managed system, you should move to HANA, which is an innovative database built by SAP itself. However, this ...
What is your primary use case for SAP Data Hub?
I technically handle the database, like cycle management projects. When transaction data comes in, we see it based on the retention periods. We have to move the data to some secure storage rather t...
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What do you like most about Talend Open Studio?
It is easy to use and covers most of the functions needed. We can use the code without any extra effort. The open source is very good. They have the same commercials with additional connectors. The...
 

Also Known As

No data available
Open Studio
 

Overview

 

Sample Customers

Kaeser Kompressoren, HARTMANN
Almerys, BF&M, Findus
Find out what your peers are saying about Amazon Web Services (AWS), Informatica, Salesforce and others in Cloud Data Integration. Updated: July 2025.
863,429 professionals have used our research since 2012.