
Confluent vs IBM InfoSphere DataStage comparison

 

Comparison Buyer's Guide

Executive Summary


Categories and Ranking

Confluent
Average Rating: 8.2
Reviews Sentiment: 6.7
Number of Reviews: 23
Ranking in other categories: Streaming Analytics (4th)

IBM InfoSphere DataStage
Average Rating: 7.8
Reviews Sentiment: 6.8
Number of Reviews: 42
Ranking in other categories: Data Integration (6th)
 

Mindshare comparison

Confluent and IBM InfoSphere DataStage aren’t in the same category and serve different purposes. Confluent is designed for Streaming Analytics and holds a mindshare of 8.2%, down 11.2% compared to last year.
IBM InfoSphere DataStage, on the other hand, focuses on Data Integration and holds a 5.2% mindshare, down 5.5% since last year.
 

Featured Reviews

Gustavo-Barbosa Dos Santos - PeerSpot reviewer
Has good technical support services and a valuable feature for real-time data streaming
Implementing Confluent's schema registry has significantly enhanced our organization's data quality assurance. It helps us understand the various requirements of multiple customers and validates the information for different versions. We can automate the tasks using Confluent Kafka. Thus, it guarantees us the data quality and maintains the integrity of message contracts.
Rahul Saxena - PeerSpot reviewer
A helpful and cost-effective tool that performs well and is very easy to use
I deal with companies from the healthcare industry. The solutions are largely cloud-based. In data-rich industries like telecom or BFSI, such tools are extensively used. Healthcare also has a lot of data. I will encourage people to use the solution. It is quite an easy tool. Every stage has a help guide, and the documentation is extensive. We can understand the purpose of a stage, how the connection has to be set up, how to set up a username and password, and whom we should contact. New users must start using the tool and explore it. They might have to invest ten days or two weeks to understand the workflows and options. It is easy to learn. My company is a partner with IBM. Overall, I rate the product a nine out of ten.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

Confluent
"With Confluent Cloud we no longer need to handle the infrastructure and the plumbing, which is a concern for Confluent. The other advantage is that all portfolios have access to the data that is being shared."
"Their tech support is amazing; they are very good, both on and off-site."
"We ensure seamless management of Kafka through Confluent, allowing all of our Kafka activities to be handled by a third party."
"A person with a good IT background and HTML will not have any trouble with Confluent."
"Kafka Connect framework is valuable for connecting to the various source systems where code doesn't need to be written."
"The most valuable feature of Confluent is the wide range of features provided. They're leading the market in this category."
"Our main goal is to validate whether we can build a scalable and cost-efficient way to replicate data from these various sources."
"The most valuable is its capability to enhance the documentation process, particularly when creating software documentation."

IBM InfoSphere DataStage
"The data lineage report can be filtered for reporting. The reports are user-friendly and take less time to find what you need."
"I am impressed with the tool's ETL tracing."
"The concept of integration is a valuable feature of the product."
"The solution is stable."
"The product is easy to deploy."
"Highly customizable: Allowing you to handle multiple data latencies (scheduled batch, on-demand, and real-time) in the same job."
"IBM is stable and accurate to monitor. It's easy to understand to monitor the data lineage from source to target."
"It's useful for reporting and selecting different extract files."
 

Cons

Confluent
"There is no local support team in Saudi Arabia."
"Confluent has a good monitoring tool, but it's not customizable."
"One area we've identified that could be improved is the governance and access control to the Kafka topics. We've found some limitations, like a threshold of 10,000 rules per cluster, that make it challenging to manage access at scale if we have many different data sources."
"I am not very impressed by Confluent. We continuously face issues, such as Kafka being down and slow responses from the support team."
"They should remove Zookeeper because of security issues."
"It could have more integration with different platforms."
"Areas for improvement include implementing multi-storage support to differentiate between database stores based on data age and optimizing storage costs."
"It would help if the knowledge based documents in the support portal could be available for public use as well."

IBM InfoSphere DataStage
"There are three things that could improve - the cloud, monitoring and cloud integration. It's a solid product but not a modern one and of course it depends what you're looking for."
"It would be great if they can include some basic version of data quality checking features."
"The interface needs improvement. It is really too technical. That is the main problem."
"It would be useful to provide support for Python, AR, and Java."
"The solution can be a bit more user-friendly, similar to Informatica."
"So, there are some features that are missing. If I compare DataStage to Talend, Talend allows you to write custom code in Java or use these tools in your applications as well if you are building a job application. But in DataStage, it does not allow you to write custom code for any component."
"The graphical user interface (GUI) feels a lot like the interfaces from the 1980s."
"DataStage is quite expensive. It is too hard to find a consultant using DataStage in Turkey."
 

Pricing and Cost Advice

Confluent
"Confluent is expensive, I would prefer, Apache Kafka over Confluent because of the high cost of maintenance."
"Confluence's pricing is quite reasonable, with a cost of around $10 per user that decreases as the number of users increases. Additionally, it's worth noting that for teams of up to 10 users, the solution is completely free."
"On a scale from one to ten, where one is low pricing and ten is high pricing, I would rate Confluent's pricing at five. I have not encountered any additional costs."
"Confluent has a yearly license, which is a bit high because it's on a per-user basis."
"The pricing model of Confluent could improve because if you have a classic use case where you're going to use all the features there is no plan to reduce the features. You should be able to pick and choose basic services at a reduced price. The pricing was high for our needs. We should not have to pay for features we do not use."
"You have to pay additional for one or two features."
"It comes with a high cost."
"Confluent is an expensive solution as we went for a three contract and it was very costly for us."

IBM InfoSphere DataStage
"I have no information on the exact pricing for IBM InfoSphere DataStage because the solution is usually procured by the clients my company works with, though the pricing is higher compared to other solutions, so many clients choose to go with a different solution rather than IBM InfoSphere DataStage."
"The cost is too high."
"It is quite expensive."
"Small and medium-sized companies cannot afford to pay for this solution."
"The solution is cheap."
"Our internal team takes care of group licensing and cost. We don't have individual licenses. We have group licensing at the company level. Usually, IBM doesn't charge anything separately on the licensing side. For storage and everything else, we are paying around $6,000 per month, which is not very high. It includes Linux data storage, execution, and licensing. They're charging $40 for one-hour execution. Based on that, we are spending around $2,000 on the production environment and $1,000 on the lower environment for testing and development-side executions. For the mainframe, we are using the Db2 mainframe database, and we are spending around $1,000 on the Db2 mainframe database as well. All this comes out to be around $6,000. We, however, would like to have some cost reduction."
"High-cost of ownership: They could take a page from open source software."
"It's very expensive."
852,780 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews

Confluent
Financial Services Firm: 19%
Computer Software Company: 16%
Manufacturing Company: 6%
Insurance Company: 6%

IBM InfoSphere DataStage
Financial Services Firm: 27%
Computer Software Company: 11%
Manufacturing Company: 9%
Government: 8%
 

Company Size

[Chart: company size by reviewers, split into Large Enterprise, Midsize Enterprise, and Small Business]
 

Questions from the Community

What do you like most about Confluent?
I find Confluent's Kafka Connectors and Kafka Streams invaluable for my use cases because they simplify real-time data processing and ETL tasks by providing reliable, pre-packaged connectors and to...
What is your experience regarding pricing and costs for Confluent?
They charge a lot for scaling, which makes it expensive.
What needs improvement with Confluent?
I am not very impressed by Confluent. We continuously face issues, such as Kafka being down and slow responses from the support team. The lack of easy access to the Confluent support team is also a...
Would you upgrade to more premium versions of IBM InfoSphere DataStage?
My company currently uses the free version of the product, and we are definitely switching to a paid one. We needed a tool that can help us not only integrate our data but use it effectively. For ...
Is IBM InfoSphere DataStage more difficult to use compared to other tools in the field?
I think the tool may cause some difficulties if you have not used other data integration solutions before. I have worked at companies that used different tools for data integration, and they work ...
Do you rely on IBM Cloud Paks for your data? Have you utilized this product, or do you use IBM InfoSphere DataStage without it?
IBM Cloud Paks makes a big difference in your data integration. My company has been using it alongside IBM InfoSphere DataStage and while the main product is good on its own, this one truly expands...
 

Sample Customers

Confluent: ING, Priceline.com, Nordea, Target, RBC, Tivo, Capital One, Chartboost
IBM InfoSphere DataStage: Dubai Statistics Center, Etisalat Egypt
Find out what your peers are saying about Confluent vs. IBM InfoSphere DataStage and other solutions. Updated: April 2025.