No more typing reviews! Try our Samantha, our new voice AI agent.

StreamSets vs dbt comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

dbt
Ranking in Data Integration
9th
Average Rating
8.0
Reviews Sentiment
6.6
Number of Reviews
10
Ranking in other categories
Data Quality (5th)
StreamSets
Ranking in Data Integration
23rd
Average Rating
8.4
Reviews Sentiment
7.0
Number of Reviews
21
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of May 2026, in the Data Integration category, the mindshare of dbt is 1.4%, down from 1.5% compared to the previous year. The mindshare of StreamSets is 1.2%, down from 1.6% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Mindshare Distribution
ProductMindshare (%)
dbt1.4%
StreamSets1.2%
Other97.4%
Data Integration
 

Featured Reviews

Harshwardhan Gullapalli - PeerSpot reviewer
AI Engineer at a educational organization with 51-200 employees
Data pipelines have improved financial accuracy and now build transparent audit-ready reports
As for something I wish we had, dbt's native support for Python transformations came later, and we did some complex financial classification calculations that felt clunky in pure SQL. We ended up writing Python in our n8n workflows and then fed the results back into dbt, which created a bit of a split-brain situation. If we would have had dbt Python models earlier, we could have kept that logic unified. Managing multiple reporting standards was our biggest operational pain point with dbt. We were running UAE corporate tax compliance and IFRS disclosure workflows simultaneously for different clients, and dbt does not have a native concept of multi-tenant or multi-standard project organization. Everything lives in one flat structure, so we had to build more conventions: separate schema folders for IFRS models versus UACT models, custom macros to tag models by compliance regime, and environment variables to control which set of transformations run for which client.
SS
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
Enables effective batch loading with visual interface and enterprise support
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure. I had to switch to a new EC2 box, even though the processor was not fully utilized. It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades. Additionally, it would be a great enhancement if StreamSets could produce a lineage graph to visualize how the data has passed through the system.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It is very convenient because at the end, I have the opportunity to orchestrate all my transformations in just one single place, rather than having them spread out."
"The most concrete outcome was a significant reduction in data errors reaching our downstream AI models, and after implementing dbt's testing layer, we caught roughly 70% of those issues at the transformation stage itself, before they ever touched the model."
"I would say the best feature or the most desirable feature for dbt is the ability to write everything in code."
"Since we migrated from SSIS to dbt model architecture, it takes around four hours only to complete a full refresh, and the client is now happy because our downtime was drastically reduced when we perform a complete refresh of the data."
"From a developer point of view, I find the ease of development and the code to be the most useful capabilities of dbt."
"Overall, I find dbt to be optimized compared to other tools."
"There is operational efficiency achieved, and data quality and governance have also been achieved with modular SQL and version controlling, which reduced duplication of data and data errors."
"dbt has positively impacted my organization by allowing us to create our data pipelines much faster, going from ingestion of data to creating a data product in weeks instead of months, and we can do it in-house with the skillset we already have."
"The Ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too"
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages, completing processes that previously took approximately an hour to an hour and a half with Hadoop in just 15 minutes and saving us around 45 minutes per data pipeline or table that we migrate."
"StreamSets Transformer is a good feature because it helps you when you are developing applications and when you don't want to write a lot of code. That is the best feature overall."
"StreamSets has definitely helped us in getting the information into our data lake very quickly, in terms of ingestion, and the most important thing is it has helped us from a resourcing point of view because you can easily upskill a BI or ETL resource without any programming knowledge to work with this, which has drastically reduced the time that we are spending on workloads by 60% to 70% as well as reducing the time spent on ingestion by 30%."
"It has helped us in moving into the data analytics world at a faster pace than any other tool would have done."
"The most valuable features are the option of integration with a variety of protocols, languages, and origins."
"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
 

Cons

"If you compare the cost of those packages with dbt alone, it is more expensive to use dbt alone."
"dbt can be improved as I find the co-pilot in dbt is not very good, and my team has tried using it but opted to move off it and use other co-pilots such as GitHub."
"The initial setup of dbt is somewhat complex."
"Every upgrade is a little bit of a risk for us because we do not know if the workarounds that we developed will be available for the next version."
"If I needed to name a few areas for improvement, I would mention the migration of code to Git and GitHub, which sometimes fails and can be confusing for developers during handover."
"Since dbt has a license cost, if a company is small and does not have much budget, they can explore other tools because there are other tools that provide the same functionality at a lower cost."
"The solution must add more Python-based implementations."
"Managing multiple reporting standards was our biggest operational pain point with dbt."
"StreamSet works great for batch processing but we are looking for something that is more real-time. We need latency in numbers below milliseconds."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"One area for improvement could be the cloud storage server speed, as we have faced some latency issues here and there."
"The documentation is inadequate and has room for improvement because the technical support does not regularly update their documentation or the knowledge base."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"In terms of the product, I don't think there is any room for improvement because it is very good. One small area of improvement that is very much needed is on the knowledge base side. Sometimes, it is not very clear how to set up a certain process or a certain node for a person who's using the platform for the first time."
"The logging mechanism could be improved. Now, it is a bit difficult to understand and filter the logs."
 

Pricing and Cost Advice

"The solution’s pricing is affordable."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"The pricing is too fixed. It should be based on how much data you need to process. Some businesses are not so big that they process a lot of data."
"I believe the pricing is not equitable."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"StreamSets is an expensive solution."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"The licensing is expensive, and there are other costs involved too. I know from using the software that you have to buy new features whenever there are new updates, which I don't really like. But initially, it was very good."
"The pricing is affordable for any business."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
893,221 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
Insurance Company
8%
Manufacturing Company
8%
Comms Service Provider
7%
Financial Services Firm
11%
Insurance Company
8%
Manufacturing Company
7%
Computer Software Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business2
Midsize Enterprise3
Large Enterprise5
By reviewers
Company SizeCount
Small Business9
Midsize Enterprise2
Large Enterprise11
 

Questions from the Community

What is your experience regarding pricing and costs for dbt?
I mentioned the cost as one of the advantages, specifically the license cost.
What needs improvement with dbt?
With AI, everything is advancing so fast, so I would say that the most important thing is to try to integrate with more platforms. As of now, dbt has a strong integration with AWS and with Snowflak...
What is your primary use case for dbt?
I am currently working with dbt and use dbt's modular SQL models.
What needs improvement with StreamSets?
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infr...
What is your primary use case for StreamSets?
We are using StreamSets for batch loading.
What advice do you have for others considering StreamSets?
If asked, I definitely recommend StreamSets to other users. My overall rating for the solution is nine.
 

Overview

 

Sample Customers

Information Not Available
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about StreamSets vs. dbt and other solutions. Updated: April 2026.
893,221 professionals have used our research since 2012.