We performed a comparison between Informatica PowerCenter and Matillion ETL based on real PeerSpot user reviews.
Find out in this report how the two Data Integration Tools solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"In StreamSets, everything is in one place."
"It is really easy to set up and the interface is easy to use."
"The most valuable features are the monitoring tools and the reporting manager."
"I like the automated scheduling feature."
"Deployment was simple and straightforward."
"The most valuable features of Informatica PowerCenter are the ease of use, and development, and is simple to find resources."
"It has a Data Catalog that uses the Model repository."
"It is easy to use, and it is quick for developing things. It is fairly powerful, and it can integrate with a lot of different platforms without much hassle."
"It's very easy to use it to develop mappings and workflows."
"Can manage a huge quantity of data and provide reliability."
"Matillion ETL is one hundred percent stable."
"The simplicity of this tool is nice. It has a good graphical user interface. You can also do a lot of generic stuff in the tool. If there is good connectivity to a cloud database, such as Snowflake, and you can have a lot of Snowflake functionality in the tool."
"The loading of data is the most valuable feature of Matillion ETL."
"Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"What I didn't like about it is that the platform itself is not great at distributed processing. When you need high parallel processing, it has some inherent issues. We had to use Java transformation, and it did not go very well. I have heard that it is going to the cloud, but we haven't tried that."
"The UI is a little outdated."
"Informatica PowerCenter could improve on the documentation for the implementation. The documents provided are not very good for a new user."
"Informatica PowerCenter is outdated and would benefit from modernization. They should have a very good migration strategy from Informatica PowerCenter to AACF. Informatica PowerCenter there is no point in using it, you have to use a cloud version."
"It would be nice to have all tools in one place. CDC needs more effort, as it's only easy to develop if you are familiar with Linux."
"The pricing could be improved."
"PowerCenter could be improved by having more big data components. Normally, we prefer Informatica as a relational database, but nowadays, companies are trying to understand and use big data components. I think it would be useful if we had more chances to create a hub ecosystem because customers try to use some data integration tasks by SQL, Spark and Spark codes, and Scala, but at the end of the day, the company will understand that we need to trace all the steps. An ETL tool is a must for that company, if we're talking about the regulated industries like finance, telcos, etc. If Informatica's biggest ecosystems feature were okay, I would prefer to use it."
"In terms of performance improvement and tuning, there should be a bit more guidance and documentation."
"There are certain functions that are available in other ETL tools which are still not present in Matillion ETL. It would be good to have more features."
"It can have multi-environment support. We should be able to deploy it in different environments. Its integration with SAP connection is not so nice, which should be improved. It can also support an on-prem database."
"I am looking forward to seeing the expansion of the source range for their data loader product."
StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.
Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% fewer breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.
With StreamSets, you can deliver the continuous data that drives the connected enterprise.
Enterprise data integration platform to help organizations access, transform, and integrate data from a variety of systems.
Matillion ETL for Redshift is a fast, modern, easy-to-use and powerful ETL/ELT tool that makes it simple and productive to load and transform data on Amazon Redshift. It is 100x faster than traditional ETL technology and up and running in under 5 minutes, with prices from $1.37 per hour and no commitments or up-front costs.
Informatica PowerCenter is ranked 2nd in Data Integration Tools with 35 reviews while Matillion ETL is ranked 9th in Cloud Data Integration with 3 reviews. Informatica PowerCenter is rated 7.8, while Matillion ETL is rated 8.4. The top reviewer of Informatica PowerCenter writes "A stable, scalable, and mature solution for complex transformations and data integration". On the other hand, the top reviewer of Matillion ETL writes "A stable and scalable solution that allows you to do a lot of generic stuff and comes with good GUI and support". Informatica PowerCenter is most compared with Informatica Cloud Data Integration, Azure Data Factory, SSIS, AWS Glue and Talend Open Studio, whereas Matillion ETL is most compared with Azure Data Factory, AWS Glue, Informatica Cloud Data Integration, Talend Open Studio and SSIS. See our Informatica PowerCenter vs. Matillion ETL report.
We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.