"It is really easy to set up and the interface is easy to use."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"In StreamSets, everything is in one place."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"It provides everything I need. Nothing is missing. PowerCenter is a good tool for on-premise databases."
"Good interface, reasonable documentation."
"What I like the most is that we have to deal with less while writing the queries."
"I like the automated scheduling feature."
"Informatica PowerCenter is very good for integrating a huge amount of data in a very short duration, such as a minute. It is also very easy to use. After you provide the source and the target, mappings are automatically done, which makes it easy to use for the development team."
"The interface is very clean and clear."
"We use Informatica PowerCenter to transfer the transitional database to and from the data warehouse. This is very efficient as it enables us to quickly find our data reports and the data, so we can build AI models."
"The most valuable features are the metadata repository and the data warehouse application console."
"The solution offers very good end-to-end capabilities."
"The security is also excellent. It's highly granular, so the admins have a high degree of control, and there are many levels of security. That worked well. You won't have an EDC unless you put everything onto the platform because it is its own isolated thing."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"We need another tool for monitoring. It would be easier if all the features were consolidated into one tool."
"Its scalability can be improved. It is not scalable."
"PowerCenter could be improved by having more big data components. Normally, we prefer Informatica as a relational database, but nowadays, companies are trying to understand and use big data components. I think it would be useful if we had more chances to create a hub ecosystem because customers try to use some data integration tasks by SQL, Spark and Spark codes, and Scala, but at the end of the day, the company will understand that we need to trace all the steps. An ETL tool is a must for that company, if we're talking about the regulated industries like finance, telcos, etc. If Informatica's biggest ecosystems feature were okay, I would prefer to use it."
"What I didn't like about it is that the platform itself is not great at distributed processing. When you need high parallel processing, it has some inherent issues. We had to use Java transformation, and it did not go very well. I have heard that it is going to the cloud, but we haven't tried that."
"The solution can improve by providing more connectivity by having native ODBC or JDBC connections available. It will be easier and more people could start using it."
"Requires an established data center because there is no option for software as a service."
"Informatica PowerCenter could improve on the documentation for the implementation. The documents provided are not very good for a new user."
"PowerCenter has three clients. I wish they would consolidate everything into one GUI, not three. Also, we had a persistent issue with the Informatica Developer tool but it was solved when we migrated to the newest one."
"The workflow could be improved."
"The data lineage was challenging. It's hard to track data from the sources as it moves through stages. Informatica EDC can easily capture and report it because it talks to the metadata. This is generated across those various staging points."
StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.
Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% less breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.
With StreamSets, you can deliver the continuous data that drives the connected enterprise.
Enterprise data integration platform to help organizations access, transform, and integrate data from a variety of systems.
Palantir Foundry is an enterprise data management platform offering comprehensive tooling for working with big data. Because it is an operating system made for modern enterprises, it is highly available and a continuously updated platform.
Palantir Foundry is a fully managed SaaS platform that spans from cloud hosting and data integration to flexible analytics, visualization, model-building, operational decision-making, and decision capture. It equips technical and non-technical users to make data-driven operational decisions.
Palantir Foundry includes tools to integrate data of any scale, format, or structure, and also has granular, flexible access controls for individual datasets. In addition, it has an open, modular architecture with multiple RESTful APIs, it has native applications for developing machine learning and artificial intelligence, it provides sophisticated data science applications for users of all technical abilities, and much more.
Palantir Foundry Features
The most valuable Palantir Foundry features include:
Security, flexibility, interoperability, easy deployment, built-in role classification, purpose-based access controls, interoperable architecture, model integration, AI modeling tools, ontology, custom workflows, team-specific applications, self-serve analytics, lineage system, operational application building, 200+ data connectors, data versioning, change management framework, sand decision orchestration, and custom dashboard and report building tools.
With Palantir Foundry You Can:
Palantir Foundry Benefits
Some of the many Palantir Foundry benefits include:
Reviews from Real Users
PeerSpot users like Palantir Foundry because it has many advantages:
“It is user-friendly, good automation, and allows you to do a better job of data governance.” - Associate, Inhouse Consulting at a pharma/biotech company
“Works seamlessly with good end-to-end capabilities and the capability to scale.” - Wallace H., Sr. Director at a tech services company
Informatica PowerCenter is ranked 1st in Data Integration Tools with 32 reviews while Palantir Foundry is ranked 17th in Data Integration Tools with 2 reviews. Informatica PowerCenter is rated 8.0, while Palantir Foundry is rated 8.6. The top reviewer of Informatica PowerCenter writes "A stable, scalable, and mature solution for complex transformations and data integration". On the other hand, the top reviewer of Palantir Foundry writes "The data visualization is fantastic and the security is excellent". Informatica PowerCenter is most compared with Informatica Cloud Data Integration, Azure Data Factory, SSIS, AWS Glue and Informatica PowerExchange, whereas Palantir Foundry is most compared with Azure Data Factory, Palantir Gotham, SAP Data Services, Alteryx Designer and Informatica Enterprise Data Catalog.
See our list of best Data Integration Tools vendors.
We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.