We performed a comparison between IBM InfoSphere Information Server and Talend Open Studio based on real PeerSpot user reviews.Find out what your peers are saying about Microsoft, Informatica, Oracle and others in Data Integration Tools.
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution."
"In StreamSets, everything is in one place."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"IBM InfoSphere Information Server is stable."
"The most valuable features of Talend Open Studio are customization and integration."
"Talend lets you do everything — mapping, workflow, and orchestration — in a single place."
"This product is very easy to use."
"The main differentiator that I have seen between Talend and other data integration tools is the ability to view the data pipeline in the form of a program."
"It is user-friendly and the interface is good."
"Talend is safe to use because it is very restrictive. It is easy to use when one learns how to manipulate data with SQL."
"The most valuable features are definitely data integration, data preparation, and data stewardship."
"The data integration aspect of the solution is excellent."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"IBM InfoSphere Information Server should be more scalable. It should have the option to change the configuration to run on a single, non-multiple node, or multi-threading processing."
"I think my biggest problem with the tool is that the errors are very hard to debug."
"In the next release, Open Studio should include cloud storage as an input."
"The documentation is lacking within the product. They need to get better at all aspects of describing how it works and how to use it."
"The security features could be improved."
"In terms of features, it has all the features that I need. However, it consumes a lot of resources. It is using a lot of RAM, and they need to fix the issue related to resource consumption. It currently requires more than 24 gigabytes of RAM, which is a big amount of RAM."
"Talent consumes a lot of resources on my PC."
"Talend should improve the log and error handling to better track the errors you find during development. Sometimes it's challenging to see what's causing an issue, and tracking that on Talend is complicated."
"We need more components to be more efficient. We use a lot of components, such as Salesforce, and it's not easy to use. There's are minor bugs and it's not easy to use some of the features."
StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.
Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% less breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.
With StreamSets, you can deliver the continuous data that drives the connected enterprise.
Talend Open Studio is a free, open source ETL tool for data integration and Big Data. The solution enables you to extract diverse datasets and normalize and transform them into a consistent format which can be loaded into a number of third-party databases and applications.
Talend Open Studio Features
Talend Open Studio has many valuable key features. Some of the most useful ones include:
Talend Open Studio Benefits
There are several benefits to implementing Talend Open Studio. Some of the biggest advantages the solution offers include:
Reviews from Real Users
Below are some reviews and helpful feedback written by PeerSpot users currently using the Talend Open Studio solution.
Elio B., Data Integration Specialist/CTO at Asset messages, says, "The solution has a good balance between automated items and the ability for a developer to integrate and extend what he needs. Other competing tools do not offer the same grade of flexibility when you need to go beyond what is provided by the tool. Talend, on the other hand, allows you to expand very easily."
A Practice Head, Analytics at a tech services company mentions, “The data integration aspect of the solution is excellent. The product's data preparation features are very good. There's very useful data stewardship within the product. From a technical standpoint, the solution itself is pretty good. There are very good pre-built connectors in Talend, which is good for many clients or businesses, as, in most cases, companies are dealing with multiple data sources from multiple technologies. That is where a tool like Talend is extremely helpful.”
Prerna T., Senior System Executive at a tech services company, comments, “The best thing I have found with Talend Open Studio is their major support for the lookups. With Salesforce, when we want to relate our child objects to their parent object, we need to create them via IDs. Then the upsert operation, which will allow you to relate a child object to the event, will have an external ID. That is the best thing which keeps it very sorted. I like that.”
An Implementation Specialist, Individual Contributor at a computer software company, states, “I can connect with different databases such as Oracle Database or SQL Server. It allows you to extract the data from one database to another. I can structure the data by filtering and mapping the fields.” He also adds, “It is very user-friendly. You need to know the basics of SQL development or SQL queries, and you can use this tool.”
PeerSpot user Badrakh V., Information System Architect at Astvision, explains, "The most valuable features are the ETL tools."
IBM InfoSphere Information Server is ranked 40th in Data Integration Tools with 1 review while Talend Open Studio is ranked 5th in Data Integration Tools with 15 reviews. IBM InfoSphere Information Server is rated 7.0, while Talend Open Studio is rated 7.6. The top reviewer of IBM InfoSphere Information Server writes "Prompt support, reliable, but lacking scalability". On the other hand, the top reviewer of Talend Open Studio writes "Popular open-source ETL tool in Singapore/ South East Asia; Connects to Big Data; Easy to spot errors with generated Java code". IBM InfoSphere Information Server is most compared with IBM InfoSphere DataStage, IBM Cloud Pak for Data, Oracle GoldenGate, Qlik Replicate and IBM Watson Knowledge Catalog, whereas Talend Open Studio is most compared with SSIS, AWS Glue, Azure Data Factory, IBM InfoSphere DataStage and Talend Data Fabric.
We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.