2020-08-30T08:33:35Z
Miriam Tover - PeerSpot reviewer
Service Delivery Manager at PeerSpot (formerly IT Central Station)
  • 0
  • 8

What do you like most about Talend Data Quality?

Hi Everyone,

What do you like most about Talend Data Quality?

Thanks for sharing your thoughts with the community!

3
PeerSpot user
3 Answers
Karthik Babu - PeerSpot reviewer
Senior Consultant at a tech services company with 201-500 employees
Consultant
Leaderboard
2021-12-20T08:04:00Z
Dec 20, 2021

The solution is customizable.

Search for a product comparison
SP
IT Manager at a insurance company with 10,001+ employees
Real User
Top 5Leaderboard
2021-03-05T20:22:02Z
Mar 5, 2021

It is saving a lot of time. Today, we can mask around a hundred million records in 10 minutes. Masking is one of the key pieces that is used heavily by the business and IT folks. Normally in the software development life cycle, before you project anything into the production environment, you have to test it in the test environment to make sure that when the data goes into production, it works, but these are all production files. For example, we acquired a new company or a new state for which we're going to do the entire back office, which is related to claims processing, payments, and member enrollment every year. If you get the production data and process it again, it becomes a compliance issue. Therefore, for any migrations that are happening, we have developed a new capability called pattern masking. This feature looks at those files, masks that information, and processes it through the system. With this, there is no PHI and PII element, and there is data integrity across different systems.

It has seamless integration with different databases. It has components using which you can easily integrate with different databases on the cloud or on-premise.

It is a drag and drop kind of tool. Instead of writing a lot of Java code or SQL queries, you can just drag and drop things. It is all very pictorial. It easily tells you where the job is failing. So, you can just go quickly and figure out why it is happening and then fix it.

HU
Practice Manager (Digital Solutions) at a computer software company with 201-500 employees
MSP
2020-08-30T08:33:35Z
Aug 30, 2020

The features that I find to be the most valuable are the extensibility, the integration, and the ease of integration with multiple platforms.

Find out what your peers are saying about Talend, Experian, Informatica and others in Data Quality. Updated: November 2022.
653,522 professionals have used our research since 2012.
Related Questions
Miriam Tover - PeerSpot reviewer
Service Delivery Manager at PeerSpot (formerly IT Central Station)
Mar 5, 2021
Please share with the community what you think needs improvement with Talend Data Quality. What are its weaknesses? What would you like to see changed in a future version?
See 2 answers
HU
Practice Manager (Digital Solutions) at a computer software company with 201-500 employees
Aug 30, 2020
I would say that some of the support elements need improvement. It is built on open-source technology and they provide platinum support, but they need improvement. We have a large customer base and they need more customized support from them. I would like to see more advancements with certain big data technology that they have that hasn't been added to the platform. It's something that they could add in the future.
SP
IT Manager at a insurance company with 10,001+ employees
Mar 5, 2021
They don't have any AI capabilities. Talend DQ is specifically for data quality, which only has data profiling. With Talend DQ, I cannot generate any reports today, so I need an ETL tool. It provides general Excel files, or I have to create some views. If instead of buying a new tool, Talend provides a reporting capability or solution, it would be great. It will reduce the development effort for creating these kinds of reports. We also manage the infrastructure for Talend. From the licensing perspective, for cloud, they only have seat licenses where one person is tied to one license, but for on-premise, they have concurrent licenses. It would be really awesome if they can provide concurrent licenses for the cloud so that if one person is not there, somebody else can use that license. Currently, it is not possible unless a person deactivates his or her license and moves the same seat license to someone else. We are one of the biggest customers in the central zone of the US for Talend, and this is the feedback that we have provided them again and again, but they come back and say that they aren't able to provide concurrent licenses on the cloud. In version 7.3, there is a feature for tokenization and de-tokenization of data. This is the feature that we are looking for. It is useful if somebody wants to see what we have masked and how do we demask it. This feature is not there in version 7.1. There are also a few other capabilities on the cloud, but we don't yet have a big footprint in the cloud.
Julia Frohwein - PeerSpot reviewer
Senior Director of Delivery at PeerSpot (formerly IT Central Station)
Mar 5, 2021
How do you or your organization use this solution? Please share with us so that your peers can learn from your experiences. Thank you!
See 2 answers
HU
Practice Manager (Digital Solutions) at a computer software company with 201-500 employees
Aug 30, 2020
Our use cases vary, but mainly we are using it for implementing a master data management platform. We get data from multiple sources and create a golden ticket record that can be used for ingesting the data from that single source to any of the platforms.
SP
IT Manager at a insurance company with 10,001+ employees
Mar 5, 2021
Talend has different modules. Talend has Talend Data integration (DI), Talend Data Quality (DQ), Talend MDM, and Talend Data Mapper (TDM). We have Talend DI, Talend DQ, and TDM. Our use cases span across these modules. We don't use Talend MDM because we have a different solution for MDM. Our EDF team is using an Informatica solution for that. We have a platform that deals with MongoDB, Oracle, and SQL Server databases. We also have Teradata and Kafka. The first use case was to ensure that when the data traverses from one application to another, there is no data loss. This use case was more around data reconciliation, and it was also loosely tied to the data quality. The second use case was related to data consistency. We wanted to make sure that the data is consistent across various applications. For example, we are a healthcare company. If I'm just validating the claim system, I need to see how do I inject the data into those systems without any issues. The third use case was related to whether the data is matching the configurations. For example, in production, I want to see: * If there is any data issue or duplicate data? * Is the data coming from different states getting fed into the system and matching the configurations that have been set in our different engines, such as enrollment, billing, and all those things? * Is it able to process this data with our configuration? * Is it giving the right output? The fourth use case was to see if I can virtually create data. For example, I want to test with some data that is not available in the current environment, or I'm trying to create some EDA files, which are 834 and 837 transaction files. These are the enrollment and claims processing files that come from different providers. If I want to test these files, do I have the right information within my systems, and who can give me that information. The fifth use case was related to masking the information so that in your environment, people don't have access to certain data. For example, across the industry, people pull the data from production and then just push it into the lower environment and test, but because this is healthcare data, we have a lot of PHI and PII information. If you have your PHI and PII information in production and I am pulling that data, I have everything that is in production in the test environment. So, I know your address, and I know your residents. I can hack into your systems, and I can do anything. This is the main issue for us with HIPAA compliance. How do we mask that information so that in your environment, people don't have access to it? These are different use cases on which we started our journey. Now, it is going more into the cloud, and we are using Talend to interact with various cloud environments in AWS. We are also interacting with Redshift and Snowflake by using Talend. So, it is expanding. We are using version 7.1, and we are migrating to version 7.3 very soon.
Download Free Report
Download our free Data Quality Report and find out what your peers are saying about Talend, Experian, Informatica, and more! Updated: November 2022.
DOWNLOAD NOW
653,522 professionals have used our research since 2012.