Coming October 25: PeerSpot Awards will be announced! Learn more
2018-03-06T07:53:00Z
Julia Frohwein - PeerSpot reviewer
Senior Director of Delivery at PeerSpot (formerly IT Central Station)
  • 1
  • 15

What is your primary use case for Talend Data Quality?

How do you or your organization use this solution?

Please share with us so that your peers can learn from your experiences.

Thank you!

3
PeerSpot user
3 Answers
SP
IT Manager at a insurance company with 10,001+ employees
Real User
Top 5Leaderboard
2021-06-03T14:05:28Z
03 June 21

Talend DQ module is more focused around data profiling, so we have some teams in our organization from data governance who use it to profile the data to find anomalies in the data.

HU
Practice Manager (Digital Solutions) at a computer software company with 201-500 employees
MSP
2020-08-30T08:33:35Z
30 August 20

Our use cases vary, but mainly we are using it for implementing a master data management platform. We get data from multiple sources and create a golden ticket record that can be used for ingesting the data from that single source to any of the platforms.

Jyoti Wilson - PeerSpot reviewer
ETL/SQL Developer
Real User
Leaderboard
2018-03-06T07:53:00Z
06 March 18

We have a legacy system (Wins + DB2), which stores all our data. For reporting purposes (from SQL), we need to analyze data. We use it for making decisions, for example, if we want to display data elements in our reports based on if a column ever gets a value entered by user or what are distinct values that we are receiving for transformation purposes. We use it to check patterns, like zip code, state codes, and phone numbers. We also check data value frequency for business decision in mapping from one system to another.

Find out what your peers are saying about Talend, Experian, Informatica and others in Data Quality. Updated: September 2022.
635,987 professionals have used our research since 2012.
Related Questions
Julia Frohwein - PeerSpot reviewer
Senior Director of Delivery at PeerSpot (formerly IT Central Station)
Mar 05, 2021
How do you or your organization use this solution? Please share with us so that your peers can learn from your experiences. Thank you!
See 2 answers
HU
Practice Manager (Digital Solutions) at a computer software company with 201-500 employees
30 August 20
Our use cases vary, but mainly we are using it for implementing a master data management platform. We get data from multiple sources and create a golden ticket record that can be used for ingesting the data from that single source to any of the platforms.
SP
IT Manager at a insurance company with 10,001+ employees
05 March 21
Talend has different modules. Talend has Talend Data integration (DI), Talend Data Quality (DQ), Talend MDM, and Talend Data Mapper (TDM). We have Talend DI, Talend DQ, and TDM. Our use cases span across these modules. We don't use Talend MDM because we have a different solution for MDM. Our EDF team is using an Informatica solution for that. We have a platform that deals with MongoDB, Oracle, and SQL Server databases. We also have Teradata and Kafka. The first use case was to ensure that when the data traverses from one application to another, there is no data loss. This use case was more around data reconciliation, and it was also loosely tied to the data quality. The second use case was related to data consistency. We wanted to make sure that the data is consistent across various applications. For example, we are a healthcare company. If I'm just validating the claim system, I need to see how do I inject the data into those systems without any issues. The third use case was related to whether the data is matching the configurations. For example, in production, I want to see: * If there is any data issue or duplicate data? * Is the data coming from different states getting fed into the system and matching the configurations that have been set in our different engines, such as enrollment, billing, and all those things? * Is it able to process this data with our configuration? * Is it giving the right output? The fourth use case was to see if I can virtually create data. For example, I want to test with some data that is not available in the current environment, or I'm trying to create some EDA files, which are 834 and 837 transaction files. These are the enrollment and claims processing files that come from different providers. If I want to test these files, do I have the right information within my systems, and who can give me that information. The fifth use case was related to masking the information so that in your environment, people don't have access to certain data. For example, across the industry, people pull the data from production and then just push it into the lower environment and test, but because this is healthcare data, we have a lot of PHI and PII information. If you have your PHI and PII information in production and I am pulling that data, I have everything that is in production in the test environment. So, I know your address, and I know your residents. I can hack into your systems, and I can do anything. This is the main issue for us with HIPAA compliance. How do we mask that information so that in your environment, people don't have access to it? These are different use cases on which we started our journey. Now, it is going more into the cloud, and we are using Talend to interact with various cloud environments in AWS. We are also interacting with Redshift and Snowflake by using Talend. So, it is expanding. We are using version 7.1, and we are migrating to version 7.3 very soon.
Miriam Tover - PeerSpot reviewer
Service Delivery Manager at PeerSpot (formerly IT Central Station)
Mar 05, 2021
Please share with the community what you think needs improvement with Talend Data Quality. What are its weaknesses? What would you like to see changed in a future version?
See 2 answers
HU
Practice Manager (Digital Solutions) at a computer software company with 201-500 employees
30 August 20
I would say that some of the support elements need improvement. It is built on open-source technology and they provide platinum support, but they need improvement. We have a large customer base and they need more customized support from them. I would like to see more advancements with certain big data technology that they have that hasn't been added to the platform. It's something that they could add in the future.
SP
IT Manager at a insurance company with 10,001+ employees
05 March 21
They don't have any AI capabilities. Talend DQ is specifically for data quality, which only has data profiling. With Talend DQ, I cannot generate any reports today, so I need an ETL tool. It provides general Excel files, or I have to create some views. If instead of buying a new tool, Talend provides a reporting capability or solution, it would be great. It will reduce the development effort for creating these kinds of reports. We also manage the infrastructure for Talend. From the licensing perspective, for cloud, they only have seat licenses where one person is tied to one license, but for on-premise, they have concurrent licenses. It would be really awesome if they can provide concurrent licenses for the cloud so that if one person is not there, somebody else can use that license. Currently, it is not possible unless a person deactivates his or her license and moves the same seat license to someone else. We are one of the biggest customers in the central zone of the US for Talend, and this is the feedback that we have provided them again and again, but they come back and say that they aren't able to provide concurrent licenses on the cloud. In version 7.3, there is a feature for tokenization and de-tokenization of data. This is the feature that we are looking for. It is useful if somebody wants to see what we have masked and how do we demask it. This feature is not there in version 7.1. There are also a few other capabilities on the cloud, but we don't yet have a big footprint in the cloud.
Related Articles
Manoj Narayanan - PeerSpot reviewer
Practice Director - Digital & Analytics Practice at HCL Technologies
Aug 12, 2022
What is a data mesh? For decades, entrprises have just stored and stored data without really having any ways of cataloguing making it difficult to use the same data for a good reason. The issues include difficulty in discovering relevant data, not being able to democratize data and use it for business strategies and most importantly not being able to trust the data. It is important to ensure ...
reviewer1925439 - PeerSpot reviewer
Manager at HCL Technologies
Jul 26, 2022
Why Data Quality Is Important Every organization needs good data quality to ensure the right business decisions are made. These days organizations are adopting new data cleansing strategies to elevate and ensure the quality of enterprise data used for analytical purposes. Data Quality can also be defined as a measurement of how to fit a data set is to serve the specific needs of an organizat...
Related Articles
Manoj Narayanan - PeerSpot reviewer
Practice Director - Digital & Analytics Practice at HCL Technologies
Aug 12, 2022
How to operationalize a Data Mesh using Data Fabric
What is a data mesh? For decades, entrprises have just stored and stored data without really hav...
reviewer1925439 - PeerSpot reviewer
Manager at HCL Technologies
Jul 26, 2022
Approaching Data Quality in Today's World
Why Data Quality Is Important Every organization needs good data quality to ensure the right bus...
Download Free Report
Download our free Data Quality Report and find out what your peers are saying about Talend, Experian, Informatica, and more! Updated: September 2022.
DOWNLOAD NOW
635,987 professionals have used our research since 2012.