We use it to bring live production data into a testing environment for developers and code testing. It works well because it seamlessly connects with the ETL engine and has its own policy pack, reducing the need for custom coding.
TDM also has strong masking capabilities for sensitive data like names, Social Security numbers, dates of birth, and bank accounts, which is crucial, especially in the financial industry.
We also primarily use Informatica PowerCenter, which is our ETL tool. TDM integrates well with PowerCenter, allowing us to bring in development data and incorporate it into the transformation phase of our ETL process. The masked and subsetted data is then loaded into our testing environments.
Informatica TDM improved our data compliance and security for testing.
Some companies use custom-built, in-house solutions to mask sensitive data, such as Social Security numbers or credit card information. But then, these solutions can vary in robustness and may not allow to track changes.
With Informatica TDM, any production data brought into a non-production environment is automatically securitized. TDM offers different algorithms (around 24 at the time) to choose from for masking data. This flexibility allows us to customize the masking process according to our specific needs.
Informatica TDM is also developer-friendly, making it easy for ETL developers to pick up and use without extensive training. It's not overly complicated and provides a streamlined approach to data masking and security.
Data subsetting and testing cycle:
Data subsetting is a key aspect of sampling in our testing process. All I need is a small representative sample, like ten thousand records or so. We can select the sample and ensure it covers all possible scenarios.
The subsetting algorithm in TDM helps us make sure that the subset is representative of the entire population, even with terabytes of data. This is crucial for efficient testing.
The main thing is, in version 9.7.1 (and I think it's still there), there are two features: persistent data masking (PDM) and dynamic data masking (DDM). Both options are used by companies in some way.
Dynamic data masking means that if someone with authority queries the database, they can see the actual data. However, lower-level users without the necessary permissions will see masked data. It depends on who runs the query. For example, a CEO would see everything, but others wouldn't. Dynamic data masking is implemented in data transit.
I had the privilege of working with and managing this product back in 2017 when we were implementing version 9.7.1.