There are two major use cases for Teradata. One is that whatever data we cleanse and aggregate, we push it to Teradata for our business users. We create some ETL pipelines and automate them. The second use case is for data wrangling. Whatever data we publish to Teradata is used for various analyses, various SQLs, and a lot of dashboards sit on top of Teradata.
My most recent project was for inferring the Net Promoter Score for one of the largest Australian banks where I used Teradata for ETL and data analysis. The entire cleaned data of the bank was stored in Teradata, wherein we had eight to ten different datasets coming in from different sources that were aggregated or converged into Teradata. Using that data, we developed certain business rules on top of that aggregated dataset, which was further fed into Tableau that sat on top of Teradata. Using that data, we were able to infer the customer Net Promoter Score for a rolling six-week average.
The first thing that I appreciate about Teradata is its multi-parallel processing. Whatever queries we execute on Teradata, they are blazingly fast, so it offers really fast connectivity. Secondly, it also provides the MultiLoad feature, by which I can upload my Excels directly to Teradata or CSVs to analyze the data. The third feature is the QUALIFY or ROW_NUMBER keywords that I really appreciate about Teradata. The fourth thing is the way Teradata stores data in a columnar format for faster query processing, which is also one of the best features.
The multi-parallel processing and fast query execution of Teradata have benefited me and my team greatly. What really happens is that we store multiple copies of the data, one in Teradata and the other in our HDFS or object storage. When we had to query the data from the object storage, it was really slow, but when we discovered that this dataset is also available in Teradata, it was really fast, especially related to the NPS project that I was discussing. That is probably one of the use cases that I can recall.
Teradata has positively impacted my organization since its inception. Earlier, whatever data we used to house was in HDFS, then we migrated to cloud, and now we are using Teradata, but Teradata has also moved to cloud. Teradata has immensely helped our organization to fetch the data at a faster rate, which has saved us quite a lot of time. That is probably the very best thing about Teradata.
Teradata could be improved by having a web interface that can really help users to plug and play. Right now, what is required is that I have to install a desktop app for Teradata and then set up the connections. If the same thing were available in a web interface, that would be really helpful.
Since I started my career, I have been using Teradata. It has been more than seven years that I have been using Teradata.
The scalability of Teradata is really great. Whenever we need more resources, we can add that in Teradata, and when not needed, we can scale it down as well. All in all, it is very good.
I have not used other solutions personally, but I have seen a use case of Redshift getting used earlier in my current organization. I have also seen the use of object storage prior to using Teradata fully. What I can see now is that we are moving away from object storage because we want faster results, which is why we use Teradata.
I have seen a return on investment through time saved, specifically saving fifteen to twenty percent of the time.
At least fifteen to twenty percent of our time has been saved using Teradata, which has positively affected team productivity and business outcomes.
Before choosing Teradata, I evaluated other options such as Snowflake and Redshift. These were some of the options available.
My advice to others looking into using Teradata is to go for it if you need faster processing, multi-parallel processing, or more security. I would rate this product an eight out of ten.