Collibra Governance Room for Improvement
Collibra is very good at talking to modern database systems such as a normal RDBMS (e.g.DB2, SQL server or Oracle). Where it isn't great is with older technologies that you'll typically find in finance or insurance industries (e.g. VSAM or ISAM, or those types of older technologies). It just doesn't connect with them very easily. They do provide an ability to use a separate product called MuleSoft, which they used to license (as a bundle) up until last year until Salesforce bought MuleSoft, and that division is happening in 2021. With this 'bolt-on', you could go and get that data, but you had to write that code and maintain it yourself. It wasn't an out-of-box (OOB) feature, which is what we really liked from the Collibra offering. Our only way to access these older technologies was to create a MuleSoft flow, maintain, and deploy it. This leaves us with technical debt which will need to continually be maintained. In fact, we built all our custom Mulesoft flows using Mule 3.x and will soon be pushed to upgrade to Mule 4.x. This will not be a simple upgrade and will likely result in additional cost to bring in consulting resources more familiar with the technology. Since we do have a lot of older legacy systems, things that aren't greenfield, if you will, it adds a lot more overhead than what we were originally led to believe when we originally purchased the product.
We're not that deep into the Collibra product yet because it's only been a couple of years. We do like their ability to automate the workflows, such that, for example, if somebody comes in to say, "I want to request access to this data," you can build your own workflows to automate the approval process. There are some that are out-of-box, I think they could go a little bit further with some of their out-of-box workflows instead of having to create a workflow manually, get somebody to code it, and implement it. I think they could offer a bit more in that respect.
The second item that I think they could do better at is to have other products, or have things where they have a set of taxonomy per industry that says, "Here's what a policy is. Here's what a customer is," that kind of thing. They don't implement that out-of-box in Collibra, you have to do that yourself, whereas other products bring that to the table. Informatica, I believe, has their own insurance industry or industry specific taxonomy that would come with the product.
It makes adding the new logical constructs to Collibra a more manual workup to take care of. The classification becomes more manual because you don't get that out-of-box to say, "Hey, I recognize that that's a policy, because I know that about that and the taxonomy." You have to manually make that connection.
I'm always putting in enhancement requests because we want everything to be perfect for us. We understand that there are thousands of companies that use the tool and many of them use it in different ways. They should allow the customer to have some additional flexibility.
Take Microsoft for example. Sometimes companies do something cool because their software programmer thinks it's cool, but for the user experience, it stinks. Something as simple as "Why did you decrease the font size from 12, down to nine on every page? Now eyes over 40, can't read it". As it relates to this specific software, I think that getting more feedback from users on changes to the software before implementing it would increased value for them. Overall, they're really great. We meet with our Collibra rep every other week. We talk about what's working, and what's not working. They're a fantastic company to collaborate with. Still, when they just go ahead and implement new things, a bunch of new tickets open, wanting it to go back to the way it used to be.
As far as the functionality of the tool is concerned, it's pretty slick. It's very comprehensive. Still, sometimes they create an interface or an integration without thinking about how the business is going to use it. Virtually all software companies do this. Oracle does it, Informatica does it — every tool does it. They make changes and then they're like, "Oh, I didn't know you were going to use it like that". Why? nine-tenths of their customers use it like that. I think it's a common business challenge that any company has. They're just trying to appeal to a vast audience of customers.
They should consider that what makes a financial institution happy will not necessarily make a healthcare institution happy, which will not necessarily make a high-tech vertical happy either.
There's a lot of things available in their Data Intelligence Cloud that are not available on-premise. For me, if they could boost the search functionality so that if somebody types in a word, it automatically will pull up similar terms first — that would be nice. For business users, instead of what is just the most common volume of content in your tool, now they have a very cool faceted search. It would be very similar to Amazon. You can refine your search down to a fly's eyelash if you'd like. I see a lot of people in the community commenting that this would be a big help.
Group Manager at a tech services company with 10,001+ employees
I am a business person — I am a team leader. My duty is to ensure that the data governance processes are set up; that's how I started to use Collibra. There are certain limitations I have observed in Collibra. With regards to our data lake, Collibra doesn't give us direct connectivity to the Azure Data Lake. We have to establish data lineages. We have to browse those files manually and then connect them via Collibra — that's how data dictionaries get published. Overall, it's quite a manual type of process which needs a lot of human intervention.
I've been hearing that tools like Talent are going to be available soon, which we hope to leverage in the near future. Talent is similar to other ETL or Informatica-type tools. It directly connects to the source system, captures all the transformation tools, and provides you with a spreadsheet that talks about data lineage, which can be fed into Collibra. If this functionality could be improved, it would be a great time-saving solution. It would require less effort and it would be a more automated kind of system, less dependent on human operation, which means that it would be less prone to errors as well.
We create and issue the management of workflows with Collibra. In regards to workflows, I find that they can be made very simple. For example, a request goes directly to the person who is in charge of that particular asset and some simpler workflows can be assigned to it. Recently, I find that the default process of issue management in Collibra is really complex — It wasn't really helpful to us.View full review »
The connectors are not very sophisticated. They can do, for example, Informatica and Tableau, but the connectors themselves could be improved.
I recently got a subscription for another 600K for Collibra for one more year, so the author licenses are not used much. And they keep changing the UI platform; that can also be improved.
From an administration perspective, I like the white-glove onboarding part of Collibra. That was actually nice and I really liked that. For administration in general, I like that you can use Collibra however you want. It's more raw and easily adaptable.
So you can cook it or you can steam it or you can make changes to it in a lot of different ways, but it would also be nice if there were an already available analytics tools like Tableau at hand. Though it is easily adaptable and you'll have a completed end product which you can really leverage.View full review »
Sr. Systems Analyst, Master Data Governance at a manufacturing company with 10,001+ employees
It's not necessarily a tool specific, however, with any sort of application, there's an investment as far as the way in which you need to use it. There is a lot of upfront work that has to be considered. That's just a common reality with any software implementation. There's a lot of pre-work. You just don't turn on the lights assume it's going to work exactly as you envisioned. There is input and planning required.
If anything, I would say that the licensing is one area that could get improved. We have basically three roles: an admin, an editor, and a view-only role. It is limiting. For example, we want view-only, however, if we want users to be able to approve workflows, they need editor rights. That makes sense, except it doesn't necessarily meet all the business cases we have. In some instances, you might just need proper approvals, and you are not necessarily asking anyone to edit things. Yet in order for them to approve, they must have edit rights.
The last implementation was very much focused more on IT and capturing more of the IT view of data and even data definitions really focused on data standards, such as how we're going to name the technical fields or how we're going to name the entities. This new deployment is really much more focused on not just the IT side but on the business side and the operational side. It's based more so around analytics and operational governance. I'm hoping to use more of the modules and have a better, more favorable opinion of the solution's capabilities. While overall I have the sense it's good, the last company I was with didn't have the right business partners and it really just became another IT tool, which wasn't helpful to the company as a whole.
The initial setup requires more of a trial and error approach and there isn't too much documentation available to help you figure things out. There needs to be more online support around the sharing of best practices. There are a lot of use cases and people like the tool. That said, you hear a lot of pain points around large amounts of data being ingested and creating backlogs of data that need to be cataloged and there's really no way to prioritize it.
Ultimately, it's a tool that should help to coordinate a lot of efforts and it would be nice to be able to look at something and understand how another experience could be similar or you can get a lesson learned before you actually make it your own lesson to learn.
This is more of a data governance tool, not necessarily a centralized tool for data cleansing. However, with the data quality module, that's the next evolution that's possible. Looking at data quality issues and then ultimately not necessarily being able to correct them, there's a lost opportunity. Data changes all the time. We're measuring it all the time. It would be advantageous to build this into more of a data quality tool in which users could cleanse data that could go back to source systems. That said, that's encroaching on more of the MDM solution.View full review »
Consultant II at a tech services company with 51-200 employees
While connecting with the data source, it's not very easy. If there's a firewall, it is difficult to connect with the database. It's not easy when you are configuring on the database.
Right now, the client is decommissioning the MuleSoft integration and they're moving to APIs. Collibra Connect and MuleSoft integration were there before, however, now there's a move to API. Within a year or two, they will all move to API. Whoever is using it now with MuleSoft and Collibra Connect needs to find another way for connecting with the API.
I don't think they are providing additional software for MuleSoft integration. Primarily, they are telling us, okay, we will decommission this and move to API. The only thing that's lacking in terms of the change is when connecting to database. Sometimes the connection causes issues if the data is breaking the firewall and ingesting the data.
The UI is good if you happen to be an administrator and are familiar with the technical side of the administration. If you're a business user, the UI is not good. It is hard to learn. It is hard for those who are administering it to teach to end-users and it can take hours of training to do it. Because it is difficult and non-intuitive, business users resist using it. It is a battle to get them on board and to keep them engaged because of the UI. On the other hand, Collibra just hired a person specifically to revamp the UI. So, they're dealing with it, but it isn't there yet.
They're working on the lineage harvesting for technical lineages. I don't know this for a fact, but my feeling is that this is new to them. So, they're still developing it and it feels awkward.
Technology Analyst at a tech services company with 10,001+ employees
It should have more integrations with things like CyberArk because its main purpose is GDPR implementation. We have to have more scope for things that implement more privacy. CyberArk makes sure your credentials are vaulted and your things are secure when you're creating your integrations or connecting to an application. I do believe that they are working on this feature.
One problem is the data lineage, especially extracting the ETL transformation from different ETL tools and identifying how the data is getting changed from one layer to different layers and how the transformation is applied. It doesn't support all the ETL tools for extracting the transformation logic. It supports some of the tools, but there are still some tools that need to be supported.
There is also a small pain point in terms of integration. There is a little bit of change in their strategy from Collibra's end. Earlier, they used to offer two solutions. One was out of the box, and one was a custom-built solution for which they used to provide a dual connector. Now the focus from the Collibra side is more on using the out of the box connector. They are discouraging doing the custom integration. That leaves us with two problems. The first problem is that the out-of-the-box connecter is not yet enabled for a lot of systems, and the second problem is that the out-of-the-box connecter has certain limitations. If we want to tweak those as per our needs, it is not possible. However, the custom-built is still supported, and you can still build a custom integration by using the API, but it is not very encouraged by Collibra.
Its dashboard also needs to be improved. There are options to use the HTML code to customize your dashboard, but it has a lot of limitations.
There are many new aspects of the solution, however, I haven't yet gone through the documentation to see if they really help solve for issues or not.
Many features have recently changed their appearance and I need to re-learn how they work.
Sometimes, if a client needs a specific customization, we cannot do it directly. The client needs to reach out to Collibra and request the customization.
The technical support is very poor.View full review »
Manager - Finance at a financial services firm with 10,001+ employees
The issue may be the way it's been implemented in my company but, for Collibra to be really useful, what's missing is an easy way to connect to different data sources and different types of data sources and actually ingest and profile some of that data. That's the trouble we've always had in getting wider adoption of the tool. Unless there's a mandate from the enterprise data office or the like, regular users are not going to use the tool for really robust business use cases without having some actual data in there. I know there is some out of the box capability for this, but I think it needs to be easier for Collibra to actually ingest and run some basic profiling on the data itself. That's currently missing from the tool.View full review »
The breadth of available connectors for metadata ingestion need to grow quickly to support customers as they expand their data governance programs to include a diverse list of source systems from which they want to derive business value. The connectors are needed to bring metadata into Collibra and enable lineage, workflows, definitions, etc. That said, this is not just a Collibra problem - this is an everybody problem. The central challenge is the availability of APIs to ingest text structural metadata, which is a common problem across any data governance platform or even any integration platform, honestly.
To be fair, I would say that Collibra's purpose and primary value is as a collaboration platform, which is the core value of business-centric data governance, and not as an integration platform. For this purpose, they are clearly the leading solution.
Data Governance Analyst
The solution needs to be controlled. It can go sometimes out of hand.
The speed sometimes, especially now, since we have moved to the Collibra Cloud, has not been the best. The management of the speed of the tool is not that great. It's also partially impacted by the fact that we need to use a VPN and we have got a lot of security measures. Sometimes it's not working well together with everything else. That is the main pain point that we are having.
Occasionally we get little bugs that occur, however, this is typical.
We would like to have a data lineage feature. It's just like on a different module. That's already available, as well as some advanced connectors.
From my perspective, I would like to see improvement in the dashboard creation, to make it easier to create a really nice dashboard, and to also be able to play with the user interface when it comes to those dashboards.View full review »
Collibra, as far as I know, does not have a connector like Oracle, or a mainframe. It's important to have a connector so that you have access to up-to-date information. Sometimes the data can be out-of-date as the updates are not automatic. Users could be looking at obsolete information.
You need to be precise about the names of the field and you have to develop them yourself.
It's my understanding that they are working on a solution where you can import all the information that you need from a data validation too, or from a CRM. It's something they really need to get better at.
It would be better if there was a way to import all data and metadata in an automatic way in one block form.View full review »
Consultant at a tech services company with 10,001+ employees
I'm fairly new to the product, however, what I generally hear from my clients is that the requirement around having ways to ingest more metadata. Currently, with Collibra, they provide you a catalog platform, which helps you integrate or get metadata from a few commonly known platforms, like Tableau and IBM Db2, and Informatica. If they could bring them through, or if they could bring in more connectors to help us ingest metadata from other systems as well, that would be really helpful. That would reduce a lot of time and effort from our end.
If people had backward compatibility as well, that would be much better. I've also worked on other technologies, primarily Java, which is very, very much backward compatible. Any new implementation which they bring in does not impact your existing work to a heavy extent. It would be helpful if Collibra was similar.
Senior Consulting Analyst at a construction company with 5,001-10,000 employees
There are still some new features being released and unveiled such as the certification workflows or questionnaires for dashboard certification, and stuff like that, that will improve a lot as releases go on. There are just minor things here and there that can change.
Every time you change or publish a new questionnaire for certification, you have to basically go into the code and update the new version of the questionnaire. Just little things here and there like that will, as features improve, be improved as well.
Much like other tools, some things can be a little bit immature sometimes.
I'm a little bit limited in some of the features that we have available. It can be a little bit clunky or require extra Windows that aren't necessarily needed. It can get a little bit over-complicated here. It would be nice to have some of the workflow functionalities a bit more usable. There could be more advanced customizations and more added features.
The workflows and the language they use needs to be improved. Programming the needs for every user on the workflows is a key improvement that is required. In addition, they haven't updated their training solution in a while. We need to implement a lot of things ourselves and they want us to move to the cloud but there are a lot of glitches in the system. There are three environments - stage, development and production. Often things work well in the first two stages and then when you get to production, they don't work. It happens a lot and their response is slow.
Business Analyst at a financial services firm with 1,001-5,000 employees
We have an issue with metadata history. If someone changes the metadata, we can't see who changed it. But they are trying to upgrade the system with this feedback and are still working on it. We are still waiting for a proper log to maintain the solution.