Our primary use case is live game observability. When our game is live, we can see how the servers are performing and how players are connecting around the world. Generally, we monitor everything to make sure the game is stable and running smoothly.
The secondary use is for when we want to dig deeper into something, and for that we have the logging system as a backup. We can jump from a point in the observability metrics to the point in the logs at which it happened, then see what the server saw at that time in order to diagnose and fix whatever the issue might be.
Lockdown and COVID-19 made it impossible to go into work anymore. Having a central place where everybody has a login meant we could share graphs, take snapshots of them, or send people live data. Then, if anybody wants to know something, they can just go and have a look for themselves. They don't have to ask us, "What is the state of the services like?" or, "What does the player population look like?" We don't have to answer those questions. People can just go and find the answers themselves. Also, the customization means we can very quickly build graphs to analyze data or check something out. If somebody wants something that isn't there, it is a very quick turnaround to add it.
Logz.io is useful for helping us correlate our logs, metrics, and tracing data. They are all integrated together, so our observability metrics dashboard has a link in it. You click it, and it takes you to the logs. It is a holistic approach to having everything together.
It has definitely improved our troubleshooting. We have visibility into what is going on, so we can identify things as early as possible. The alerting system is super powerful and hooked up to all our systems, so we get alerts. Whenever there is an inkling of a problem, we know about it. That is incredibly useful compared to having to wait for someone to tell us, which is essentially what the old world used to be like: someone would have to notice. Now, we can build alerts in it and the solution will tell us.
Logz.io has increased productivity for everyone who uses it. There is no restrictive policy on who can go and find out some basic information or look at graphs. Our design team, customer service team, engineering team, and executive team can all look at it at different levels. They can just go to the dashboard or graph that has the information they want, without having to ask anybody. This adds a layer of transparency and demystifies a lot of what is going on, which helps productivity for the sheer fact that anyone can see it. There is a lot of data in Logz.io to be consumed, but it is very clearly laid out and not confusing.