What is our primary use case?
I have been working with the Nutanix Kubernetes platform for the past five years.
Currently, I am supporting JetBlue Airlines in America, which uses a Nutanix environment and Nutanix Kubernetes Engine NKE. I am managing their infrastructure for around 2,500 VMs and over 200 Nutanix nodes.
We are supporting a Nutanix Kubernetes Engine NKE with around eight production clusters. The client is using it for API management. There's a project coming up to upgrade from NKE to NKP, the Nutanix Kubernetes platform. I am managing those eight Kubernetes clusters regarding infrastructure, including upgrades of the Kubernetes version when new ones are released and managing computer resources. If the DevOps team requires more resources, I deploy new worker nodes to add more CPU, RAM, and storage resources for them.
Recently, we deployed a few clusters for API management, where we used some kubectl extension commands that allow us to connect to the NKE cluster easily from the jump host without direct login, which is a newly added feature.
What is most valuable?
We implement RBAC for the Kubernetes clusters, connecting through a jump host from which the clusters are accessible by the DevOps team, application team, and infrastructure team. Each team has different RBAC levels; for example, the application team has read-only access, while the DevOps team has full admin privileges.
Recently, we deployed four Nutanix clusters, initially connecting to the Kubernetes cluster required downloading files from Prism Central. We requested Nutanix for an alternate connection method, which led to implementing the kubectl extension. The client is pleased with this solution as it eliminates the need to download files each time due to expiration concerns.
The upgrading happens in a rolling fashion, so one node reboots at a time and VMs are migrated to other nodes seamlessly, contributing to its strengths as a platform.
Regarding the Nutanix Kubernetes Engine NKE, I find it to be a stable platform, especially in terms of upgrading clusters. The process does not lead to downtime at the application or pod level, which I appreciate.
What needs improvement?
I have heard there are many features added in NKP that are not available in Nutanix Kubernetes Engine NKE, such as load balancer features.
The unified control plane feature is not being used in Nutanix Kubernetes Engine NKE.
We are currently using a third-party tool for load balancing, but going forward, NKP will include load balancer, ingress, and egress features, making infrastructure management much easier.
There is no auto-scaling feature in Nutanix Kubernetes Engine NKE; however, NKP offers it. In NKE, we created additional worker nodes and increased CPU and RAM capacity, but it required manual intervention.
Concerning policy-driven governance in Nutanix Kubernetes Engine NKE, we have the ability to implement RBAC at the cluster level, which helps maintain security by not granting full admin privileges to every end user, thus restricting access effectively.
For how long have I used the solution?
I have been working with the Nutanix Kubernetes platform for the past five years.
What was my experience with deployment of the solution?
I did not face any issues or complexities in the initial setup of Nutanix Kubernetes Engine NKE.
What do I think about the stability of the solution?
I have not experienced any downtimes, crashes, or stability issues with Nutanix Kubernetes Engine NKE; only connectivity issues have arisen occasionally, such as when etcd services do not come up. I have resolved these by restarting services and renewing expired SSL certificates, but I have never faced cluster crash issues.
What do I think about the scalability of the solution?
I have not worked with any other Kubernetes engines; I only began supporting Nutanix Kubernetes Engine NKE.
How are customer service and support?
My experience with Nutanix's technical support is excellent. Whenever I open a case, I receive great support from them, regardless of whether it is low or high priority.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
I have not worked with any other Kubernetes engines; I only began supporting Nutanix Kubernetes Engine NKE.
How was the initial setup?
I did not face any issues or complexities in the initial setup of Nutanix Kubernetes Engine NKE.
Recently, we deployed a few clusters for API management, where we used some kubectl extension commands that allow us to connect to the NKE cluster easily from the jump host without direct login, which is a newly added feature.
What about the implementation team?
Since I am from Tata Consultancy Services, we function as consultants supporting the infrastructure, but we do not involve ourselves in costing discussions.
What was our ROI?
I have not seen any time-saving, cost-saving, or resource-saving from the Nutanix Kubernetes Engine NKE solution, as I am an infrastructure person who does not directly engage in those aspects.
What's my experience with pricing, setup cost, and licensing?
The end client manages the licensing aspects.
Which other solutions did I evaluate?
Recently, we deployed four Nutanix clusters, initially connecting to the Kubernetes cluster required downloading files from Prism Central. We requested Nutanix for an alternate connection method, which led to implementing the kubectl extension.
What other advice do I have?
Every six months, we upgrade AOS on all the clusters. I manage around 25 plus Nutanix clusters, which includes upgrading Prism Central and utilizing Lifecycle Manager (LCM) for the Nutanix clusters.
After 10:00 PM Eastern Time, we have a maintenance window to implement changes. For critical application servers, we coordinate with the application team to get approvals and create change requests (CR) to proceed during off-business hours.
If the DevOps team faces any computer resource crunch, they open a request, and I increase computer resources by adding new worker nodes to the cluster. If they cannot connect to the cluster, there's a possibility that the client's SSL certificate could expire, so I renew the client SSL certificate for each cluster as part of my daily troubleshooting tasks.
I am not part of the DevOps team, as they handle application pod deployments and testing across different environments such as stage, development, and production. From my side as an infrastructure person, I solely manage the cluster level, not the application pods. However, I take care of Nutanix management pods, ensuring things like worker nodes or etcd nodes are operational.
I have not yet utilized NKP, as I am just learning about it since a project is coming up to upgrade from Nutanix Kubernetes Engine NKE to NKP.
The client has a minimal cloud footprint currently, but they plan to adopt NC2, the Nutanix Cloud Cluster, in the future, including moving some clusters from on-premises to the cloud. They also have a few Azure Kubernetes platforms but in very limited capacity, which the Windows team manages, so I do not have access to those environments.
On a scale of 1-10, I rate Nutanix Kubernetes Engine NKE a 9.
Which deployment model are you using for this solution?
On-premises
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?