We heavily depend on NVIDIA Tesla GPUs, as they are a vital part of our daily operations. We use them for various important tasks at my organization, where we are both a resource and an education institute. These GPUs are essential for our work in AI for health, genomics, and bioinformatics. From analyzing genomic data to driving progress in AI and machine learning, Tesla GPUs play a key role in our research and education efforts.
What is our primary use case?
How has it helped my organization?
Working with NVIDIA has been incredibly beneficial for us. The biggest benefits of working with it are its powerful GPU performance, reliable hardware, and excellent software ecosystem. Having everything from one source makes our work smoother. The NGC support with pre-built containers saves us time and effort, allowing quick adaptation without extensive testing. Unlike other GPU vendors, NVIDIA's solutions work seamlessly out of the box.
What is most valuable?
The most valuable aspects of Tesla are its CUDA software framework, which boosts our computing capabilities, and NVIDIA's NGC cloud support. The pre-built containers they offer, especially for tasks like potential flow simulations, are a big time-saver. These features make Tesla GPUs essential for our work in AI, genomics, and bioinformatics, simplifying our processes and helping us handle complex computations effectively.
What needs improvement?
While the current Tesla setup meets our needs well, it would be beneficial to see broader application support and compatibility with different workloads. The existing configuration handles our current use cases well, but expanding its capabilities to accommodate a wider range of applications would be a great improvement.
For how long have I used the solution?
I have been working with NVIDIA Tesla for five years.
What do I think about the stability of the solution?
We haven't experienced any stability issues with Tesla. It has been very stable for.
What do I think about the scalability of the solution?
The solution is scalable, especially with products like the NVIDIA DGX, which is designed for scalability. In our organization, we have over a thousand HPC users, with around 300 to 400 specifically using Tesla for their high-performance computing needs.
How are customer service and support?
The tech support for NVIDIA is excellent. They are very responsive and reliable. I would rate the support at an eight out of ten because we haven't encountered many issues and didn't have to reach out to them many times.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup process for Tesla was straightforward for me since I had prior experience working with the product. Setting it up requires two people.
The deployment process involves a two-step approach: hardware deployment and software development. After that, we use Ansible for automatic software installation. This includes getting the operating system in place using Foreman and adding necessary components like NVIDIA CUDA drivers. The deployment time varies based on the number of servers, but for around ten servers, it typically takes about two hours. We deploy them in parallel to streamline the process. Maintaining Tesla involves routine tasks like updating drivers and addressing security issues. We handle this by taking about 10% of our servers offline at a time, using a slow scheduler to ensure a controlled process.
What's my experience with pricing, setup cost, and licensing?
The majority of our Tesla GPUs operate on bare metal servers without additional licensing costs.
Which other solutions did I evaluate?
We considered other options before going with NVIDIA. Our focus is on what our users are comfortable with, and currently, NVIDIA is widely preferred by them. While we might explore other options like Xilinx FPGA cards or AMD GPUs in the future, our decision is mainly driven by meeting our users' current preferences.
What other advice do I have?
My advice for those considering working with Tesla is that if you can afford it, go for it. The ecosystem is robust and it is a worthwhile investment. Overall, I would rate NVIDIA Tesla as a nine out of ten.
Which deployment model are you using for this solution?
On-premises
![NVIDIA Tesla [EOL] Logo](https://images.peerspot.com/image/upload/c_scale,dpr_3.0,f_auto,q_100,w_100/rCrJ5QMmvucDdhnMyyNUFbAd.jpeg?_a=BACAGSGT)