Role Overview
We are seeking a skilled DevOps and Kubernetes Deployment Specialist to provide a comprehensive turnkey solution for the on-premise deployment of our application. The role involves ensuring that our application, which uses NVIDIA GPUs, deploys efficiently and scales across multiple servers in a multi-node Kubernetes environment.
Responsibilities
- Develop and implement a scalable on-premise deployment solution for our application, which runs on Kubernetes.
- Create a self-service solution that lets users easily configure their own servers to run our application on Kubernetes without our involvement.
- Ensure the application makes use of the NVIDIA GPUs available in the server infrastructure.
- Configure a multi-node Kubernetes cluster to support application scaling across multiple servers.
- Collaborate with the team to troubleshoot and resolve deployment issues.
- Optimize deployment processes for performance and reliability.
Required Skills
- Expertise in Kubernetes for deploying and managing applications in a multi-node environment.
- Expertise in virtualization, storage, Rancher, and Ansible.
- Experience with NVIDIA GPUs and their integration into server infrastructure.
- Strong understanding of on-premise deployment strategies.
- Ability to troubleshoot and optimize deployment processes.
Nice to Have
- Experience with additional container orchestration tools.
- Familiarity with GPU-accelerated applications and their deployment requirements.