Description:
Job Summary: The Company is looking for a skilled DevOps Engineer to join our Infra team. As a key member, you will be responsible for setting up Data, ML, and Generative AI infrastructure for Fuelight 360, our cutting-edge digital product designed to optimize marketing investment allocation and other TCCC initiatives.
Key Responsibilities:
- Infrastructure Setup: Setting up, maintaining, and optimizing the Data, AI & ML infrastructure that supports Fuelight 360 including managing compute, storage, networking, and the overall cloud environment.
- Automation: Automate repetitive tasks related to the development, testing, and deployment processes to enhance efficiency and reduce the risk of human error.
- CI/CD Implementation: Work with the ML Engineer to set up and maintain the necessary infrastructure and patterns for CI/CD pipelines to automate the testing and deployment of code changes.
- Technical Expertise: Serve as a subject matter expert in DevOps and MLOps, providing guidance and expertise to enhance the overall capability of the team
- Collaboration and Communication: Work collaboratively with team members and stakeholders, fostering an environment of open communication and knowledge-sharing
Qualifications:
- BS or MS degree in Computer Science, Information Technology, or a related field.
- 4+ years using one of the following IaC frameworks: Terraform, Azure Resource Manager
- 4+ years of experience working on public cloud environments (Azure preferred; AWS, GCP), and associated deep understanding of failover, high-availability, high scalability, and security
- 2+ years of experience administering and managing Kubernetes clusters (EKS, GCP, or AKS)
- 2+ years of experience programming with Python, C/C++, Java, Go, or similar languages
- Experience building and delivering GenAI architectural solutions is a plus
Required Skills:
- Demonstrated experience as an MLOps Engineer.
- Deep understanding of cloud platforms and IaC tools.
- Proficiency in containerization technologies such as Docker and container orchestration platforms like Kubernetes.
- Solid programming skills in languages such as Python, C/C++, Java, Go, or similar languages and experience in scripting and automation.
- Familiarity with machine learning frameworks and libraries such as Jax, PyMC, and sci-kit-learn.
- Strong problem-solving and troubleshooting skills, with the ability to analyze and resolve complex technical issues.
Soft Skills:
- Excellent verbal and written communication
- Collaborative mindset with strong interpersonal skills
- Adaptable and flexible in a fast-paced environment
- Commitment to continuous learning and improvement