AI Engineer (Mid / Senior level)
• Design, build, and optimize AI/ML models and algorithms tailored to business needs.
• Develop and implement LLM-based solutions, including custom training and fine-tuning.
• Integrate AI models into existing or new applications.
• Work with Azure OpenAI Service to deploy and manage AI workloads.
• Collaborate with cross-functional teams (Data Engineers, DevOps, Product) to scale AI initiatives.
• Maintain documentation, evaluate model performance, and iterate on improvements.
• Experience with Azure Open AI.
Cloud Ops Engineer (Part-time / Project basic )
• Design and implement end-to-end integration between Azure and Datadog, including:
o Metrics and log collection (Azure Monitor, Activity Logs, Diagnostic Settings).
o Distributed tracing and custom dashboards.
o Alerting rules and incident response workflows.
• Build and maintain an integrated cloud support system:
o Automated monitoring and alerting pipelines.
o Incident triage and escalation playbooks.
o Ticketing system and notification integration (e.g., PagerDuty, ServiceNow, Slack).
• Ensure operational readiness through runbooks, dashboards, and SLA reporting.
• Collaborate with development and infrastructure teams to identify observability gaps and improve monitoring coverage.
• Maintain Infrastructure-as-Code (IaC) templates (ARM, Bicep, or Terraform) for repeatable deployment of monitoring and support tooling.
Requirements
• Strong experience with Microsoft Azure (App Services, AKS, Functions, Monitor, Log Analytics).
• Hands-on experience integrating Datadog with Azure using APIs, agents, and Azure-native connectors.
• Experience in building support or operations tools/systems (e.g., alert routing, self-healing scripts).
• Solid scripting skills (e.g., PowerShell, Python, or Bash).
• Familiarity with CI/CD, containerization (Docker/Kubernetes), and DevOps practices.
• Experience with infrastructure automation using Terraform, Bicep, or ARM templates.
• Strong troubleshooting, problem-solving, and incident management skills