Software Engineer III – Service Lifecycle
What You’ll Do:
- Help architect and develop the next-generation compute platform by leveraging industry best practices and adopting new technologies to support business growth.
- Influence product roadmaps of cloud providers (primarily GCP) to align with future-state architecture needs.
- Collaborate with business and technical stakeholders to design and enhance compute platform capabilities that enable faster and more reliable application development.
- Identify opportunities to introduce automation and reduce repetitive operational tasks (DRY).
- Participate in the on-call rotation to ensure operational excellence and maintain overall platform health.
We’re a Match Because You Have:
- 4+ years of experience in platform engineering or SRE roles using an object-oriented language (Python, Golang, etc.).
- Bachelor’s degree in Computer Science, Computer Engineering, or equivalent combination of education and experience.
- Strong hands-on experience with Kubernetes in a public cloud (GKE, EKS, AKS, etc.).
- Experience working with Istio/Service Mesh.
- Experience with Infrastructure as Code (Terraform, Pulumi, etc.).
- Familiarity with Public Cloud environments (GCP, AWS, Azure, etc.).
- Experience with CI/CD tools such as Argo, Buildkite, TravisCI, Jenkins, Spinnaker, etc.
- Experience with observability tools (Prometheus, Thanos, Grafana, Fluentbit, Cloud Monitoring, Datadog, PagerDuty, CloudWatch, Kibana, ElasticSearch, Splunk, VictorOps, etc.).
- Solid networking knowledge.
- Experience and enthusiasm for working in an agile environment.
- An analytical mindset and a passion for solving business problems with technology.
Nice to Haves:
- Experience with Dev Testing tools and patterns such as Garden, Flagger, Canary Deployments, Blue/Green Testing, and A/B Testing.
- Experience setting up and working with Kubernetes Admission Control (Kyverno, OPA, etc.).
- Experience with workload scaling (HPA, VPA, Capacity Planning/Reservations, etc.).