Personal details

Williams M. - Remote DevOps engineer

Based in: 🇬🇧 United Kingdom
Timezone: Edinburgh (UTC+1)

Summary

I am a Machine Learning/DevOps engineer with extensive experience building reliable, highly available platforms that scale and deliver the uptime needed for customer satisfaction.

I am knowledgeable about the CAP theorem, the PACELC theorem, advanced systems design, data partitioning, application- and infrastructure-level monitoring, and reducing request latency. One of my proudest achievements was improving the uptime of an infrastructure from 79% to 99.99%.

I am skilled in Kubernetes, Python, Terraform, Jenkins, AWS, GCP, Ansible and Docker, and I always apply best practices such as preventing privilege escalation, scanning images before deployment and handling secrets correctly. I am also a Certified Kubernetes Application Developer.

I am also a Senior Full Stack developer. I have developed multiple APIs with TypeScript and NestJS, and I have built web applications with React and mobile applications with Flutter and React Native.

Work Experience

AWS Senior DevOps Engineer
The Weather Company, An IBM business | Sep 2023 - Present
Python
Bash
Jenkins
Terraform
Grafana
Prometheus
AWS EKS
Helm Charts
Docker & Kubernetes
Argo CD
Notion
AWS (Amazon Web Services)
  • Directed the design and implementation of scalable cloud infrastructure on AWS and GCP for a high-traffic platform, serving over 1 million monthly users with 99.99% uptime.
  • Implemented infrastructure-as-code practices using Terraform and Argo CD, reducing deployment times by over 50% and enhancing team productivity.
  • Pioneered the migration from EC2 to AWS EKS as one of the first DevOps Engineers on the team, cutting infrastructure costs by 40%, boosting deployment speed by 35%, and enhancing system resilience by 50%.
  • Authored and maintained 200+ pages of technical documentation on Notion, significantly improving the onboarding process for new team members and external collaborators.
  • Orchestrated the deployment of a new Jenkins server, enhancing CI/CD pipelines with Kubernetes pods; achieved a 50% increase in build process efficiency for 30+ active development teams.
  • Integrated AWS Secrets Manager, streamlining secret management for 100+ applications; improved security posture by 40% through robust IAM roles and security group configurations.
  • Led a critical migration project, transferring 20+ CI builds to the new Jenkins platform within a tight 1-month deadline, ensuring zero disruption to ongoing development activities.
  • Spearheaded the introduction of Docker-in-Docker (DinD) servers for Jenkins builds, reducing build times by 25% and enhancing the pipeline's reliability for containerized applications.
  • Initiated and successfully completed the migration of Helm charts to AWS ECR, facilitating a smoother deployment process and improving deployment efficiency by 30%.
  • Established a comprehensive monitoring solution using Prometheus and Grafana for real-time visibility into the health and performance of Linux servers and applications, enabling proactive issue resolution and 99.9% uptime.
Lead DevOps Engineer
Skuad Pte | Jun 2022 - Present
LDAP
Docker
Google Cloud Platform
Kubernetes
GraphQL
Terraform
Grafana
Prometheus
Helm
GitLab CI/CD
  • Setting up GitOps on the infrastructure using Argo CD
  • Migrating imperative code to declarative infrastructure-as-code using Terraform
  • Creating Helm charts to manage application releases
  • Managing deployment of new services
  • Implementing high availability and disaster recovery
  • Setting up SonarQube for code quality checks, integrating it with LDAP and exporting metrics to Prometheus
  • Setting up Thanos for long-term storage of Prometheus metrics
  • Creating alerts for Kubernetes workloads, container restarts, node pool auto-upgrades and VMs
  • Writing post-mortems and RCAs after every incident to ensure better crisis management
  • Writing good code documentation to reduce knowledge gaps in the team
  • Optimising Docker image build size
  • Migrating domain nameservers from GoDaddy to Cloudflare and integrating a WAF on the domain
  • Reviewing code in the team's GitLab organization

Personal Projects

FoodCourt Web App
2020
React
Tailwind CSS
Prospa Mobile App
2019
React Native