Florencia Suarez Varady, Recruiter

Site Reliability Engineer (SRE)
We are seeking a skilled and experienced Site Reliability Engineer to lead our efforts
in streamlining and standardizing our DevOps practices across multiple projects.
The ideal candidate will have a strong background in cloud infrastructure,
automation, and modern DevOps methodologies.
This role offers an exciting opportunity to make a significant impact on our
organization's DevOps practices, driving efficiency, security, and innovation across
multiple agencies and projects.
Responsibilities:
- DevOps Standardization:
  - Assess and consolidate our existing DevOps practices.
  - Develop and implement standardized naming conventions for environments that
    convey clear meaning.
  - Create a centralized approach to manage deployments consistently across all
    projects.
- Resource Optimization:
  - Analyze current resource usage and implement strategies to optimize utilization.
  - Develop policies and procedures to right-size resources across development,
    staging, and production environments.
- Infrastructure as Code (IaC):
  - Establish a central repository for Infrastructure as Code to manage all cloud
    services.
  - Evaluate and recommend alternatives to Terraform.
  - Implement and maintain IaC best practices across the organization.
- Permissions and Access Management:
  - Audit existing permissions and access controls.
  - Design and implement a robust permissions structure with clearly defined
    groups.
  - Oversee the process of sorting users into appropriate permission groups.
- Continuous Integration/Continuous Deployment (CI/CD):
  - Standardize and automate deployment processes across all projects.
  - Implement automated processes for updating the authentication server.
- Monorepo Strategy:
  - Evaluate, develop and execute a strategy to move towards a monorepo
    architecture where appropriate.
  - Ensure the monorepo approach aligns with our multi-project structure and
    diverse project needs.
- Kubernetes Implementation:
  - Assess the feasibility and benefits of implementing Kubernetes for our
    environment.
  - If deemed appropriate, plan and oversee the migration of applications to
    Kubernetes, including services like RabbitMQ.
  - Develop strategies for local development environments that mirror the
    Kubernetes production setup.
- Security and Compliance:
  - Conduct a comprehensive assessment of our current security posture.
  - Implement security best practices across all environments and projects.
  - Ensure compliance with relevant industry standards and regulations.
- Performance Monitoring and Optimization:
  - Implement robust monitoring and alerting systems across all environments.
  - Continuously analyze system performance and make recommendations for
    improvements.
- Documentation and Knowledge Sharing:
  - Create comprehensive documentation and visualizations for all systems,
    processes, and best practices.
  - Facilitate knowledge sharing sessions to elevate the DevOps capabilities across
    the organization.
Requirements:
- 5+ years of experience in DevOps, Site Reliability Engineering, system
administration or similar roles.
- Strong experience with cloud platforms (preferably AWS or GCP).
- Proficiency in Infrastructure as Code tools (e.g., Terraform, Pulumi, or
CloudFormation).
- Experience with containerization technologies, especially Docker and Kubernetes.
- Solid understanding of CI/CD pipelines and tools (e.g., Jenkins, GitLab CI, Google
Cloud Build, or GitHub Actions).
- Knowledge of monitoring and logging solutions (e.g., Prometheus, Grafana, ELK
stack).
- Familiarity with message queuing systems like RabbitMQ.
- Strong scripting skills (e.g., Python, Bash, PowerShell).
- Experience with version control systems, preferably Git.
- Excellent problem-solving skills and ability to work in a multi-tenant environment.
- Strong communication skills and ability to work with diverse teams across multiple
disciplines.
Preferred Qualifications:
- Familiarity with monorepo architectures and tools (e.g., Bazel, Buck, Nx).
- Certifications in relevant cloud platforms or DevOps practices
(e.g., AWS Certified DevOps Engineer, Google Cloud Professional DevOps
Engineer).
- Experience with implementing zero-trust security models.
- Knowledge of compliance frameworks relevant to government or regulated