1. About Our Client:
The organization operates in the AI-driven marketing technology industry, focusing on personalized customer engagement through unified communication channels such as SMS, RCS, email, and push notifications. It addresses the challenge of creating authentic customer relationships by combining advanced AI technology with human expertise to deliver tailored marketing experiences that enhance performance, revenue, and loyalty. The company supports more than 8,000 customers across over 70 industries, including notable global brands, facilitating billions of interactions and generating tens of billions in revenue. With a distributed global workforce and offices in major cities worldwide, this team has received multiple recognitions for its culture and growth.
2. About the Opportunity:
The Senior Site Reliability Engineer role is focused on enhancing the reliability, scalability, and operational excellence of the platform. This position involves designing and implementing systems for improved observability and incident management, leading significant projects, and collaborating across various engineering teams to build robust platforms and services. The role is critical in establishing standards and driving reliability goals to ensure the platform meets high operational standards. Additionally, this position includes mentoring junior engineers and fostering continuous innovation to maintain and improve the organization's engineering capabilities.
3. Responsibilities:
• Design and implement systems to improve reliability, observability, traceability, and incident management
• Lead projects from discovery to execution, ensuring successful delivery
• Collaborate with AI/ML, Data, Platform, and Product engineering teams to develop advanced platforms and services
• Define and enforce production standards, processes, and tools for operational excellence
• Advocate for and implement SLIs, SLOs, and other reliability metrics across engineering teams
• Mentor junior team members to support technical growth and leadership development
• Drive continuous improvement by introducing creative solutions and challenging existing processes
4. Requirements:
• 5+ years of experience in Production Engineering, SRE, Platform Engineering, DevOps, Backend Engineering, or similar roles
• Proficiency in coding in at least one language such as Golang, Python, Java, or TypeScript
• Experience with cloud-native technologies and Infrastructure-as-Code tools like Kubernetes, Terraform, and AWS
• Proven track record delivering medium to large-scale projects that improve platform reliability and scalability
• Strong understanding of production reliability concepts including SLIs, SLOs, and incident management
• Skilled in designing and maintaining CI/CD pipelines, deployment strategies, and release automation
• Familiarity with AI-assisted development tools such as Claude Code, Codex, or Cursor
• Excellent communication skills for collaborating with technical and non-technical teams
• Experience working in dynamic, reliability-focused production environments preferred
5. Pay Range and Compensation Package:
• The US base salary range for this full-time position is $220,000 - $250,000 annually plus equity and benefits
• Salary ranges are determined by role, level, and location
6. Benefits & Perks:
• Competitive health and wellness benefits
• Equity participation
Equal Opportunity Statement: Our client is an equal opportunity employer. They celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, or national origin.
Note:
RemoteHunter is not the Employer of Record (EOR) for this role. Our purpose in this opportunity is to connect exceptional candidates with leading employers. We help job seekers worldwide discover roles that match their goals and guide them to complete their full application directly through the hiring company’s career page or ATS.