The SRE is responsible for meeting the agreed-upon SLOs for the Enterprise Imaging systems in their area of responsibility. The SRE will plan and assign all maintenance and deployment work, train and guide multi-competency teams on how to work on the systems, ensure the improvement of the tools and procedures necessary to operate the systems, and personally perform work such as troubleshooting, root cause analysis, and some complex maintenance tasks. The SRE is also responsible for designing, developing, automating, monitoring, and maintaining our complex datacenter, on-premises, and cloud environments, which host a variety of high-throughput web services and applications.
Location: Remote opportunity inside the US or Canada
Duties/Responsibilities:
Participate in defining SLIs, SLOs and SLAs for Enterprise Imaging Systems
Collaborate with R&D, Monitoring, and other teams as necessary to develop and implement effective mechanisms to monitor SLOs
Perform troubleshooting, deploy systems, or execute maintenance tasks as necessary to meet the specified SLOs on production and internal environments
Improve reliability, quality, and time-to-market of our suite of software solutions
Build software and systems to manage platform infrastructure and applications
Partner with architecture and development teams to improve services through rigorous testing and release procedures
Implement security infrastructure and sound security processes and controls in partnership with our Security team
Create sustainable systems and services through automation and process improvements
Support 24/7/365 mission-critical healthcare environments
University or college education in science, technology, engineering, or equivalent industry experience
Strong sense of ownership and dedication to results
Approaches challenges as opportunities and sees every day as a chance to become a little bit better
A proactive approach to spotting problems, areas of improvement, and bottlenecks
Ability to adapt to working with a wide array of technologies and languages
Excellent verbal and written communication skills and ability to communicate technical subjects to a broad range of stakeholders
1+ years of experience with CentOS/RHEL Linux-based system administration and any cloud computing platform (AWS, Azure, GCP, OpenStack, etc.), OR demonstrated knowledge of Linux and cloud computing technologies
Experience with networking, firewall configuration, and troubleshooting
Ability to program in one or more high-level languages, such as Bash, Python, Go, Java, or C/C++
Knowledge of configuration management tools such as Puppet, Chef, and Ansible
Experience with DevOps technologies such as Jenkins, Maven, GitHub
Conceptual knowledge of containerization services (Docker, Kubernetes)
Desired Experience/Skills:
Ability to install, configure, and manage both physical and virtual storage implementations (ZFS, NFS, S3, GCP, EBS)
Experience with systems lifecycle management products (Foreman, Katello, Red Hat Satellite)
Experience setting up and managing processes such as monitoring (Nagios/Check_MK), backup, and patching
Experience with any Microsoft Windows OS and development tools such as Visual Studio
Experience supporting 24/7/365 environments
Strong software and cloud computing security skills
Experience with sharding and data scalability for Postgres DB
This job description may not be inclusive of all assigned duties and the scope of the job may change as necessitated by business demands.
All applicants meeting minimum qualifications will be required to complete a 30-minute online assessment as part of their application.
Meet Intelerad’s Leadership Team: https://www.intelerad.com/en/about/leadership-team/