Providence Health & Services Sr Site Reliability Engineering (SRE) - Cloud Native (Digital Innovation Group) in Oregon
The Digital Innovation Group (DIG) at Providence St. Joseph Health is a small but mighty product development and incubation team that is building next-gen tools that give patients convenient and easy access to healthcare virtually anywhere, anytime.
DIG is calling for a Sr SRE - Cloud Native to one of our locations in Washington, Oregon, California, Alaska, Montana, New Mexico, or Texas.
We are seeking a Sr. SRE - Cloud Native who will be the front and center part of a team of software engineers focused on building software and automation tools to describe modern cloud native infrastructure, systems and application security, monitoring and operational excellence. You are a polyglot programmer familiar with many languages both interpreted (e.g. Python, Ruby, Perl, etc.) and compiled (C, C , Golang, Rust, etc.) and who has working familiarity with knowledge of modern software stacks in the cloud native landscape. You approach every problem from the viewpoint of it being a software problem that needs to be fixed by writing code. Your approach to software development closely adheres to a Reconciler Pattern. In addition to a software engineering centric view of the world, you are a capable engineer who understands the nuances of OS level concepts on a *NIX system, networking concepts in the cloud e.g. route tables and NAT gateways, security best practices as applies across the OSI stack and a champion of quality and operational best practices of running 24x7 applications in a distributed, fault-tolerant and secure environment.
In this position you will:
Write code to automate infrastructure, security and monitoring of systems and services - preferably in Python and/or Golang.
Have good knowledge Cloud Networking Infrastructure and related concepts such as DNS, TCP/IP, Routing Protocols, Security Groups, VPN, Firewalls, NATs etc.
Understand best practices for systems, networking and application security e.g. understand how common injection attacks play out.
Have an in-depth understanding of how modern day web application stacks are built starting from code running on the browser to how system calls in *NIX systems manifest themselves.
Be aware of best practices for observability, availability, latency, scalability and efficiency of distributed systems and applications running in a modern day Cloud Native Landscape.
Have intermediate to advanced knowledge in systems administration at the virtualization layer on *NIX systems as well as distributed operating systems such as Kubernetes.
Be part of an On-call team that is responsible for maintenance and stability of the infrastructure for web scale applications 24x7x365.
Required qualifications for this position include:
Bachelor's Degree in Computer Science and Engineering, Computer Science or Electrical and Computer Engineering or equivalent educ/experience
6 years experience being a systems engineer or part of a systems engineering or working with a cloud infrastructure team working on OS virtualization, cloud networking and Systems and Network security
6 years experience writing code in Python, Golang or equivalent languages to automate everything related to infrastructure with good understanding on how to write testable software, data structures and algorithms, microservices and data storage technologies
6 years experience with understanding how modern applications are deployed onto the cloud, Enterprise Integration Patterns and championing Operational Excellence in such an environment e.g. Observability best practices
6 years experience with Systems, Network and Application security best practices, threat models, defensive security best practices, writing code to test systems and applications by employing techniques such as Penetration Testing and Chaos Engineering
4 years knowledge of the Linux Operating System, Systems and Network Administration as it relates to modern cloud best practices e.g. Linux system calls, IP Tables, SE Linux, DNS/BIND, File Systems, HTTP Services, Email and File sharing
Preferred qualifications for this position include:
Master's Degree in Computer Science and Engineering, Computer Science or Electrical and Computer Engineering or equivalent educ/experience
2 years understanding of the Linux groups ecosystem and technologies surrounding it e.g. Docker, Mesos, Kubernetes
2 years being a member of the talent acquisition process and hiring excellent engineers for DIG
About the department you will serve:
Providence Strategic and Management Services provides a variety of functional and system support services for all eight regions of Providence Health & Services from Alaska to California. We are focused on supporting our Mission by delivering a robust foundation of services and sharing of specialized expertise.
We offer comprehensive, best-in-class benefits to our caregivers. For more information, visit
As expressions of God’s healing love, witnessed through the ministry of Jesus, we are steadfast in serving all, especially those who are poor and vulnerable.
Providence is a comprehensive not-for-profit network of hospitals, care centers, health plans, physicians, clinics, home health care and services continuing a more than 100-year tradition of serving the poor and vulnerable. Providence is proud to be an Equal Opportunity Employer. Providence does not discriminate on the basis of race, color, gender, disability, veteran, military status, religion, age, creed, national origin, sexual identity or expression, sexual orientation, marital status, genetic information, or any other basis prohibited by local, state, or federal law.
Job Category: Development
Other Location(s): Montana, Oregon-Portland, California, New Mexico, Oregon, Washington-Redmond, Washington, Texas, Alaska, California-Irvine
Req ID: 304456