Director, Reliability Engineering
Insight Engines
At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.
Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.
Position Summary
The F5 Digital team is looking for a strong leader to help lead and build a new capability within F5. This is a unique role that will blend strong leadership skills, a focus on technical implementation, and experience with IT practices and processes. The Director of Reliability will lead teams focused on IT Service Management (ITSM), infrastructure, including cloud and on-premise infrastructure, and observability. These are rapidly maturing functions, and it will be critical for this leader to have a proven track record of driving a strong strategic vision to completion with agility and a human first approach. This role will be critical to fostering cross-organizational adoption, driving robust ways of working within their teams, and have a strong sense of ownership.
Key Responsibilities
Reliability Engineering
- Support infrastructure operation teams focused on cloud and on-premise infrastructure. Support sustainable engineering practices, including systematic intake, driving infrastructure management practices, and modern configuration management.
- Support cloud platforms practices for a multi cloud ecosystem including AWS, GCP, and Azure. Establish cloud platform practices and help build technologies and practices that lower the barrier of entry for engineers using cloud infrastructure.
- Support engineering enablement services and practices. Help build a strategic roadmap for tooling and automation that will allow engineers to quickly, securely, and effectively build applications in cloud ecosystems.
IT Service Management
- Own critical ITIL practices including change management, problem management, and incident management.
- Drive adoption of documented process, using strong cross organizational relationships to ensure success and support maturity assessments within Digital.
- Build connectivity between process and practices, helping drive robust metrics and simplified strategies for turning documentation into engineering practices.
- Own incident management responses and ensure communications and escalations to senior leadership are simple and effective
- Drive change management activities, including CAB, release management, and audit execution
Observability
- Build and blend modern observability practices with tradition NOC/SOC teams to create a lean and robust monitoring ecosystem between SaaS, cloud, and on-premise services
- Integrate incident management practices with automated observability tools and methodologies to drive visibility into system health and ensure service owners know about issues before their users
- Establish clear metrics such as Key Performance Indicators (KPIs), Service Level Objectives (SLOs), and Service Level Indicators (SLIs) to measure and continuously improve operational performance.
- Foster a proactive culture of monitoring and early detection to identify and address system anomalies before they impact users or infrastructure reliability.
Strategic Leadership & Execution
- Build and lead high-performing global teams across infrastructure, ITSM, and observability teams.
- Create strategic roadmaps and participate as delivery lead in large program level initiatives
- Collaborate closely with Security, Engineering, Compliance, and Legal organizations to ensure alignment and transparency
- Mentor, develop, and support technical teams, driving a culture of ownership, innovation, and continuous improvement.
- Define KPIs and metrics to measure operational performance and developer productivity.
- Drive vendor strategy and manage partner relationships for infrastructure platforms and developer tools.
Required Qualifications
- 12+ years of experience in infrastructure, platform engineering, ITIL, or developer tooling, with 5+ years in senior leadership roles.
- Proven track record overseeing large-scale cloud environments and physical data centers in complex enterprise environments.
- Expertise in Agile methodologies and driving team cultures through iterative improvement and technical excellence.
- Expertise in infrastructure-as-code (Terraform, Ansible), cloud-native operations, and hybrid networking.
- Deep understanding of developer platforms, including source control (GitHub, GitLab, Perforce, ADO), artifact repositories, CI/CD frameworks, and observability stacks.
- Strong grasp of DevOps principles, platform engineering, and infrastructure automation practices.
- Experience with NOC/SOC operations or observability practices and driving operational resilience through system health metrics
- Experience with compliance, risk management, and operational excellence frameworks (e.g., ITIL, SOC2, ISO).
- Strategic thinker with excellent leadership, communication, and stakeholder management skills.
Preferred Qualifications
- Experience supporting F5 products and global engineering organizations in high-growth, SaaS-based companies.
- ITIL 4 experience and certification and utilizing ServiceNOW for driving ITIL implementations
- Previous technical certifications for cloud platforms, including AWS Solutions Architect, Azure Architect, or GCP Cloud Architect
- Familiarity with developer productivity measurement and platform SLOs.
- Background in software engineering, platform operations, or technical architecture.
- Bachelor's or Master’s degree in Computer Science, Engineering, or a related field.
The Job Description is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change.
The annual base pay for this position is: $228,800.00 - $343,200.00F5 maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, geographic locations, and market conditions, as well as to reflect F5’s differing products, industries, and lines of business. The pay range referenced is as of the time of the job posting and is subject to change.
You may also be offered incentive compensation, bonus, restricted stock units, and benefits. More details about F5’s benefits can be found at the following link: https://www.f5.com/company/careers/benefits. F5 reserves the right to change or terminate any benefit plan without notice.
Please note that F5 only contacts candidates through F5 email address (ending with @f5.com) or auto email notification from Workday (ending with f5.com or @myworkday.com).
Equal Employment Opportunity
It is the policy of F5 to provide equal employment opportunities to all employees and employment applicants without regard to unlawful considerations of race, religion, color, national origin, sex, sexual orientation, gender identity or expression, age, sensory, physical, or mental disability, marital status, veteran or military status, genetic information, or any other classification protected by applicable local, state, or federal laws. This policy applies to all aspects of employment, including, but not limited to, hiring, job assignment, compensation, promotion, benefits, training, discipline, and termination. F5 offers a variety of reasonable accommodations for candidates. Requesting an accommodation is completely voluntary. F5 will assess the need for accommodations in the application process separately from those that may be needed to perform the job. Request by contacting accommodations@f5.com.