OCI Network Availability is seeking a Senior Manager to lead a Networking Reliability Engineering team responsible for driving operational excellence across the OCI physical network in a broadly distributed, multi-tenant cloud environment.
This is a highly collaborative, full-spectrum leadership role that spans both engineering and operations. You will work closely with network engineers to build, operate, and continuously improve services and tooling, while partnering with other OCI teams to simplify and enhance management of the OCI network at scale.
Responsibilities
In this role, you will:
- Attract, develop, and lead a team of highly skilled network engineers.
- Define and execute roadmaps that improve operational efficiency, reliability, and scalability.
- Establish, track, and report on key metrics that measure service availability and operational health.
- Drive strategic technology initiatives that enhance the reliability, performance, and scalability of OCI networking services.
- Solve complex problems across distributed systems, network infrastructure, and highly available services.
- Improve and expand OCI network monitoring, automation, and operational tooling.
- Collaborate with engineering teams across OCI to deliver highly available and resilient services for our customers.
- Participate in the manager on-call rotation and support operational excellence initiatives.
The right leader for this role will make a meaningful impact on our organization, products, and customers. Are you someone who can provide direction and structure while empowering teams to succeed? Do you enjoy mentoring and developing engineers? Are you open to feedback and continuous learning from peers and leaders across a large organization? Do you thrive in a fast-paced environment and enjoy solving challenging technical and operational problems? If so, we’d love to hear from you.
Preferred Qualifications
- 5+ years of experience in large-scale physical network reliability engineering.
- 3+ years of experience in an engineering and operations management role.
- Strong technical knowledge of cloud networking, distributed systems, and large-scale infrastructure operations.
- Proven experience in technical leadership and people management.
- Experience working in large enterprise, service provider, or cloud environments.
- Experience driving hiring, onboarding, employee development, and performance management.
- Excellent organizational, verbal, and written communication skills.
- Strong judgment and the ability to influence technical strategy, product direction, priorities, and operational improvements.