Oracle Cloud Infrastructure (OCI) is redefining the cloud for the world’s largest enterprises. We operate with the agility and innovation of a startup while delivering the scale, security, and reliability expected from one of the world’s leading technology companies.
OCI powers mission-critical workloads for customers globally, offering a comprehensive cloud platform built for high performance, distributed systems, and enterprise-grade reliability. Our engineering culture is grounded in the OCI Values of integrity, inclusion, innovation, customer focus, and operational excellence. We invest deeply in our people and foster an environment where diverse perspectives, collaboration, ownership, and continuous learning drive breakthrough results.
At OCI, you’ll work alongside exceptional engineers solving some of the most complex distributed systems challenges at cloud scale.
The OCI Limits Team owns the foundational platform that manages service limits, quotas, and capacity governance across OCI. The team enables customers and internal OCI services to scale reliably and securely by providing automated limit management, quota enforcement, and high-scale control plane integrations. We work closely with service teams across OCI to support rapid cloud growth, operational stability, and enterprise-grade resource governance. The organization operates highly distributed, mission-critical systems that directly impact customer onboarding, expansion, and cloud consumption experiences.
Who We’re Looking For
We are seeking an experienced engineering leader with a strong background in distributed systems and cloud infrastructure. You have successfully built and operated highly scalable services, led high-performing engineering teams, and delivered complex systems from architecture through production operations.
You thrive in environments where reliability, scalability, and operational excellence are critical. You understand how to balance long-term architectural investments with fast-paced execution and iterative delivery. You are passionate about building strong engineering cultures, simplifying complex systems, and enabling teams to move quickly while maintaining high quality.
The ideal candidate combines deep technical expertise with strong people leadership and a customer-first mindset.
Responsibilities
- Lead and grow a high-performing software engineering team responsible for OCI services.
- Own the delivery, availability, scalability, and operational excellence of mission-critical OCI services.
- Drive technical strategy, architecture decisions, and execution for distributed cloud services operating at massive scale.
- Partner across OCI organizations to deliver foundational platform capabilities supporting customer growth and cloud expansion.
- Establish engineering best practices around service reliability, automation, observability, and operational readiness.
- Mentor and develop engineering managers and senior engineers through coaching, performance management, and career development.
- Manage roadmap planning, prioritization, execution, and cross-functional coordination.
- Lead teams operating large-scale, highly available systems with a strong focus on resiliency, fault tolerance, and performance optimization.
- Champion Agile development methodologies and foster a culture of continuous improvement and operational ownership.
- Participate in incident management, customer escalations, and operational reviews as needed.
This team operates at the center of OCI’s rapid growth and scale expansion. Key challenges include:
- Designing systems capable of handling exponentially increasing traffic and service growth.
- Improving scalability, availability, and performance of globally distributed services.
- Building resilient systems that can withstand regional outages and dependency failures.
- Balancing rapid feature delivery with long-term platform sustainability.
- Driving architectural simplification while maintaining operational excellence.
- Enabling OCI service teams and customers to scale seamlessly and securely.
We are looking for leaders who can help teams navigate complex technical trade-offs, execute decisively, and build systems that operate reliably at cloud scale.
This role is open to candidates in the U.S. who can work onsite in Nashville, TN (priority location); Austin, TX and Seattle, WA are secondary locations. Relocation assistance is provided. This is not a remote position.
Minimum Qualifications
- BS or MS in Computer Science or equivalent experience.
- 5+ years of engineering management experience leading software development teams.
- 7+ years of experience designing, building, and operating large-scale distributed systems.
- Strong experience with Java, Go, C++, or C, along with scripting languages such as Python.
- Deep understanding of distributed systems, scalability, networking, operating systems, and service-oriented architectures.
- Experience building and operating highly available, cloud-native services.
- Strong knowledge of databases, storage systems, and distributed persistence technologies.
- Experience driving operational excellence, observability, performance tuning, and incident response.
- Proven ability to recruit, grow, and retain high-performing engineering teams.
Preferred Qualifications
- Experience developing and operating services on public cloud platforms such as OCI, AWS, Azure, or GCP.
- Experience building multi-tenant infrastructure platforms or cloud control plane services.
- Familiarity with large-scale quota management, resource governance, or capacity management systems.
- Experience leading teams responsible for mission-critical infrastructure services.