The AI Infrastructure (Ai2) Host Management Organizationis responsible for securely and accurately wipe and update bare metal (BM) servers and bring them into OCI fleet as well as build services to operationally scale and manage firmware life cycle of these Bare Metal Servers.
As the team manager, you will drive the rollout of optimizations and granule tuning to achieve maximum performance from OCI's globally deployed BM fleet. You will drive and support engineers during customer escalations and troubleshooting sessions by applying your Systems and software architecture knowledge. You will also drive enhancements within an existing software/network architecture and suggest improvements to the architecture, including managing control activities in multi-functional areas of the business. Ensures appropriate operational planning is effectively executed to meet Corporate specifications. Demonstrated leadership and people management skills. Strong communication skills, analytical skills, and a thorough understanding of product development.
Internal Responsibilities
As an M4 Director in the AI Infrastructure (Ai2) Host Management organization, you are responsible for the maintenance, improvement, and operation of large-scale Bare Metal fleet in Oracle Cloud Infrastructure (OCI). This role primarily focuses on developing and supporting services and systems by leveraging a deep understanding of distributed systems with strong programming skills. Since OCI is cloud-based with a global footprint, your responsibilities will encompass support for hundreds of thousands of firmware programmable devices and servers, connected via a mix of dedicated network infrastructure.
Lead engineering network performance improvement programs across Ai2 org:
- Attract, develop & manage a team of highly skilled engineers.
- Define and develop roadmaps to deliver engineering and operational efficiencies.
- Develop, grow, and maintain data-driven metric programs that speak to both the operational and business status of the services owned within Ai2 domain.
- Solve difficult problems in distributed systems, infrastructure, and highly available services.
- Collaborate with Various Compute, Hardware, Firmware, Networking and GNOC operations.
- Support engineering organizations to deliver a highly available service to our customers.
- Participate in on-call rotation for managers
External Responsibilities
As an M4 Director in the AI Infrastructure (Ai2) Host Management organization, you are responsible for the maintenance, improvement, and operation of large-scale Bare Metal fleet in Oracle Cloud Infrastructure (OCI). This role primarily focuses on developing and supporting services and systems by leveraging a deep understanding of distributed systems with strong programming skills. Since OCI is cloud-based with a global footprint, your responsibilities will encompass support for hundreds of thousands of firmware programmable devices and servers, connected via a mix of dedicated network infrastructure.
Lead engineering network performance improvement programs across Ai2 org:
- Attract, develop & manage a team of highly skilled engineers.
- Define and develop roadmaps to deliver engineering and operational efficiencies.
- Develop, grow, and maintain data-driven metric programs that speak to both the operational and business status of the services owned within Ai2 domain.
- Solve difficult problems in distributed systems, infrastructure, and highly available services.
- Collaborate with Various Compute, Hardware, Firmware, Networking and GNOC operations.
- Support engineering organizations to deliver a highly available service to our customers.
- Participate in on-call rotation for managers