This role will own the development of a comprehensive Design and Product Quality & Reliability program spanning infrastructure design standards, product qualification, supplier quality expectations, reliability engineering, field performance analytics, and continuous improvement. The ideal candidate combines deep technical expertise in critical infrastructure systems with proven experience building organizations and quality programs in large-scale manufacturing, hyperscale infrastructure, semiconductor, power systems, or mission-critical environments.
The Sr. Director will partner closely with Engineering, Design, Construction, Supply Chain, Product Engineering, Operations, and strategic suppliers to ensure OCI infrastructure platforms consistently meet aggressive reliability, availability, and lifecycle performance objectives.
Key Responsibilities
Build and Lead the Function
- Establish and scale OCI’s Design Quality & Reliability organization for AI data center infrastructure.
- Develop the strategy, operating model, governance, metrics, and execution roadmap for the function.
- Build and lead a high-performing multidisciplinary team spanning reliability engineering, supplier quality, design assurance and validation,.
- Define organizational processes and standards for quality and reliability across the infrastructure lifecycle.
Design Quality & Reliability Leadership
- Ensure infrastructure designs meet OCI reliability, resiliency, maintainability, and lifecycle performance requirements.
- Drive design assurance processes that validate design intent against operational requirements and long-term reliability objectives.
- Lead cross-functional design reviews focused on reliability risk reduction, failure prevention.
- Establish reliability engineering methodologies including FMEA, fault tree analysis, accelerated life testing, and design-for-reliability practices.
Product Quality & Supplier Reliability
- Define qualification and acceptance criteria for critical infrastructure products and systems used in OCI data centers.
- Establish product quality benchmarks and reliability performance targets, including AFR (Annualized Failure Rate), IDR, MTBF, and other key reliability indicators.
- Develop supplier quality management frameworks and collaborate with strategic suppliers to improve product reliability and manufacturing quality.
- Drive root cause analysis and corrective action processes for field failures and reliability excursions.
Metrics, Benchmarking & Continuous Improvement
- Develop KPI dashboards and measurement systems to benchmark design and product reliability performance across the OCI infrastructure portfolio.
- Analyze field performance data, warranty trends, operational incidents, and failure modes to identify systemic improvement opportunities.
- Establish data-driven processes to recommend and implement design, component, or supplier changes that improve quality, reliability, and operational efficiency.
- Benchmark OCI performance against hyperscale and industry best practices.
Cross-Functional Partnership
- Partner with Infrastructure Engineering, Capacity Delivery, Operations, Supply Chain, and Product teams to ensure reliability objectives are embedded throughout the lifecycle.
- Influence strategic technology and supplier selection decisions using quality and reliability data.
- Provide executive-level reporting on reliability performance, risks, and improvement initiatives.
Internal Responsibilities
Qualifications
Required Experience
- 15+ years of experience in quality, reliability engineering, critical infrastructure, manufacturing quality, or related technical leadership roles.
- 7+ years leading large-scale engineering or quality organizations.
- Experience building or transforming quality and reliability programs in hyperscale infrastructure, cloud, semiconductor, power systems, telecom, or mission-critical environments.
- Deep expertise in reliability engineering methodologies and statistical analysis techniques.
- Proven experience with supplier quality management and complex hardware ecosystems.
- Strong understanding of critical infrastructure systems including power distribution, cooling, controls, mechanical, and electrical systems.
Preferred Qualifications
- Experience in hyperscale data center infrastructure or cloud infrastructure environments.
- Familiarity with AFR, IDR, MTBF, and reliability growth methodologies.
- Experience with GW-scale infrastructure deployment programs.
- Demonstrated success driving measurable reliability improvements across large operational fleets.
- Advanced degree in Engineering, Reliability Engineering, Mechanical Engineering, Electrical Engineering, or related field preferred.
Leadership Characteristics
- Strategic builder with the ability to create new functions and scale organizations.
- Data-driven decision maker with strong analytical rigor.
- Strong executive communication and influence skills.
- Bias for action and operational excellence.
- Collaborative leader capable of driving alignment across engineering, operations, and suppliers.
- Passion for quality, reliability, and continuous improvement at hyperscale.
Impact
This role will directly shape the reliability foundation of OCI’s next-generation infrastructure platform. The Sr. Director will establish the systems, standards, and culture that ensure OCI data center designs and infrastructure products achieve exceptional quality, operational resilience, and lifecycle performance at unprecedented scale.
External Responsibilities
Qualifications
Required Experience
- 15+ years of experience in quality, reliability engineering, critical infrastructure, manufacturing quality, or related technical leadership roles.
- 7+ years leading large-scale engineering or quality organizations.
- Experience building or transforming quality and reliability programs in hyperscale infrastructure, cloud, semiconductor, power systems, telecom, or mission-critical environments.
- Deep expertise in reliability engineering methodologies and statistical analysis techniques.
- Proven experience with supplier quality management and complex hardware ecosystems.
- Strong understanding of critical infrastructure systems including power distribution, cooling, controls, mechanical, and electrical systems.
Preferred Qualifications
- Experience in hyperscale data center infrastructure or cloud infrastructure environments.
- Familiarity with AFR, IDR, MTBF, and reliability growth methodologies.
- Experience with GW-scale infrastructure deployment programs.
- Demonstrated success driving measurable reliability improvements across large operational fleets.
- Advanced degree in Engineering, Reliability Engineering, Mechanical Engineering, Electrical Engineering, or related field preferred.
Leadership Characteristics
- Strategic builder with the ability to create new functions and scale organizations.
- Data-driven decision maker with strong analytical rigor.
- Strong executive communication and influence skills.
- Bias for action and operational excellence.
- Collaborative leader capable of driving alignment across engineering, operations, and suppliers.
- Passion for quality, reliability, and continuous improvement at hyperscale.
Impact
This role will directly shape the reliability foundation of OCI’s next-generation infrastructure platform. The Sr. Director will establish the systems, standards, and culture that ensure OCI data center designs and infrastructure products achieve exceptional quality, operational resilience, and lifecycle performance at unprecedented scale.