Remote: Reporting locations are where work can or needs to be performed and where collaboration should happen. A person who lives within 50 miles of such a location is expected to come in three times a week. A person who does not live within 50 miles of any of those locations does not need to report in.
GM DOES NOT PROVIDE IMMIGRATION-RELATED SPONSORSHIP FOR THIS ROLE. DO NOT APPLY FOR THIS ROLE IF YOU WILL NEED GM IMMIGRATION SPONSORSHIP (e.g., H-1B, TN, STEM OPT, etc.) NOW OR IN THE FUTURE.
GM’s Infrastructure Engineering organization seeks a Technical Program Manager (TPM) to lead strategy, planning, execution, and reporting for cloud capacity and performance engineering projects spanning multiple teams.
This role focuses on shaping the Autonomous Vehicle (AV) cloud infrastructure strategy, advising on budget-impact decisions, and delivering expert guidance on capacity planning and performance to a broad group of stakeholders. You will drive both strategic and hands-on programs, collaborating closely with executives and technical leaders to ensure projects are executed smoothly with clear, measurable outcomes.
We are seeking a TPM who demonstrates strength in three key areas:
- Technical Expertise: Strong background in cloud infrastructure platforms, with a focus on cost optimization and utilization improvements that support AI/ML workloads and tools at scale in the cloud
- Program Management: Hands-on experience directing cross-functional, highly technical programs while tracking and reporting on progress, resourcing requirements, and risks
- Communication: Ability to clearly convey complex technical concepts to diverse audiences, building trust and ensuring understanding among stakeholders
What you’ll be doing:
- Lead the planning, tracking, and reporting of engineering execution across an organization of 200+ engineers
- Drive cloud cost efficiency and capacity planning initiatives by partnering with teams across AV engineering, infrastructure, product, and finance
- Own cross-departmental technical programs that ensure cloud resource provisioning, deployment milestones, and ongoing performance are aligned to both business needs and budgetary objectives
- Develop, communicate, and execute quarterly and annual roadmaps for cloud infrastructure, prioritizing cost optimization, resource efficiency, and resolving performance bottlenecks
- Continually improve processes for proactive capacity management, including regular forecasting, budget vs. actual reporting, usage metrics instrumentation, and showback/chargeback systems
- Standardize and publish engineering KPIs for cloud efficiency and cost across multiple levels (platform, cluster, job), and drive adoption of new tools and dashboards for stakeholders
- Assess and mitigate risks related to cloud scalability, system performance limitations, and cost overages; manage responses to abnormal usage patterns and support major disaster recovery (DR) scenarios
- Collaborate with Site Reliability Engineering (SRE) and other groups to identify, measure, and communicate areas of inefficiency, and drive targeted improvements
- Serve as a primary technical point of contact for capacity and performance engineering (CPE) programs, translating technical requirements into actionable plans, tracking dependencies, and ensuring executive-level transparency for progress, risks, and results