DevOps Engineer, Lead

Toyota • Plano, Texas, United States of America • 11h ago

Overview

Who we are

Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world’s most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We’re looking for diverse, talented team members who want to Dream. Do. Grow. with us.

An important part of the Toyota family is Toyota Financial Services (TFS), the finance and insurance brand for Toyota and Lexus in North America. While TFS is a separate business entity, it is an essential part of this world-changing company- delivering on Toyota's vision to move people beyond what's possible. At TFS, you will help create best-in-class customer experience in an innovative, collaborative environment.

Job Summary:

The primary responsibility of this role is to help modernize and implement DevOps methodologies to fully shift from a hybrid on-premises and cloud organization today to a fully cloud-centric organization with DevSecOps, data ops, automated delivery via CICD, and automated preventive controls. Reporting to the DevOps Engineering Manager, the person in this role will support the organization’s objective to ensure safe and reliable software deployments and measuring application service performance and availability using Infrastructure-as-Code, and Continuous Integration / Continuous Delivery Pipelines to handle the full application lifecycle. You will also lead the development of Intelligent Automation solutions to streamline our operations. The ideal candidate will have experience with Site Reliability Engineering (SRE) principles, AWS, Snowflake, and automation tools.

What you’ll be doing

Manage day-to-day support activities, including L3 support, releases, and infrastructure provisioning.
Develop and implement DevSecOps, DevOps, and data ops best practices, including test automation.
Build and implement best-in-class CI/CD capabilities to automate data pipeline delivery.
Collaborate with platform teams to integrate tooling into existing pipelines.
Drive adoption and alignment of DevOps and cloud engineering practices across the Data Engineering team.
Partner with Risk Management and Security teams to ensure secured and compliant cloud infrastructure and services.
Lead the design, implementation, and maintenance of cloud-based infrastructure on AWS.
Drive the process and approach our Data Engineering team use to document sensitive, protected, and shared data to ensure compliance with appropriate information and data governance policies (GDPR, CCPA, SOX, etc.).
Performing site reliability engineering development efforts to improve availability and performance of software systems (debugging, triaging and identifying root cause for failure in a production environment and performing postmortem analysis).
Defining Standard Operating Procedures (SOPs) and Runbook for troubleshooting production issues; developing software operations resilience patterns for deployed software infrastructure and implementing highly available and resilient software systems

Technical Leadership

Lead the design, implementation, and maintenance of cloud-based infrastructure on AWS
Develop and implement SRE principles to ensure high availability, scalability, and security
Collaborate with cross-functional teams to identify and prioritize project requirements
Provide technical guidance and mentorship to junior team members

Intelligent Automation

Design and develop Intelligent Automation solutions to streamline operations
Implement automation tools such as Ansible, Terraform, or CloudFormation
Collaborate with stakeholders to identify areas for automation and process improvement

Snowflake and Data Engineering

Collaborate with data engineering teams to design and implement data pipelines on Snowflake
Ensure data security, governance, and compliance with regulatory requirements
Optimize data storage and query performance on Snowflake

Site Reliability Engineering (SRE)

Implement SRE principles to ensure high availability, scalability, and security
Develop and implement monitoring, logging, and alerting solutions
Collaborate with teams to identify and resolve incidents and outages

Requirements

Technical Requirements

8+ years of experience in DevOps, SRE, or a related field
Strong experience with AWS, including EC2, S3, Lambda, and CloudWatch
Experience with Snowflake and data engineering principles
Strong experience with automation tools such as Ansible, Terraform, or CloudFormation
Experience with SRE principles and practices
Strong programming skills in languages such as Python, Java, or C++

Soft Skills

Strong leadership and communication skills
Ability to collaborate with cross-functional teams
Strong problem-solving skills and attention to detail
Ability to adapt to changing priorities and requirements

Nice to Have

Experience with containerization using Docker or Kubernetes
Experience with CI/CD pipelines using Jenkins, and GitLab
Experience with monitoring and logging tools such as Prometheus, Grafana, or ELK
Experience with agile development methodologies such as Scrum or Kanban

Belonging at Toyota

Our success begins and ends with our people. We embrace diverse perspectives and value unique human experiences. Respect for all is our North Star. Toyota is proud to have 10+ different Business Partnering Groups across 100 different North American chapter locations that support team members’ efforts to dream, do and grow without questioning that they belong.

Applicants for our positions are considered without regard to race, ethnicity, national origin, sex, sexual orientation, gender identity or expression, age, disability, religion, military or veteran status, or any other characteristics protected by law.

Have a question, need assistance with your application or do you require any special accommodations? Please send an email to talent.acquisition@toyota.com.