Overview
Who we are
Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world’s most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We’re looking for diverse, talented team members who want to Dream. Do. Grow. with us.
An important part of the Toyota family is Toyota Financial Services (TFS), the finance and insurance brand for Toyota and Lexus in North America. While TFS is a separate business entity, it is an essential part of this world-changing company, delivering on Toyota's vision to move people beyond what's possible. At TFS, you will help create a best-in-class customer experience in an innovative, collaborative environment.
Job Summary:
The primary responsibility of this role is to help modernize and implement DevOps methodologies as the organization shifts from today’s hybrid on-premises and cloud environment to a fully cloud-centric one with DevSecOps, data ops, automated delivery via CI/CD, and automated preventive controls. Reporting to the DevOps Engineering Manager, the person in this role will support the organization’s objective to ensure safe and reliable software deployments and to measure application service performance and availability, using Infrastructure-as-Code and Continuous Integration / Continuous Delivery pipelines to handle the full application lifecycle. You will also lead the development of Intelligent Automation solutions to streamline our operations. The ideal candidate will have experience with Site Reliability Engineering (SRE) principles, AWS, Snowflake, and automation tools.
What you’ll be doing
- Manage day-to-day support activities, including L3 support, releases, and infrastructure provisioning.
- Develop and implement DevSecOps, DevOps, and data ops best practices, including test automation.
- Build and implement best-in-class CI/CD capabilities to automate data pipeline delivery.
- Collaborate with platform teams to integrate tooling into existing pipelines.
- Drive adoption and alignment of DevOps and cloud engineering practices across the Data Engineering team.
- Partner with Risk Management and Security teams to ensure secure and compliant cloud infrastructure and services.
- Drive the process and approach our Data Engineering team uses to document sensitive, protected, and shared data to ensure compliance with appropriate information and data governance policies (GDPR, CCPA, SOX, etc.).
- Perform site reliability engineering development work to improve the availability and performance of software systems, including debugging, triaging, and identifying the root cause of failures in a production environment and performing postmortem analysis.
- Define Standard Operating Procedures (SOPs) and runbooks for troubleshooting production issues, develop operational resilience patterns for deployed software infrastructure, and implement highly available and resilient software systems.
Technical Leadership
- Lead the design, implementation, and maintenance of cloud-based infrastructure on AWS
- Develop and implement SRE principles to ensure high availability, scalability, and security
- Collaborate with cross-functional teams to identify and prioritize project requirements
- Provide technical guidance and mentorship to junior team members
Intelligent Automation
- Design and develop Intelligent Automation solutions to streamline operations
- Implement automation tools such as Ansible, Terraform, or CloudFormation
- Collaborate with stakeholders to identify areas for automation and process improvement
Snowflake and Data Engineering
- Collaborate with data engineering teams to design and implement data pipelines on Snowflake
- Ensure data security, governance, and compliance with regulatory requirements
- Optimize data storage and query performance on Snowflake
Site Reliability Engineering (SRE)
- Implement SRE principles to ensure high availability, scalability, and security
- Develop and implement monitoring, logging, and alerting solutions
- Collaborate with teams to identify and resolve incidents and outages
Requirements
Technical Requirements
- 8+ years of experience in DevOps, SRE, or a related field
- Strong experience with AWS, including EC2, S3, Lambda, and CloudWatch
- Experience with Snowflake and data engineering principles
- Strong experience with automation tools such as Ansible, Terraform, or CloudFormation
- Experience with SRE principles and practices
- Strong programming skills in languages such as Python, Java, or C++
Soft Skills
- Strong leadership and communication skills
- Ability to collaborate with cross-functional teams
- Strong problem-solving skills and attention to detail
- Ability to adapt to changing priorities and requirements
Nice to Have
- Experience with containerization using Docker or Kubernetes
- Experience with CI/CD pipelines using Jenkins and GitLab
- Experience with monitoring and logging tools such as Prometheus, Grafana, or ELK
- Experience with agile development methodologies such as Scrum or Kanban
Belonging at Toyota
Our success begins and ends with our people. We embrace diverse perspectives and value unique human experiences. Respect for all is our North Star. Toyota is proud to have 10+ different Business Partnering Groups across 100 different North American chapter locations that support team members’ efforts to dream, do and grow without questioning that they belong.
Applicants for our positions are considered without regard to race, ethnicity, national origin, sex, sexual orientation, gender identity or expression, age, disability, religion, military or veteran status, or any other characteristics protected by law.
Have a question, need assistance with your application, or require any special accommodations? Please send an email to talent.acquisition@toyota.com.