Overview
We are seeking a Senior Site Reliability Engineer (SRE) to help develop our platform operations across Windows, Linux, and cloud-native environments. This role is central to our transformation from app-specific support to platform-wide reliability engineering. You will bring deep expertise in Google Cloud Platform (GCP), container orchestration, and automation, enabling scalable, secure, and resilient infrastructure that supports diverse applications across our enterprise.
Key Responsibilities
Platform Reliability & Cloud Engineering
- Ensure high availability, performance, and security of production systems across Windows, Linux, and GCP environments.
- Engineer and support containerized workloads using Kubernetes (GKE) and Docker, enabling scalable microservices architectures.
- Lead infrastructure provisioning and configuration using Terraform, Ansible, and GCP-native tools.
Automation & Observability
- Develop automation scripts and pipelines to eliminate manual toil and accelerate incident response.
- Implement observability frameworks using SLIs/SLOs, Prometheus, Grafana, and GCP Operations Suite.
- Drive proactive monitoring, alerting, and telemetry across hybrid environments.
Incident Management & Resilience
- Lead incident response, root cause analysis, and postmortems.
- Build self-healing systems and automated remediation workflows using GCP-native services and scripting.
Security & Compliance
- Collaborate with InfoSec to enforce hardening standards, manage vulnerabilities, and support compliance initiatives.
- Integrate security into CI/CD pipelines and container platforms using IAM, encryption, and policy enforcement.
Collaboration & Enablement
- Partner with developers, application owners, and infrastructure teams to deliver reliable, cloud-native platforms.
- Document configurations, runbooks, and operational procedures to enable cross-team reuse and transparency.
Required Qualifications:
- 4+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education.
- 4+ years of experience in Windows Server administration and production support.
- Strong scripting skills in PowerShell, Python, or Shell.
- Hands-on experience with GCP services, including GKE, IAM, Cloud Functions, and Cloud Monitoring.
- Proficiency in container technologies: Docker and Kubernetes.
- Familiarity with Linux system administration and hybrid cloud environments.
- Experience with infrastructure-as-code tools: Terraform, Ansible.
- Strong understanding of Active Directory, DNS, DHCP, and Windows security principles.
Desired Qualifications:
- Security certifications (e.g., CISSP, Security+, GCP Professional Cloud Security Engineer).
- Experience with CI/CD tools (e.g., GitLab CI and Jenkins).
- Familiarity with ITIL practices and change management.
- Exposure to ServiceNow, load balancers, certificate management, and endpoint protection tools.
Job Expectations:
- Ability to work on-site in one of the listed locations in a hybrid environment
- Ability to work outside of normal business hours including nights and weekends on a limited/rotational basis
- We are not considering candidates who require visa sponsorship
Posting End Date: 31 Oct 2025
*Job posting may come down early due to volume of applicants.
We Value Equal Opportunity
Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.
Employees support our focus on building strong customer relationships balanced with a strong risk-mitigating and compliance-driven culture, which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit's risk appetite and all risk and compliance program requirements.
Candidates applying to job openings posted in Canada: Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process.
Applicants with Disabilities
To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.
Drug and Alcohol Policy
Wells Fargo maintains a drug-free workplace. Please see our Drug and Alcohol Policy to learn more.
Wells Fargo Recruitment and Hiring Requirements:
a. Third-party recordings are prohibited unless authorized by Wells Fargo.
b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.