About this role:
Wells Fargo is seeking a Generative AI Senior Software Engineer for Cloud and LLM API Systems within Digital Technology – AI Capability Engineering to design, build, and operate the platform's poly-cloud foundation across GCP, Azure, and on-prem OpenShift (OCP). This hands-on platform engineer will cover landing zones, network/IAM, secure perimeter patterns (e.g., VPC-SC/Private Service Connect), infrastructure-as-code provisioning, platform services, observability, DR/BCP, content security, and capacity planning to support Gen AI Studio, APIs, Guardrails, and agent workloads. The role is full stack as well as infrastructure engineering: you will build automation services and UI experiences that enable onboarding, visibility, and operational workflows across environments. The role requires strong Kubernetes fundamentals (preferably GKE) and hands-on knowledge of GenAI concepts to support state-of-the-art platform delivery.
In this role, you will:
Lead moderately complex initiatives and deliverables within technical domain environments
Contribute to large-scale planning of strategies
Design, code, test, debug, and document for projects and programs associated with the technology domain, including upgrades and deployments
Review moderately complex technical challenges that require an in-depth evaluation of technologies and procedures
Resolve moderately complex issues and lead a team to meet existing clients' needs or potential new clients' needs while leveraging a solid understanding of the function, policies, procedures, or compliance requirements
Collaborate and consult with peers, colleagues, and mid-level managers to resolve technical challenges and achieve goals
Lead projects, act as an escalation point, and provide guidance and direction to less experienced staff
Required Qualifications:
4+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
Desired Qualifications:
Hands-on experience engineering cloud landing zones (projects/subscriptions, org policies, VPCs/VNets, firewalls, service perimeters) and documenting control plane vs. compute plane topology patterns
Experience provisioning resources in GCP and Azure using Terraform (IaC), including secrets integration patterns with HashiCorp Vault
Experience implementing secure hybrid connectivity (peering/PSC, DNS, egress) between on-prem OCP and cloud environments to support API calls to model endpoints and internal services
Strong Kubernetes knowledge (preferably GKE) and experience hosting applications on Kubernetes platforms (GKE/OCP or similar)
Experience establishing observability baselines and dashboards for runtime inferencing paths (latency, error rate, tokens/sec, TTFB) and operating SLO views
Python experience building automation, platform services, and tooling to support provisioning, operations, and developer workflows
UI skills: experience building internal portals/dashboards for onboarding, operational visibility, and workflow execution (developer/ops experiences)
GenAI experience: understanding of GenAI concepts (LLMs, RAG, LLM architecture) to support day-to-day platform design decisions and delivery