About your role:
We are seeking a highly skilled and experienced Senior Performance & Site Reliability Engineer to join our team and take a leadership role in ensuring the performance, reliability, and scalability of our production systems. This role is ideal for a seasoned professional who thrives in dynamic environments, possesses deep technical expertise across multiple layers of the stack, and is passionate about delivering exceptional client experiences through robust and high-performing systems.
This hybrid position combines the disciplines of performance engineering and site reliability engineering, with a strong focus on production environments. The successful candidate will proactively identify and resolve performance bottlenecks, recommend and implement system optimizations, and lead incident response efforts with speed and precision. Additionally, this role will be instrumental in evaluating infrastructure readiness during client onboarding and forecasting capacity needs to support future growth.
What you’ll do:
Production Performance Optimization
- Conduct comprehensive performance assessments of production systems, identifying inefficiencies across application code, JVM configurations, database queries, and infrastructure components
- Optimize SQL queries and apply strategic indexing to enhance database responsiveness; tune JVM memory to reduce garbage collection overhead and improve application stability; and implement application-level enhancements to support scalability and reduce latency
- Collaborate with development, QA, and infrastructure teams to ensure performance improvements are effectively integrated into deployment pipelines and release cycles
Monitoring, Observability & Diagnostics
- Use a robust suite of observability and diagnostic tools (Splunk, Dynatrace, ExtraHop, Foglight, and Wireshark) to monitor system health, detect anomalies, and perform deep root cause analysis
- Develop and maintain advanced dashboards, alerts, and automated reporting mechanisms to ensure proactive detection and resolution of performance issues
Client Onboarding & Infrastructure Planning
- Participate in client onboarding processes by reviewing anticipated usage patterns and evaluating the impact on existing infrastructure
- Perform capacity planning and forecasting to ensure infrastructure readiness and scalability in alignment with client growth and business objectives
- Collaborate with infrastructure and operations teams to provision new hardware, optimize resource allocation, and maintain high availability and performance standards
Incident Response & Reliability Engineering
- Act as a senior technical lead during production incidents, driving rapid triage, root cause identification, and resolution efforts; lead post-incident analysis to identify systemic issues and implement long-term solutions that enhance system resilience and prevent recurrence
Experience you’ll need to have:
- 7+ years of hands-on experience in performance engineering, site reliability engineering, or a closely related technical discipline
- Advanced proficiency in SQL and database performance tuning (e.g., Oracle, PostgreSQL, MySQL)
- Deep understanding of JVM internals, memory management, and garbage collection strategies
- Strong programming skills in Java and scripting languages such as Python or Bash
- Familiarity with cloud platforms (AWS, Azure, GCP) and container orchestration technologies (e.g., Kubernetes, Docker)
- Demonstrated experience with Splunk, Dynatrace, ExtraHop, Foglight, and Wireshark in production environments
- Exceptional analytical, problem-solving, communication and collaboration skills, with the ability to work effectively across cross-functional teams
- Bachelor’s degree in Computer Science, Engineering, or a related field
Experience that would be great to have:
- Experience in infrastructure capacity planning and forecasting
- Background in incident management, postmortem analysis, and continuous improvement initiatives
- Relevant certifications in cloud technologies, performance engineering, or SRE practices (e.g., Google SRE, AWS Certified Solutions Architect)
How you’ll work:
- Fiserv emphasizes in-person collaboration to help you grow your career while shaping the future of fintech; this role is on-site Monday through Friday
Sponsorship:
- You must currently possess valid and unrestricted U.S. work authorization to be considered for this role. Individuals with temporary visas including, but not limited to, F-1 (OPT, CPT, STEM), H-1B, H-2, or TN, or any candidate requiring sponsorship, now or in the future, will not be considered for this role.
R-10360880