What does a successful Site Reliability Engineer do at Fiserv?
A successful Site Reliability Engineer at Fiserv leverages strong coding and automation skills to maintain system reliability. Work hands-on with deployed systems, using software development practices to ensure availability, performance, and efficiency through automation. They design and implement tools, processes, and systems to enhance the reliability, scalability, and performance of large-scale applications, with a focus on Dynatrace and Splunk.
What you will do:
- Automate operational tasks and health checks to create sustainable systems and services.
- Monitor the production environment using Dynatrace and Splunk, creating dashboards for alerts and system health.
- Map business processes to identify reliability gaps and analyze performance metrics.
- Collaborate in system design consulting, platform management, and capacity planning.
- Create and maintain detailed architectural documentation, including SOPs and infrastructure maps.
What you will need to have:
- 5+ years of experience in Site Reliability Engineering (SRE) within a Fintech or product organization.
- 4+ years of experience with automation tools like Python, Java, Ansible, or PowerShell.
- 4+ years of experience with observability and monitoring tools, specifically Dynatrace and Splunk.
- Bachelor's degree in computer science or related technical field and/or 7+ years of relevant work experience.
What would be great to have:
- Experience managing CI/CD pipelines and automation tools like GitLab, Harness, Nexus, Terraform, or SonarQube.
- Strong problem-solving skills for root cause analysis and proactive solution implementation.
- Effective communication skills for collaboration with cross-functional teams and customer interactions.
R-10361875