What does a great Senior Cloud Reliability Engineer do?
In this role, you will be instrumental in building and maintaining the technology that powers our core services, with a particular focus on applications running in the Microsoft Azure Cloud. Your contributions will directly impact the success of companies worldwide. Our Technology team comprises experts dedicated to evaluating and enhancing current systems while innovating for the future.
You will leverage your analytical and troubleshooting skills to ensure seamless project participation, maintain support continuity, and handle rotating on-call escalations. Your responsibilities will include leading the detection and resolution of issues affecting the delivery of industry-leading Debit Web and Card API solutions. Teamwork and creativity are crucial in this role, as you will be the go-to point for escalations, tasked with resolving cardholder-impacting issues in a dynamic and fast-paced environment.
Essential Role Responsibilities:
• Provide hands-on support for existing environments, including software installation, patch installation, upgrades, query writing, configuration, security, system monitoring and tuning, disaster recovery planning, and release deployments.
• Collaborate with client services to understand customer needs, provide technical support, and ensure seamless integration and operation of our solutions, enhancing overall client satisfaction and experience.
• Implement tools and automation for build, configuration management, continuous integration (CI), deployment, and application monitoring.
• Automate and evolve infrastructure, deployment strategies and testing to support a quick turnaround of deployments.
• Work closely with Engineering to ensure all relevant KPIs are implemented within the monitoring framework.
• Participate in all Production Support activities during incidents and outages, serving as a hands-on technical resource capable of resolving all technical issues within lower and upper environments and making recommendations for performance and capacity improvements.
• Participate in capacity planning, stability tuning, provisioning, performance, and scaling of the application infrastructure.
• Demonstrate a desire to resolve issues quickly in a 24x7 environment while minimizing impact.
Basic Qualifications for Consideration:
• Bachelor’s degree required; relevant, equivalent work experience may be substituted for degree requirement
• 5+ years of hands-on experience working with Cloud technologies (Azure or AWS)
• 5+ years of experience with Kubernetes platforms such as Azure Red Hat OpenShift (ARO), Azure Kubernetes Service (AKS), or Amazon Elastic Kubernetes Service (EKS)
• Knowledge of Apigee or other API Management Platforms
• Experience working with third-party vendors
• Able to work effectively, both independently and as a member of a cross-functional team
• Demonstrate a desire to automate as much as possible
• Able to participate in on-call rotation
Preferred Qualifications:
• Containerization technologies such as Docker or Podman
• Continuous Integration / Continuous Delivery tools: Azure DevOps, Tekton
• Linux system administration – ability to manage and troubleshoot Linux systems and services, bash scripting
• Understanding of routing and networking concepts
• Experience working in an Agile development environment
• Collaboration platforms such as JIRA, Confluence, Wiki, ServiceNow
R-10358305