Title: System Reliability Engineer Specialist Requisition ID: Employee Referral Program – Potential Reward: $400,000.00 We are committed to investing in our employees and helping you continue your career at ScotiaTech. Purpose Combining the competencies of DevOps, Systems Administration, and Cloud Engineering, the role of System Reliability Engineer Specialist provides the opportunity to combine your technical ability, strategic thinking and detail‑oriented execution in a fast‑paced, dynamic environment. You will join a team with the purpose of constantly improving the reliability of our systems through continuous improvements to running infrastructure and applications. You will work with application teams to deliver continuous improvements to applications and support the transformation of our approach to both operations and development. You will work with the Cloud Engineering team to design and implement tools and processes that monitor and respond to the state of our systems. Accountabilities Managing the reliability of critical infrastructure platforms on our Public Cloud Platforms Improve and maintain site availability, scalability, service and system performance Investigate system errors and problems, bottleneck analysis for the systems we support (PEGA, Atlassian, Azure, GCP, Kubernetes, etc.) Provide solutions for performance management, disaster recovery, monitoring and access management Participate in solution design sessions Participate in planning and retrospective sessions, attending stand‑ups, etc. Build and operate highly available and scalable software and infrastructure. Supporting application teams on the use of the platform including providing guidance on design patterns, best practices, and security considerations. Education / Experience / Other Information Skills YOUR BACKGROUND AND SKILLS INCLUDE: A self‑starter with a strong sense of personal accountability and team responsibility 3-5 years of experience working in large enterprises Ability to work on ambiguous and complex problems Experience using Dynatrace and log analysis Solid verbal and written communication skills in English B2 Relevant working experience, System Administration, and/or Enterprise Operations skills Experience designing and implementing tasks in Continuous Integration systems (Jenkins, GitHub, Terraform.) OS Experience (RHEL 9.X and Windows 2K22 and above) Experience support GCP and/or Azure Experience with Kubernetes (GKE/AKS) Experience in Incident management, Change Management, troubleshooting, and root cause analysis in production environments You have strong knowledge of Agile & Lean methodologies for requirements / design methodology - detail oriented, analytical and capable of investigating complex / technical issues and provide alternative solutions, project & production methodologies. Experience in a modern technology stack. Knowledge of software design patterns, infrastructure architecture, DevOps, or security considerations. Understanding of software release process (environments, binary repositories, CI/CD). Attention to details, high standards for quality. GiTOps Experience Working Conditions Work in a standard office‑based environment; shift rotations. Limited international travel may be required. Location(s): Colombia : Bogota : Bogota #J-18808-Ljbffr
System Reliability Engineer Specialist
SCOTIABANK
bogotá, bogotá
Publicado hace 14 días
Denunciar empleo