Responsibilities

Leverage PCM Tools capabilities to proactively detect anomalies, optimize performance, and support root cause analysis.
Collaborate with application and infrastructure teams to integrate PCM Tools across services and environments.
Develop Python/Salt scripts to automate monitoring tasks, data extraction, and alerting workflows.
Integrate monitoring tools with CI/CD pipelines and ITSM platforms for streamlined operations.
Work with product owners, scrum masters, developers, and testing teams (both cross‑functional and multi‑country) to perform gap analysis, observability assessments, and validation of monitoring requirements.
Collaborate with multiple technology groups and vendors to ensure that the applications, integrations, infrastructure, and security architectures are designed to meet evolving business requirements.
Ensure that deliverables meet standards for reliability, scalability, performance, and availability, and align with the Bank’s technology roadmap.
Propose technical solutions and strategies for major applications and technology initiatives, aligning them to the technology roadmap to support GTEP’s digital plan.
Work closely with other engineering departments on production releases and facilitate the resolution of impacts on projects and enhancements.
Create process lifecycle documentation (guides, KB articles, incident playbooks) for Monitoring Tools, including an end‑to‑end process map.
Independently investigate ad‑hoc issues, propose options, and drive issue resolution.
Communicate release status effectively and escalate impediments and risks appropriately.
Coordinate and run planned and unplanned production implementations.
Contribute to a collaborative team environment through information sharing and team cooperation.
Be accountable for the execution of day‑to‑day project and task‑oriented work, and for meeting project expectations for established time, cost, and specification definitions.
Be accountable for assembling and implementing project plans, including release plans.
Provide regular, standardized reporting on team performance: velocity, stories completed, and project status.
Build and maintain solid, professional working relationships with peers within the project management and business lines.
Meet scheduled milestones to ensure project and program objectives are met in a timely manner.
Work with the team to identify roadmap and feature dependencies and impediments.
Be accountable for tracking dependencies and impediments and removing them to enable the team: surface, track, elevate.
Communicate regularly with other scrum teams to ensure tactical story alignment, and identify and track dependencies between teams.
Ensure the completion of required forms and processes for Bank‑wide release management, including the OR/NIRA process.
Understand how the Bank’s risk appetite and risk culture should be considered in day‑to‑day activities and decisions.

Education / Experience / Other Information

3+ years of experience with performance monitoring tools (as end‑user, administrator, or technical support).
Familiarity with observability, network concepts, cloud solutions, SDLC, Continuous Integration, and Continuous Delivery (Scotiabank experience considered a plus).
3 years of experience with Dynatrace.
Understanding of SRE concepts and how observability tools can contribute to the overall success of the resilience and reliability strategy at the Bank.
Basic proficiency in Python scripting for automation and data manipulation.
Familiarity with cloud platforms (Azure, GCP) and containerized environments (Kubernetes, Docker).
Basic knowledge of project management tools and methodologies.
Ideally, you have an Engineering or Computer Science degree and pride yourself on your analysis, logical thinking, and problem‑solving skills.

Working Conditions

Work in a standard office‑based environment; non‑standard hours are a common occurrence.
Systems Reliability Engineer
SCOTIATECH
Bogotá, Bogotá
Posted 17 days ago