IT - Network Engineer
Role details
Job location
Tech stack
Job description
We are seeking a motivated Recovery Engineer / Analyst to join a Production Services team for a large financial client. This role is for a hands-on technical professional who performs well during high-severity production incidents, enjoys problem-solving, and is interested in long-term growth within the organization. You will work closely with senior recovery managers and various technical teams across the enterprise., * Participate in major and critical incident bridges, assisting with triage, diagnostics, and recovery activities.
- Gather and analyze logs, metrics, and alerts to support rapid issue identification.
- Assist in identifying the impacted service, symptoms, and contributing factors during incidents.
- Perform initial analysis using available diagnostics, observability tools, and documentation.
- Support post-incident reviews and root cause analysis (RCA) efforts by collecting data and timelines.
- Create, update, and validate runbooks, standard operating procedures (SOPs), and recovery playbooks based on incident learnings.
- Analyze incident trends to identify areas for operational improvement.
- Learn and apply SRE and reliability principles under the guidance of senior teammates.
Requirements
Experience: Experience in production support, operations, NOC, SRE, DevOps, or application support roles is required., * Candidates must have a working knowledge of application, infrastructure, or cloud environments, as well as logs, monitoring, alerts, and basic diagnostics.
- The ability to read and follow technical runbooks and SOPs is essential.
- Familiarity with observability tools such as APM, logging platforms, and metrics dashboards.
- Scripting or development exposure, including PowerShell, Python, or Bash.
Core Competencies:
- Strong problem-solving and analytical skills.
- Ability to remain calm and organized during high-severity incidents.
- Clear verbal and written communication skills.
- A willingness to learn from senior engineers and a strong sense of ownership.
- Ability to work effectively across teams.