Incident & Problem Manager

BlackRock, Inc.

Edinburgh, United Kingdom

yesterday

Role details

Contract type

Internship / Graduate position

Employment type

Full-time (> 32 hours)

Working hours

Shift work

Languages

English

Job location

Remote

Edinburgh, United Kingdom

Tech stack

Artificial Intelligence

Data analysis

Azure

Software as a Service

Configuration Management

Computer Programming

Databases

DevOps

Infrastructure as a Service (IaaS)

Python

Platform as a Service (PAAS)

Power BI

Ansible

Tableau

Datadog

Microsoft Power Automate

Grafana

Data Analytics

Virtual Agents

Splunk

ServiceNow

Job description

The Returner Programme is designed to encourage and support a return to full-time work through a six-month paid internship starting in September 2025. The returner programme acts as a feeder to full-time permanent employment (subject to individual performance and market conditions).

How we will support you:

They will offer a variety of employee networks to help you integrate into the business and culture
Provide a mentor and buddy for your day-to-day questions
Provide coaching and peer support, The Service Management team provides industry-standard Incident, Problem and Change Management, alongside infrastructure operational support for Aladdin. We operate using modern engineering practices and tooling, including ServiceNow and AI-enabled workflows, and measure outcomes through clear operational metrics.

BlackRock operates a 24/7 Major Incident Management function supporting global clients across Europe, the Americas, Asia Pacific and India. This role is based in Edinburgh and is required to cover core European hours between 09:00 and 17:00.

Role

We are seeking an individual for a role of Problem Manager with a strong passion for technical troubleshooting.

This role exists to prevent the recurrence of incidents. A Problem Manager is accountable for identifying, owning, and eliminating the root causes of incidents to prevent recurrence and improve service stability

The role partners closely with Engineering and SRE / DevOps teams, leveraging automation, observability tooling and emerging AI-driven insights. The successful candidate will have a DevOps mindset, be able to actively troubleshoot, and utilise and enhance AI and automation.

The role also includes participation in continuous improvement initiatives aimed at improving the stability, performance and resilience of the Aladdin platform, and enhancing Service Management services.

K ey Responsibilities

Manage a robust Problem Management process to prevent incident recurrence

Ensure root causes and preventative actions are clearly captured and translated into an effective Problem Management process.
Identifyincident trends and repeat patterns, driving scalable remediation to reduce recurrence
Partner with Engineering and SRE / DevOps to embed learnings into automation, observability,runbooksand readiness controls.
Design, build and activelymaintaina Known Error Database that functions as a real-time operational asset.
Work with product teams to design, build and deliver a meaningful process for addressing repeat incidents.

Deliver executive-grade communications (AI-drafted, human-approved)

Validate,approveand issue regular communications that are concise,informativeandappropriate forstakeholders.
Ensure communications accurately reflect impact, mitigation progress, keyrisksand confidence-based ETAs.

Drive continuous service improvement and regulatory alignment

Drive process and tooling changes that support operational resilience and regulatory requirements, including DORA and GDPR, where applicable.
Provide input and ownership for continual service improvement initiatives, with a primary focus on Agentic AI and its application to Incident Management.

Creation of Reports to communicate Problem progress

Requirements

A career break of at least 24 months (no upper limit). During your career break you may have been working small-scale around other commitments or have not done any paid work.
Prior experience in Investment Management or Financial Services
Must be authorised to work in the UK, * Experience in Incident/Problem Management within a production environment supporting business-critical platforms.
Proven ability to lead multiple simultaneousproblemsand drive structured recovery under pressure.
DevOps mindset, with comfort using observability tooling,automationand operational engineering practices.
Ability to produce clear, high-quality communications suitable for senior stakeholders.
Experience operating AI systems for triage,correlationand narrative generation, with sound judgement on when outputs require validation or override.
Ability to translate repetitive incident activity into automation requirements and drive adoption with engineering partners.
Experience working in or with FinTech or regulated environments.
Knowledge of cloud platforms such as Azure and/or AWSand understanding of IaaS / PaaS / SaaS service models.
Experience with Microsoft Copilot and AI-enabled productivity tooling.
Programming capability (e.g. Python) to automate common tasks or prototype improvements.
Familiarity with configuration management,deploymentand orchestration tooling (e.g. Ansible).
Strong data analysis skills using tools such as Splunk, Grafana, Tableau,Exceland/or Power BI.
Strong experience with ServiceNow and operational reporting.

Benefits & conditions

To help you stay energized, engaged and inspired, we offer a wide range of employee benefits including: retirement investment and tools designed to help you in building a sound financial future; access to education reimbursement; comprehensive resources to support your physical health and emotional well-being; family support programs; and Flexible Time Off (FTO) so you can relax, recharge and be there for the people you care about.

About the company

BlackRock's hybrid work model is designed to enable a culture of collaboration and apprenticeship that enriches the experience of our employees, while supporting flexibility for all. Employees are currently required to work at least 4 days in the office per week, with the flexibility to work from home 1 day a week. Some business groups may require more time in the office due to their roles and responsibilities. We remain focused on increasing the impactful moments that arise when we work together in person - aligned with our commitment to performance and innovation. As a new joiner, you can count on this hybrid model to accelerate your learning and onboarding experience here at BlackRock. About BlackRock At BlackRock, we are all connected by one mission: to help more and more people experience financial well-being. Our clients, and the people they serve, are saving for retirement, paying for their children's educations, buying homes and starting businesses. Their investments also help to strengthen the global economy: support businesses small and large; finance infrastructure projects that connect and power cities; and facilitate innovations that drive progress. This mission would not be possible without our smartest investment - the one we make in our employees. It's why we're dedicated to creating an environment where our colleagues feel welcomed, valued and supported with networks, benefits and development opportunities to help them thrive.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all