AgentOps Engineer

DNB
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Azure
Cloud Engineering
Continuous Integration
DevOps
Distributed Systems
GitHub
Python
Operational Databases
Reliability Engineering
Management of Software Versions
Datadog
Data Logging
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Machine Learning Operations
Terraform
Docker

Job description

  • Design and operate agentic AI solutions end-to-end in production: deployment, monitoring, incident handling, and recovery
  • Design and operate secure, scalable cloud infrastructure for agentic AI across AWS/Azure (Terraform, containers, etc.)
  • Build observability for AI agents, covering traces, tool calls, reasoning steps, latency, cost, quality metrics, and failure modes
  • Establish continuous feedback loops using production data, automated evaluation and user signals
  • Implement CI/CD for models, agents, prompts, tools, and configurations, with gated promotions and safe rollbacks
  • Manage versioning and control the rollout of agents, including validated model upgrades
  • Diagnose and resolve AI-specific production issues such as hallucinations, tool failures, and latency and cost anomalies
  • Contribute to shared AgentOps standards, runbooks, and ways of working

Requirements

  • Relevant background as a DevOps, platform, or MLOps engineer
  • Proficiency in automation using Python and scripting languages
  • Experience with cloud platforms (Azure/AWS/GCP), CI/CD, Docker, and similar tooling
  • Understanding of reliability engineering, incident management, and post-mortems
  • Experience with observability tooling (OpenTelemetry, logging, metrics, tracing)
  • Understanding of distributed systems and system design
  • Experience deploying and managing agents, evaluations, knowledge bases, etc.
  • Familiarity with agent frameworks (LangGraph, CrewAI, Strands, and similar)

Tech stack example: Python, Strands, AWS AgentCore, AWS Bedrock, MCP, MLflow, OTel, Docker, GitHub Actions

What you bring

You take a role-model approach to AI-first operations, with a focus on observability and continuous improvement. You have an operational mindset and care about reliability, failure modes, and recoverability. You communicate and collaborate effectively across roles and teams, and you take true ownership of the team's goals.

Benefits & conditions

You'll work on challenging and meaningful tasks in a strong engineering culture with solid opportunities for professional growth and career development. We offer attractive pension and insurance schemes, as well as employee benefits on DNB's products. You'll also have access to company cabins across Norway, sports, cultural and social activities, and a wide range of employee discounts. We support flexibility in everyday work through flexible working hours, a hybrid way of working, extra days off, and reduced working hours from May to August (summertime).

About the company

People are the very DNA of DNB. Since 1822, bright minds have worked together to find the best solutions for our customers. Today, DNB is much more than Norway's largest bank: we are a technology-driven financial institution that continuously connects people and ideas to knowledge and capital in new ways. Diversity is part of who we are, and inclusion is something we actively choose every single day. We promise to do our best to make you feel at home. A job at Norway's largest financial group offers professional challenges in an exciting work environment with many opportunities for development.

AI Tech is DNB's new division within Technology & Services, created to accelerate our shift from AI experimentation to real, measurable impact. We bring together deep technical expertise, modern AI platforms, and hands-on delivery to scale agentic AI across the group. We move fast, learn fast, and deliver real outcomes. We expect every member of AI Tech to be a role model in the everyday use of AI, using AI proactively in coding, automating tasks, improving documentation, and accelerating problem-solving. You help raise the overall AI maturity across DNB by demonstrating what AI-first engineering looks like.
