Cloud Infrastructure & Agentic Architect

thinkproject

Utrecht, Netherlands

1 month ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Utrecht, Netherlands

Tech stack

Agile Methodologies

Artificial Intelligence

Computing Platforms

Azure

Software as a Service

Cloud Computing

Cloud Engineering

Computer Security

DevOps

Virtual Private Networks (VPN)

Network Security

Platform as a Service (PAAS)

Ansible

Runbook

Data Logging

Google Cloud Platform

Cloud Platform System

Delivery Pipeline

Large Language Models

Firewalls (Computer Science)

Amazon Web Services (AWS)

Data Layers

Kubernetes

Deployment Automation

Google Cloud Functions

Data Analytics

Azure

Terraform

Dynatrace

Serverless Computing

Job description

We are looking for a Cloud Infrastructure & Agentic Architect (f/m/d) to own the technical foundation of our cloud platform. You will maintain our service catalog, design architectural blueprints, establish the observability standard, and - most distinctively - bring hands-on LLM-driven tooling skills to a team actively shaping what the next generation of cloud operations looks like., * Build and maintain a minimalistic, opinionated service catalog of approved cloud components across Google Cloud and Azure

Apply a serverless-first, PaaS-first philosophy - challenge complexity and push back on unnecessary infrastructure sprawl
For every approved catalog entry, deliver production-ready configuration: IaC, security baseline,observability hooks, and runbook
Scrutinise every new component request and justify its addition in terms of cost, operational overhead, and platform alignment

Cloud Architecture

Define, document, and govern cloud architectural blueprints across networking, compute, storage and data layers
Design event-driven pipelines that trigger AI-assisted validation, drift detection, and deployment gates
Serve as the technical authority on platform design decisions within your domain

LLM-Driven Operations

Implement LLM-based automated deployment capabilities using tools such as OpenCode, Claude Code, or equivalent frameworks
Design and operate infrastructure workflows augmented by AI agents - from deployment validation to configuration drift detection
Stay ahead of the market in AI-assisted tooling and bring relevant innovations into the platform

Observability

Define the observability standard: structured logging, distributed tracing, alerting, dashboards, and SLO/SLA frameworks
Establish platform-level KPIs and ensure consistent adoption across engineering teams

Collaboration

Partner with DevOps and Engineering teams to embed platform standards into delivery pipelines
Partner with SecOps to integrate all controls and security requirements
Document everything: blueprints, ADRs, runbooks, onboarding guides, Month 3: Service catalog baseline documented; at least 10 approved components with full IaC, security baseline, and runbooks in place Month 6: Observability standard adopted by 8+ engineering teams; LLM-assisted deployment workflows running in production Month 12: Platform architecture ADRs covering all major components; FinOps baseline established; catalog governance process self-sustaining

You're probably NOT a fit if

You need a large team to delegate to before making decisions
You prefer deep specialisation in a single cloud over breadth across two
You're not actively following the LLM/AI tooling space
You're looking for a role with clearly defined scope and minimal ambiguity

What we offer

Multicultural team across 7 countries (Germany, France, UK, UAE, Spain, New Zealand, Australia)
Hybrid-first working
Continuous learning & certification budget (GCP / Azure certifications fully sponsored)
Open, inclusive, and high-ownership culture

Requirements

You have operated multi-cloud Kubernetes environments in production, built IaC pipelines with Terraform and have hands-on experience with LLM-driven tooling in real engineering workflows. Specifically:

Proven cloud networking experience: firewalls, gateways, VPCs, VPNs, private service connect, and network security groups
Strong, current proficiency across Google Cloud Platform (GKE, Cloud Run, Cloud Functions, VPC, Cloud Armor, Pub/Sub, Artifact Registry) and Microsoft Azure
Proven IaC experience with Terraform (required); Ansible a strong plus
Proven hands-on LLM tooling experience: must have used OpenCode, Claude Code, or equivalent AI-assisted coding/deployment agents in real engineering workflows
Kubernetes at scale: demonstrated experience operating AKS (Azure Kubernetes Service) and GKE (Google Kubernetes Engine) - cluster lifecycle, upgrades, networking, and workload management
Experience with Vertex AI, Azure AI Foundry, or similar cloud model gardens Comfortable working in ambiguity, driving clarity from first principles without heavy process support
Experience in Agile/fast-iteration environments with high individual ownership

Nice-to-Have (Bonus Skills)

Knowledge of C5, SecNumCloud, ISO 27001, or equivalent cloud security frameworks FinOps - cost attribution, rightsizing, commitment strategies
Prior experience in a SaaS product company at scale
Platform engineering / SRE background

Soft Skills

Intense technical curiosity - you follow the market, test new tools, and form opinions before they become mainstream
Strong problem-solving discipline: you reach for first principles, not familiarity
Ownership mindset - you build it, you run it, you improve it
Clear written and verbal communication: able to document decisions and explain them to non specialists
Operates well under pressure and tight constraints - energised by hard problems
Strong negotiation skills - fact-driven and data-driven: you argue with evidence, not opinion
Highly collaborative: you build alignment across engineering, SecOps, and leadership

Benefits & conditions

By combining information management expertise and in-depth knowledge of the building, infrastructure, and energy industries, Thinkproject empowers customers to efficiently deliver, operate, regenerate, and dispose of their built assets across their entire lifecycle through a Connected Data Ecosystem.

About the company

thinkproject was founded in 2000 in Munich, Germany. Since then, the company has grown into the leading provider for cross-enterprise collaboration and information management in Europe.

Global customers from the construction and engineering industries are served from thinkproject’s home base in Munich and via a range of subsidiaries across Europe.

thinkproject addresses today’s digitization challenges in construction and engineering by providing state-of-the-art software solutions as well as industry expert consulting and services.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all